International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
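
As an illustration of how an alias list like this is typically used, the sketch below maps each alias back to the internal converter name. The alias strings are copied from the list above; the helper `canonical_converter_name` and the simple case-insensitive matching are assumptions made for this example, not ICU's own lookup code.

```python
# Aliases copied from the list above. The lookup helper is hypothetical
# and only approximates alias matching with a case-insensitive compare.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Map each lowercased alias to the internal converter name.
ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_converter_name(name):
    """Return the internal converter name for a known alias, else None."""
    return ALIAS_TO_CANONICAL.get(name.strip().lower())


if __name__ == "__main__":
    for candidate in ("IBM-1208", "windows-65001", "latin-1"):
        print(candidate, "->", canonical_converter_name(candidate))
```

A real application would normally ask the conversion library to resolve the name rather than keep its own table; the dictionary here only makes the alias-to-name relationship explicit.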


Codepage Layout

Currently showing the codepage starting with the bytes F3 8B 9A.

Row labels give the high nibble of the fourth byte and column labels the low nibble. Each cell is the Unicode code point (hex) for that four-byte sequence. Rows 00-70 and C0-F0 are empty.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     CB680 CB681 CB682 CB683 CB684 CB685 CB686 CB687 CB688 CB689 CB68A CB68B CB68C CB68D CB68E CB68F
90     CB690 CB691 CB692 CB693 CB694 CB695 CB696 CB697 CB698 CB699 CB69A CB69B CB69C CB69D CB69E CB69F
A0     CB6A0 CB6A1 CB6A2 CB6A3 CB6A4 CB6A5 CB6A6 CB6A7 CB6A8 CB6A9 CB6AA CB6AB CB6AC CB6AD CB6AE CB6AF
B0     CB6B0 CB6B1 CB6B2 CB6B3 CB6B4 CB6B5 CB6B6 CB6B7 CB6B8 CB6B9 CB6BA CB6BB CB6BC CB6BD CB6BE CB6BF
C0-F0  (empty)
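
The code points in the layout follow from the standard UTF-8 bit layout for four-byte sequences: 3 payload bits in the lead byte and 6 in each continuation byte. The sketch below recomputes the first and last cells for the lead bytes F3 8B 9A. The function name `decode_4byte_utf8` is chosen for this example, and it assumes its input is already a well-formed four-byte sequence (it does no validation).

```python
def decode_4byte_utf8(b0, b1, b2, b3):
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point.

    Assumes the bytes are well formed: b0 is 11110xxx and b1..b3 are 10xxxxxx.
    """
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# Continuation bytes are limited to 0x80..0xBF, which is why only the
# rows 80, 90, A0 and B0 of the layout are populated.
first = decode_4byte_utf8(0xF3, 0x8B, 0x9A, 0x80)
last = decode_4byte_utf8(0xF3, 0x8B, 0x9A, 0xBF)

# Cross-check the arithmetic against Python's built-in UTF-8 codec.
builtin_first = ord(bytes([0xF3, 0x8B, 0x9A, 0x80]).decode("utf-8"))

if __name__ == "__main__":
    print(f"{first:X} {last:X} {builtin_first:X}")  # CB680 CB6BF CB680
```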

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
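
The byte counts above are per UChar, that is, per 16-bit UTF-16 code unit, not per code point. A supplementary character takes four UTF-8 bytes but two UTF-16 code units, so the ratio never exceeds 3. The sketch below checks this with Python's built-in codecs and also shows that the substitution bytes decode to U+FFFD. The helper name and the sample characters are chosen for this example.

```python
def utf8_bytes_per_utf16_unit(ch):
    """Return UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00E9", "\u20AC", "\U000CB680"]
ratios = [utf8_bytes_per_utf16_unit(ch) for ch in samples]

# The substitution character bytes are the UTF-8 form of U+FFFD.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

if __name__ == "__main__":
    print(ratios)                       # [1.0, 2.0, 3.0, 2.0]
    print(f"U+{ord(substitution):04X}")  # U+FFFD
```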

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
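
The set above excludes only the surrogate range U+D800 to U+DFFF. The sketch below expresses that rule as a predicate and compares it with what Python's built-in UTF-8 encoder accepts. Both function names are chosen for this example, and the upper bound 0x10FFFF is the Unicode code space limit rather than something stated on this page.

```python
def is_representable(cp):
    """True if the code point is in the Unicode range and is not a surrogate."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encoder_accepts(cp):
    """True if Python's strict UTF-8 encoder can encode the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary cases around the surrogate block, plus one code point from the layout.
boundary_cases = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xCB680]

if __name__ == "__main__":
    for cp in boundary_cases:
        print(f"U+{cp:04X}", is_representable(cp), encoder_accepts(cp))
```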