International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18CA4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񌤀
4C900
񌤁
4C901
񌤂
4C902
񌤃
4C903
񌤄
4C904
񌤅
4C905
񌤆
4C906
񌤇
4C907
񌤈
4C908
񌤉
4C909
񌤊
4C90A
񌤋
4C90B
񌤌
4C90C
񌤍
4C90D
񌤎
4C90E
񌤏
4C90F
80
90
񌤐
4C910
񌤑
4C911
񌤒
4C912
񌤓
4C913
񌤔
4C914
񌤕
4C915
񌤖
4C916
񌤗
4C917
񌤘
4C918
񌤙
4C919
񌤚
4C91A
񌤛
4C91B
񌤜
4C91C
񌤝
4C91D
񌤞
4C91E
񌤟
4C91F
90
A0
񌤠
4C920
񌤡
4C921
񌤢
4C922
񌤣
4C923
񌤤
4C924
񌤥
4C925
񌤦
4C926
񌤧
4C927
񌤨
4C928
񌤩
4C929
񌤪
4C92A
񌤫
4C92B
񌤬
4C92C
񌤭
4C92D
񌤮
4C92E
񌤯
4C92F
A0
B0
񌤰
4C930
񌤱
4C931
񌤲
4C932
񌤳
4C933
񌤴
4C934
񌤵
4C935
񌤶
4C936
񌤷
4C937
񌤸
4C938
񌤹
4C939
񌤺
4C93A
񌤻
4C93B
񌤼
4C93C
񌤽
4C93D
񌤾
4C93E
񌤿
4C93F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]