International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18AA4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񊤀
4A900
񊤁
4A901
񊤂
4A902
񊤃
4A903
񊤄
4A904
񊤅
4A905
񊤆
4A906
񊤇
4A907
񊤈
4A908
񊤉
4A909
񊤊
4A90A
񊤋
4A90B
񊤌
4A90C
񊤍
4A90D
񊤎
4A90E
񊤏
4A90F
80
90
񊤐
4A910
񊤑
4A911
񊤒
4A912
񊤓
4A913
񊤔
4A914
񊤕
4A915
񊤖
4A916
񊤗
4A917
񊤘
4A918
񊤙
4A919
񊤚
4A91A
񊤛
4A91B
񊤜
4A91C
񊤝
4A91D
񊤞
4A91E
񊤟
4A91F
90
A0
񊤠
4A920
񊤡
4A921
񊤢
4A922
񊤣
4A923
񊤤
4A924
񊤥
4A925
񊤦
4A926
񊤧
4A927
񊤨
4A928
񊤩
4A929
񊤪
4A92A
񊤫
4A92B
񊤬
4A92C
񊤭
4A92D
񊤮
4A92E
񊤯
4A92F
A0
B0
񊤰
4A930
񊤱
4A931
񊤲
4A932
񊤳
4A933
񊤴
4A934
񊤵
4A935
񊤶
4A936
񊤷
4A937
񊤸
4A938
񊤹
4A939
񊤺
4A93A
񊤻
4A93B
񊤼
4A93C
񊤽
4A93D
񊤾
4A93E
񊤿
4A93F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]