International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48BA4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􋤀
10B900
􋤁
10B901
􋤂
10B902
􋤃
10B903
􋤄
10B904
􋤅
10B905
􋤆
10B906
􋤇
10B907
􋤈
10B908
􋤉
10B909
􋤊
10B90A
􋤋
10B90B
􋤌
10B90C
􋤍
10B90D
􋤎
10B90E
􋤏
10B90F
80
90
􋤐
10B910
􋤑
10B911
􋤒
10B912
􋤓
10B913
􋤔
10B914
􋤕
10B915
􋤖
10B916
􋤗
10B917
􋤘
10B918
􋤙
10B919
􋤚
10B91A
􋤛
10B91B
􋤜
10B91C
􋤝
10B91D
􋤞
10B91E
􋤟
10B91F
90
A0
􋤠
10B920
􋤡
10B921
􋤢
10B922
􋤣
10B923
􋤤
10B924
􋤥
10B925
􋤦
10B926
􋤧
10B927
􋤨
10B928
􋤩
10B929
􋤪
10B92A
􋤫
10B92B
􋤬
10B92C
􋤭
10B92D
􋤮
10B92E
􋤯
10B92F
A0
B0
􋤰
10B930
􋤱
10B931
􋤲
10B932
􋤳
10B933
􋤴
10B934
􋤵
10B935
􋤶
10B936
􋤷
10B937
􋤸
10B938
􋤹
10B939
􋤺
10B93A
􋤻
10B93B
􋤼
10B93C
􋤽
10B93D
􋤾
10B93E
􋤿
10B93F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]