International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
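
As an illustration of how such an alias table might be consulted, here is a minimal sketch in Python. The alias strings are copied from the list above; the `canonical_name` helper and its plain case-insensitive comparison are assumptions made for this example and are not ICU's own name-matching rules.

```python
# Aliases copied from the list above. The helper below is hypothetical
# and only illustrates a table lookup; it is not part of ICU.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def canonical_name(name):
    """Return "UTF-8" if `name` matches a listed alias, else None.

    Assumption: a simple case-insensitive comparison is enough here.
    """
    wanted = name.strip().lower()
    for alias in UTF8_ALIASES:
        if alias.lower() == wanted:
            return "UTF-8"
    return None
```

With this sketch, `canonical_name("CP1208")` resolves to the internal name, while a name that is not in the list returns `None`.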


Codepage Layout

Currently showing the codepage starting with the bytes F0B8A2

Rows give the high nibble of the final byte; the 16 cells in each row are columns 0 through F. Each cell shows the character followed by its code point in hex.

00-70  (empty)
80  𸢀 38880  𸢁 38881  𸢂 38882  𸢃 38883  𸢄 38884  𸢅 38885  𸢆 38886  𸢇 38887  𸢈 38888  𸢉 38889  𸢊 3888A  𸢋 3888B  𸢌 3888C  𸢍 3888D  𸢎 3888E  𸢏 3888F
90  𸢐 38890  𸢑 38891  𸢒 38892  𸢓 38893  𸢔 38894  𸢕 38895  𸢖 38896  𸢗 38897  𸢘 38898  𸢙 38899  𸢚 3889A  𸢛 3889B  𸢜 3889C  𸢝 3889D  𸢞 3889E  𸢟 3889F
A0  𸢠 388A0  𸢡 388A1  𸢢 388A2  𸢣 388A3  𸢤 388A4  𸢥 388A5  𸢦 388A6  𸢧 388A7  𸢨 388A8  𸢩 388A9  𸢪 388AA  𸢫 388AB  𸢬 388AC  𸢭 388AD  𸢮 388AE  𸢯 388AF
B0  𸢰 388B0  𸢱 388B1  𸢲 388B2  𸢳 388B3  𸢴 388B4  𸢵 388B5  𸢶 388B6  𸢷 388B7  𸢸 388B8  𸢹 388B9  𸢺 388BA  𸢻 388BB  𸢼 388BC  𸢽 388BD  𸢾 388BE  𸢿 388BF
C0-F0  (empty)
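
The relationship between the lead bytes F0 B8 A2, the final byte, and the code points in the layout can be checked by hand. The following Python sketch unpacks a four-byte UTF-8 sequence into its code point. The function name `decode_utf8_4byte` is made up for this example, and the check is deliberately simplified: it validates only the bit patterns of the four bytes, not overlong forms or the U+10FFFF upper limit.

```python
def decode_utf8_4byte(seq):
    """Unpack a 4-byte UTF-8 sequence into a code point.

    Simplified sketch: checks the 11110xxx lead byte and the three
    10xxxxxx trail bytes, but not overlong forms or the upper limit.
    """
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected a 4-byte sequence with lead byte F0-F7")
    for trail in seq[1:]:
        if (trail & 0xC0) != 0x80:
            raise ValueError("trail bytes must be in the range 80-BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


first = decode_utf8_4byte(bytes.fromhex("F0B8A280"))
last = decode_utf8_4byte(bytes.fromhex("F0B8A2BF"))
print(f"{first:X} {last:X}")
```

Because a trail byte must match the pattern 10xxxxxx, only final bytes 80 through BF yield characters, which is why rows 00-70 and C0-F0 of the layout are empty.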

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
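
Several of these properties can be spot-checked with Python's built-in `utf-8` codec, used here as a stand-in for the ICU converter. This is a sketch under that assumption: the helper name `utf8_and_utf16_lengths` is invented for the example, a UChar is taken to be one 16-bit UTF-16 code unit, and Python's `errors="replace"` handler is not ICU's substitution mechanism, although both end up at U+FFFD.

```python
def utf8_and_utf16_lengths(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


# One UChar can need as little as 1 byte and as much as 3 bytes.
ascii_case = utf8_and_utf16_lengths("A")               # ASCII letter
bmp_case = utf8_and_utf16_lengths("\u20AC")            # EURO SIGN
# A supplementary character takes 4 bytes but also 2 UChars.
supplementary_case = utf8_and_utf16_lengths("\U00038880")

# The substitution bytes EF BF BD are the UTF-8 form of U+FFFD.
substitution = "\uFFFD".encode("utf-8")

# ASCII compatibility: 0x20-0x7E encode to the same single byte values.
ascii_compatible = all(
    chr(b).encode("utf-8") == bytes([b]) for b in range(0x20, 0x7F)
)

# Python's replacement handler maps an invalid byte to U+FFFD on decode.
replaced = b"\xFF".decode("utf-8", errors="replace")
```

The supplementary case shows why the maximum is 3 rather than 4: four bytes spread over two UTF-16 code units average to two bytes per UChar.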

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
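
The set above says that every code point except the surrogates U+D800 through U+DFFF is representable. A minimal Python sketch of that membership test follows. The names `is_representable` and `python_can_encode` are assumptions for this example, and the comparison uses Python's strict `utf-8` codec rather than ICU.

```python
def is_representable(cp):
    """True if code point `cp` is in the set [^\\uD800-\\uDFFF]."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp):
    """True if Python's strict UTF-8 encoder accepts code point `cp`."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Spot-check the boundaries of the surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x38880, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp), python_can_encode(cp))
```

Surrogates are excluded because they are reserved for UTF-16 pairing and are not valid on their own in well-formed UTF-8.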