International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48CA5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􌥀
10C940
􌥁
10C941
􌥂
10C942
􌥃
10C943
􌥄
10C944
􌥅
10C945
􌥆
10C946
􌥇
10C947
􌥈
10C948
􌥉
10C949
􌥊
10C94A
􌥋
10C94B
􌥌
10C94C
􌥍
10C94D
􌥎
10C94E
􌥏
10C94F
80
90
􌥐
10C950
􌥑
10C951
􌥒
10C952
􌥓
10C953
􌥔
10C954
􌥕
10C955
􌥖
10C956
􌥗
10C957
􌥘
10C958
􌥙
10C959
􌥚
10C95A
􌥛
10C95B
􌥜
10C95C
􌥝
10C95D
􌥞
10C95E
􌥟
10C95F
90
A0
􌥠
10C960
􌥡
10C961
􌥢
10C962
􌥣
10C963
􌥤
10C964
􌥥
10C965
􌥦
10C966
􌥧
10C967
􌥨
10C968
􌥩
10C969
􌥪
10C96A
􌥫
10C96B
􌥬
10C96C
􌥭
10C96D
􌥮
10C96E
􌥯
10C96F
A0
B0
􌥰
10C970
􌥱
10C971
􌥲
10C972
􌥳
10C973
􌥴
10C974
􌥵
10C975
􌥶
10C976
􌥷
10C977
􌥸
10C978
􌥹
10C979
􌥺
10C97A
􌥻
10C97B
􌥼
10C97C
􌥽
10C97D
􌥾
10C97E
􌥿
10C97F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]