International Components for Unicode


UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

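Currently showing the codepage for byte sequences starting with the bytes F4 8E A5. The row label is the high nibble and the column header is the low nibble of the fourth byte.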

       00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
00-70  (no mappings)
80     10E940 10E941 10E942 10E943 10E944 10E945 10E946 10E947 10E948 10E949 10E94A 10E94B 10E94C 10E94D 10E94E 10E94F
90     10E950 10E951 10E952 10E953 10E954 10E955 10E956 10E957 10E958 10E959 10E95A 10E95B 10E95C 10E95D 10E95E 10E95F
A0     10E960 10E961 10E962 10E963 10E964 10E965 10E966 10E967 10E968 10E969 10E96A 10E96B 10E96C 10E96D 10E96E 10E96F
B0     10E970 10E971 10E972 10E973 10E974 10E975 10E976 10E977 10E978 10E979 10E97A 10E97B 10E97C 10E97D 10E97E 10E97F
C0-F0  (no mappings)
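
The mapping in the layout above follows from the standard UTF-8 bit layout for four-byte sequences. The following is a minimal sketch in Python, not ICU code; the helper name `decode_utf8_4byte` is hypothetical and it skips the validity checks a real decoder performs. It reproduces the first and last code points of the page and cross-checks one of them against Python's built-in codec.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Combine the payload bits of one 4-byte UTF-8 sequence.

    Illustrative only: no checks for overlong forms, surrogates,
    or values above U+10FFFF.
    """
    return (
        ((b0 & 0x07) << 18)    # lead byte 11110xxx carries 3 bits
        | ((b1 & 0x3F) << 12)  # each trail byte 10xxxxxx carries 6 bits
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )

# The three leading bytes shown for this codepage view.
LEAD = (0xF4, 0x8E, 0xA5)

first = decode_utf8_4byte(*LEAD, 0x80)  # lowest valid trail byte
last = decode_utf8_4byte(*LEAD, 0xBF)   # highest valid trail byte

# Cross-check against Python's own UTF-8 decoder.
builtin = ord(bytes([*LEAD, 0x80]).decode("utf-8"))

print(f"{first:06X} {last:06X} {builtin:06X}")
```

Only fourth bytes in the range 80 to BF are valid trail bytes, which is why rows 00 to 70 and C0 to F0 are empty.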

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
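
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per code point. As a rough illustration in Python (not ICU code; the helper name `utf8_bytes_and_utf16_units` is hypothetical), the sketch below compares UTF-8 length with UTF-16 code unit count for a few sample characters and decodes the substitution bytes.

```python
def utf8_bytes_and_utf16_units(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units

ascii_case = utf8_bytes_and_utf16_units("A")           # U+0041
bmp_case = utf8_bytes_and_utf16_units("\u20ac")        # U+20AC, in the BMP
supp_case = utf8_bytes_and_utf16_units("\U0010E940")   # supplementary plane

# The substitution bytes listed above are the UTF-8 form of U+FFFD.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

print(ascii_case, bmp_case, supp_case, f"U+{ord(substitution):04X}")
```

A supplementary character takes four bytes but occupies two UTF-16 code units, so it averages two bytes per UChar; the maximum of three comes from BMP characters at U+0800 and above.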

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
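
The set above excludes only the surrogate code points U+D800 to U+DFFF. A small Python sketch (using Python's built-in strict UTF-8 codec as a stand-in, not the ICU converter; the helper name `representable` is hypothetical) shows the same boundary:

```python
def representable(code_point):
    """True if Python's strict UTF-8 codec can encode this code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, which have no UTF-8 encoding.
        return False

# Probe the code points on either side of the surrogate range.
samples = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
results = {f"U+{cp:04X}": representable(cp) for cp in samples}
print(results)
```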