International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48CAC

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􌬀
10CB00
􌬁
10CB01
􌬂
10CB02
􌬃
10CB03
􌬄
10CB04
􌬅
10CB05
􌬆
10CB06
􌬇
10CB07
􌬈
10CB08
􌬉
10CB09
􌬊
10CB0A
􌬋
10CB0B
􌬌
10CB0C
􌬍
10CB0D
􌬎
10CB0E
􌬏
10CB0F
80
90
􌬐
10CB10
􌬑
10CB11
􌬒
10CB12
􌬓
10CB13
􌬔
10CB14
􌬕
10CB15
􌬖
10CB16
􌬗
10CB17
􌬘
10CB18
􌬙
10CB19
􌬚
10CB1A
􌬛
10CB1B
􌬜
10CB1C
􌬝
10CB1D
􌬞
10CB1E
􌬟
10CB1F
90
A0
􌬠
10CB20
􌬡
10CB21
􌬢
10CB22
􌬣
10CB23
􌬤
10CB24
􌬥
10CB25
􌬦
10CB26
􌬧
10CB27
􌬨
10CB28
􌬩
10CB29
􌬪
10CB2A
􌬫
10CB2B
􌬬
10CB2C
􌬭
10CB2D
􌬮
10CB2E
􌬯
10CB2F
A0
B0
􌬰
10CB30
􌬱
10CB31
􌬲
10CB32
􌬳
10CB33
􌬴
10CB34
􌬵
10CB35
􌬶
10CB36
􌬷
10CB37
􌬸
10CB38
􌬹
10CB39
􌬺
10CB3A
􌬻
10CB3B
􌬼
10CB3C
􌬽
10CB3D
􌬾
10CB3E
􌬿
10CB3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]