International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B5AC

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𵬀
35B00
𵬁
35B01
𵬂
35B02
𵬃
35B03
𵬄
35B04
𵬅
35B05
𵬆
35B06
𵬇
35B07
𵬈
35B08
𵬉
35B09
𵬊
35B0A
𵬋
35B0B
𵬌
35B0C
𵬍
35B0D
𵬎
35B0E
𵬏
35B0F
80
90
𵬐
35B10
𵬑
35B11
𵬒
35B12
𵬓
35B13
𵬔
35B14
𵬕
35B15
𵬖
35B16
𵬗
35B17
𵬘
35B18
𵬙
35B19
𵬚
35B1A
𵬛
35B1B
𵬜
35B1C
𵬝
35B1D
𵬞
35B1E
𵬟
35B1F
90
A0
𵬠
35B20
𵬡
35B21
𵬢
35B22
𵬣
35B23
𵬤
35B24
𵬥
35B25
𵬦
35B26
𵬧
35B27
𵬨
35B28
𵬩
35B29
𵬪
35B2A
𵬫
35B2B
𵬬
35B2C
𵬭
35B2D
𵬮
35B2E
𵬯
35B2F
A0
B0
𵬰
35B30
𵬱
35B31
𵬲
35B32
𵬳
35B33
𵬴
35B34
𵬵
35B35
𵬶
35B36
𵬷
35B37
𵬸
35B38
𵬹
35B39
𵬺
35B3A
𵬻
35B3B
𵬼
35B3C
𵬽
35B3D
𵬾
35B3E
𵬿
35B3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]