International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F384AC

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󄬀
C4B00
󄬁
C4B01
󄬂
C4B02
󄬃
C4B03
󄬄
C4B04
󄬅
C4B05
󄬆
C4B06
󄬇
C4B07
󄬈
C4B08
󄬉
C4B09
󄬊
C4B0A
󄬋
C4B0B
󄬌
C4B0C
󄬍
C4B0D
󄬎
C4B0E
󄬏
C4B0F
80
90
󄬐
C4B10
󄬑
C4B11
󄬒
C4B12
󄬓
C4B13
󄬔
C4B14
󄬕
C4B15
󄬖
C4B16
󄬗
C4B17
󄬘
C4B18
󄬙
C4B19
󄬚
C4B1A
󄬛
C4B1B
󄬜
C4B1C
󄬝
C4B1D
󄬞
C4B1E
󄬟
C4B1F
90
A0
󄬠
C4B20
󄬡
C4B21
󄬢
C4B22
󄬣
C4B23
󄬤
C4B24
󄬥
C4B25
󄬦
C4B26
󄬧
C4B27
󄬨
C4B28
󄬩
C4B29
󄬪
C4B2A
󄬫
C4B2B
󄬬
C4B2C
󄬭
C4B2D
󄬮
C4B2E
󄬯
C4B2F
A0
B0
󄬰
C4B30
󄬱
C4B31
󄬲
C4B32
󄬳
C4B33
󄬴
C4B34
󄬵
C4B35
󄬶
C4B36
󄬷
C4B37
󄬸
C4B38
󄬹
C4B39
󄬺
C4B3A
󄬻
C4B3B
󄬼
C4B3C
󄬽
C4B3D
󄬾
C4B3E
󄬿
C4B3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]