International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F191AC

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񑬀
51B00
񑬁
51B01
񑬂
51B02
񑬃
51B03
񑬄
51B04
񑬅
51B05
񑬆
51B06
񑬇
51B07
񑬈
51B08
񑬉
51B09
񑬊
51B0A
񑬋
51B0B
񑬌
51B0C
񑬍
51B0D
񑬎
51B0E
񑬏
51B0F
80
90
񑬐
51B10
񑬑
51B11
񑬒
51B12
񑬓
51B13
񑬔
51B14
񑬕
51B15
񑬖
51B16
񑬗
51B17
񑬘
51B18
񑬙
51B19
񑬚
51B1A
񑬛
51B1B
񑬜
51B1C
񑬝
51B1D
񑬞
51B1E
񑬟
51B1F
90
A0
񑬠
51B20
񑬡
51B21
񑬢
51B22
񑬣
51B23
񑬤
51B24
񑬥
51B25
񑬦
51B26
񑬧
51B27
񑬨
51B28
񑬩
51B29
񑬪
51B2A
񑬫
51B2B
񑬬
51B2C
񑬭
51B2D
񑬮
51B2E
񑬯
51B2F
A0
B0
񑬰
51B30
񑬱
51B31
񑬲
51B32
񑬳
51B33
񑬴
51B34
񑬵
51B35
񑬶
51B36
񑬷
51B37
񑬸
51B38
񑬹
51B39
񑬺
51B3A
񑬻
51B3B
񑬼
51B3C
񑬽
51B3D
񑬾
51B3E
񑬿
51B3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]