International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
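
To illustrate how an alias list like this is typically used, here is a minimal Python sketch that resolves any of the aliases listed above to the internal converter name. The helper names (`_normalize`, `canonical_name`) are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption for illustration, not a description of ICU's own lookup code.

```python
# Aliases copied from the list above; everything else here is illustrative.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def _normalize(name: str) -> str:
    """Lower-case the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {_normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(_normalize(name))
```

With this table, `canonical_name("IBM_1208")` and `canonical_name("windows-65001")` both resolve to `"UTF-8"`, while a name that is not in the list returns `None`.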


Codepage Layout

Currently showing the codepage starting with the bytes F1 8C 97

The row label plus the column label gives the final byte; each cell is the code point (hex) of the character encoded by the full four-byte sequence. Rows 00-70 and C0-F0 contain no characters.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  4C5C0 4C5C1 4C5C2 4C5C3 4C5C4 4C5C5 4C5C6 4C5C7 4C5C8 4C5C9 4C5CA 4C5CB 4C5CC 4C5CD 4C5CE 4C5CF
90  4C5D0 4C5D1 4C5D2 4C5D3 4C5D4 4C5D5 4C5D6 4C5D7 4C5D8 4C5D9 4C5DA 4C5DB 4C5DC 4C5DD 4C5DE 4C5DF
A0  4C5E0 4C5E1 4C5E2 4C5E3 4C5E4 4C5E5 4C5E6 4C5E7 4C5E8 4C5E9 4C5EA 4C5EB 4C5EC 4C5ED 4C5EE 4C5EF
B0  4C5F0 4C5F1 4C5F2 4C5F3 4C5F4 4C5F5 4C5F6 4C5F7 4C5F8 4C5F9 4C5FA 4C5FB 4C5FC 4C5FD 4C5FE 4C5FF
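
The code points in the layout above can be reproduced from the bytes. The sketch below uses Python's built-in UTF-8 codec rather than ICU, and also decodes by hand from the four-byte bit pattern `11110xxx 10xxxxxx 10xxxxxx 10xxxxxx`. The function names are hypothetical.

```python
LEAD_BYTES = bytes([0xF1, 0x8C, 0x97])  # the three-byte prefix shown by the page


def decode_cell(trail: int) -> int:
    """Decode LEAD_BYTES plus one final byte and return the code point."""
    text = (LEAD_BYTES + bytes([trail])).decode("utf-8")
    return ord(text)


def decode_by_hand(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence (no validation)."""
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


def is_valid_final_byte(trail: int) -> bool:
    """True if LEAD_BYTES plus this final byte is well-formed UTF-8."""
    try:
        decode_cell(trail)
        return True
    except UnicodeDecodeError:
        return False


first = decode_cell(0x80)
last = decode_cell(0xBF)
print(f"{first:X}..{last:X}")  # prints "4C5C0..4C5FF"
```

Only final bytes 80 through BF are valid continuation bytes, which is why only rows 80 to B0 of the grid are populated.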

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
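
Two of these properties can be checked with a short sketch. In ICU a UChar is a 16-bit UTF-16 code unit, so one plausible reading of the "bytes per UChar" figures is bytes of UTF-8 per UTF-16 code unit: a supplementary character takes four UTF-8 bytes but two UChars. The code below uses Python's built-in codecs, not ICU, and the helper name is hypothetical.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 code unit count."""
    n_bytes = len(ch.encode("utf-8"))
    n_units = len(ch.encode("utf-16-le")) // 2  # two bytes per UTF-16 code unit
    return n_bytes / n_units


# The substitution bytes are the UTF-8 encoding of U+FFFD REPLACEMENT CHARACTER.
SUBSTITUTION = b"\xEF\xBF\xBD"
replacement_char = SUBSTITUTION.decode("utf-8")

# Python's own "replace" error handler also emits U+FFFD for an invalid byte.
# This shows Python's codec behaviour only, not ICU's callback mechanism.
repaired = b"abc\xFFdef".decode("utf-8", errors="replace")

ratios = {
    "A": utf8_bytes_per_utf16_unit("A"),                     # ASCII
    "U+20AC": utf8_bytes_per_utf16_unit("\u20AC"),           # BMP, three bytes
    "U+4C5C0": utf8_bytes_per_utf16_unit(chr(0x4C5C0)),      # supplementary
}
```

Under this reading, the ratio is 1 for ASCII, 3 for a three-byte BMP character, and 2 for a supplementary character, which matches the reported minimum of 1 and maximum of 3.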

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
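
The set above excludes only the surrogate range U+D800 to U+DFFF. As a rough check, the sketch below compares that rule with what Python's built-in UTF-8 encoder accepts; it does not call ICU, and both function names are hypothetical.

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes_in_utf8(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts this code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary samples: around the surrogate block and at both ends of the code space.
SAMPLES = [0x0000, 0x007F, 0xD7FF, 0xD800, 0xDBFF, 0xDC00, 0xDFFF, 0xE000, 0x4C5C0, 0x10FFFF]
agreement = all(is_representable(cp) == encodes_in_utf8(cp) for cp in SAMPLES)
```

Lone surrogates are rejected because they are reserved for UTF-16 surrogate pairs and are not Unicode scalar values, so they have no well-formed UTF-8 encoding.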