International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B597

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󵗀
F55C0
󵗁
F55C1
󵗂
F55C2
󵗃
F55C3
󵗄
F55C4
󵗅
F55C5
󵗆
F55C6
󵗇
F55C7
󵗈
F55C8
󵗉
F55C9
󵗊
F55CA
󵗋
F55CB
󵗌
F55CC
󵗍
F55CD
󵗎
F55CE
󵗏
F55CF
80
90
󵗐
F55D0
󵗑
F55D1
󵗒
F55D2
󵗓
F55D3
󵗔
F55D4
󵗕
F55D5
󵗖
F55D6
󵗗
F55D7
󵗘
F55D8
󵗙
F55D9
󵗚
F55DA
󵗛
F55DB
󵗜
F55DC
󵗝
F55DD
󵗞
F55DE
󵗟
F55DF
90
A0
󵗠
F55E0
󵗡
F55E1
󵗢
F55E2
󵗣
F55E3
󵗤
F55E4
󵗥
F55E5
󵗦
F55E6
󵗧
F55E7
󵗨
F55E8
󵗩
F55E9
󵗪
F55EA
󵗫
F55EB
󵗬
F55EC
󵗭
F55ED
󵗮
F55EE
󵗯
F55EF
A0
B0
󵗰
F55F0
󵗱
F55F1
󵗲
F55F2
󵗳
F55F3
󵗴
F55F4
󵗵
F55F5
󵗶
F55F6
󵗷
F55F7
󵗸
F55F8
󵗹
F55F9
󵗺
F55FA
󵗻
F55FB
󵗼
F55FC
󵗽
F55FD
󵗾
F55FE
󵗿
F55FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]