International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3AC82

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󬂀
EC080
󬂁
EC081
󬂂
EC082
󬂃
EC083
󬂄
EC084
󬂅
EC085
󬂆
EC086
󬂇
EC087
󬂈
EC088
󬂉
EC089
󬂊
EC08A
󬂋
EC08B
󬂌
EC08C
󬂍
EC08D
󬂎
EC08E
󬂏
EC08F
80
90
󬂐
EC090
󬂑
EC091
󬂒
EC092
󬂓
EC093
󬂔
EC094
󬂕
EC095
󬂖
EC096
󬂗
EC097
󬂘
EC098
󬂙
EC099
󬂚
EC09A
󬂛
EC09B
󬂜
EC09C
󬂝
EC09D
󬂞
EC09E
󬂟
EC09F
90
A0
󬂠
EC0A0
󬂡
EC0A1
󬂢
EC0A2
󬂣
EC0A3
󬂤
EC0A4
󬂥
EC0A5
󬂦
EC0A6
󬂧
EC0A7
󬂨
EC0A8
󬂩
EC0A9
󬂪
EC0AA
󬂫
EC0AB
󬂬
EC0AC
󬂭
EC0AD
󬂮
EC0AE
󬂯
EC0AF
A0
B0
󬂰
EC0B0
󬂱
EC0B1
󬂲
EC0B2
󬂳
EC0B3
󬂴
EC0B4
󬂵
EC0B5
󬂶
EC0B6
󬂷
EC0B7
󬂸
EC0B8
󬂹
EC0B9
󬂺
EC0BA
󬂻
EC0BB
󬂼
EC0BC
󬂽
EC0BD
󬂾
EC0BE
󬂿
EC0BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]