International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3849A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󄚀
C4680
󄚁
C4681
󄚂
C4682
󄚃
C4683
󄚄
C4684
󄚅
C4685
󄚆
C4686
󄚇
C4687
󄚈
C4688
󄚉
C4689
󄚊
C468A
󄚋
C468B
󄚌
C468C
󄚍
C468D
󄚎
C468E
󄚏
C468F
80
90
󄚐
C4690
󄚑
C4691
󄚒
C4692
󄚓
C4693
󄚔
C4694
󄚕
C4695
󄚖
C4696
󄚗
C4697
󄚘
C4698
󄚙
C4699
󄚚
C469A
󄚛
C469B
󄚜
C469C
󄚝
C469D
󄚞
C469E
󄚟
C469F
90
A0
󄚠
C46A0
󄚡
C46A1
󄚢
C46A2
󄚣
C46A3
󄚤
C46A4
󄚥
C46A5
󄚦
C46A6
󄚧
C46A7
󄚨
C46A8
󄚩
C46A9
󄚪
C46AA
󄚫
C46AB
󄚬
C46AC
󄚭
C46AD
󄚮
C46AE
󄚯
C46AF
A0
B0
󄚰
C46B0
󄚱
C46B1
󄚲
C46B2
󄚳
C46B3
󄚴
C46B4
󄚵
C46B5
󄚶
C46B6
󄚷
C46B7
󄚸
C46B8
󄚹
C46B9
󄚺
C46BA
󄚻
C46BB
󄚼
C46BC
󄚽
C46BD
󄚾
C46BE
󄚿
C46BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]