International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1809A

Cells give the Unicode code point (hex); rows 00-70 and C0-F0 contain no entries.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     40680 40681 40682 40683 40684 40685 40686 40687 40688 40689 4068A 4068B 4068C 4068D 4068E 4068F
90     40690 40691 40692 40693 40694 40695 40696 40697 40698 40699 4069A 4069B 4069C 4069D 4069E 4069F
A0     406A0 406A1 406A2 406A3 406A4 406A5 406A6 406A7 406A8 406A9 406AA 406AB 406AC 406AD 406AE 406AF
B0     406B0 406B1 406B2 406B3 406B4 406B5 406B6 406B7 406B8 406B9 406BA 406BB 406BC 406BD 406BE 406BF
C0-F0  (empty)
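
The code points in this layout follow from the bit pattern of a four-byte UTF-8 sequence. The following is a minimal sketch in Python, not ICU code; the helper name `decode_four_byte` is hypothetical. It combines the lead bytes F1 80 9A with each possible final byte and cross-checks the result against Python's built-in decoder.

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence.

    Bit layout: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    """
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


prefix = bytes.fromhex("F1809A")

# Only trail bytes 0x80..0xBF are valid, which is why rows 00-70 and C0-F0 are empty.
table = {b3: decode_four_byte(*prefix, b3) for b3 in range(0x80, 0xC0)}

# Cross-check the arithmetic against Python's built-in UTF-8 decoder.
for b3, cp in table.items():
    assert ord((prefix + bytes([b3])).decode("utf-8")) == cp

print(f"{table[0x80]:X} .. {table[0xBF]:X}")  # prints 40680 .. 406BF
```

The 64 valid final bytes map to the 64 consecutive code points U+40680 through U+406BF, matching rows 80 through B0.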

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
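
Two of these properties can be checked by hand. The sketch below uses Python's standard codecs rather than ICU, and the helper `bytes_per_uchar` is a hypothetical name. It assumes a UChar is one UTF-16 code unit, so a supplementary character (two UChars, four UTF-8 bytes) averages two bytes per UChar and the per-UChar maximum of 3 comes from the upper part of the BMP.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for one character."""
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return len(ch.encode("utf-8")) / utf16_units


# The substitution bytes \xEF\xBF\xBD are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

# Python's decoder substitutes the same character for malformed input when asked to.
replaced = b"\xFF".decode("utf-8", errors="replace")

print(bytes_per_uchar("A"))           # 1.0
print(bytes_per_uchar("\uFFFD"))      # 3.0
print(bytes_per_uchar("\U00040680"))  # 2.0
```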

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
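
The set above excludes only the surrogate range. As a rough illustration using Python's strict UTF-8 encoder (an assumption: it stands in for the ICU converter here, and `representable` is a hypothetical helper), the boundaries of that range can be probed directly.

```python
def representable(code_point: int) -> bool:
    """Return True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates U+D800..U+DFFF.
        return False


boundaries = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF]
print([representable(cp) for cp in boundaries])  # prints [True, False, False, True, True]
```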