International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38A9A

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󊚀
CA680
󊚁
CA681
󊚂
CA682
󊚃
CA683
󊚄
CA684
󊚅
CA685
󊚆
CA686
󊚇
CA687
󊚈
CA688
󊚉
CA689
󊚊
CA68A
󊚋
CA68B
󊚌
CA68C
󊚍
CA68D
󊚎
CA68E
󊚏
CA68F
80
90
󊚐
CA690
󊚑
CA691
󊚒
CA692
󊚓
CA693
󊚔
CA694
󊚕
CA695
󊚖
CA696
󊚗
CA697
󊚘
CA698
󊚙
CA699
󊚚
CA69A
󊚛
CA69B
󊚜
CA69C
󊚝
CA69D
󊚞
CA69E
󊚟
CA69F
90
A0
󊚠
CA6A0
󊚡
CA6A1
󊚢
CA6A2
󊚣
CA6A3
󊚤
CA6A4
󊚥
CA6A5
󊚦
CA6A6
󊚧
CA6A7
󊚨
CA6A8
󊚩
CA6A9
󊚪
CA6AA
󊚫
CA6AB
󊚬
CA6AC
󊚭
CA6AD
󊚮
CA6AE
󊚯
CA6AF
A0
B0
󊚰
CA6B0
󊚱
CA6B1
󊚲
CA6B2
󊚳
CA6B3
󊚴
CA6B4
󊚵
CA6B5
󊚶
CA6B6
󊚷
CA6B7
󊚸
CA6B8
󊚹
CA6B9
󊚺
CA6BA
󊚻
CA6BB
󊚼
CA6BC
󊚽
CA6BD
󊚾
CA6BE
󊚿
CA6BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]