International Components for Unicode


UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the section of the codepage whose byte sequences start with F4 8A 9A

Rows give the high nibble of the final byte and columns the low nibble. Each cell is the Unicode code point (hex) of the resulting character.

    00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80  10A680 10A681 10A682 10A683 10A684 10A685 10A686 10A687 10A688 10A689 10A68A 10A68B 10A68C 10A68D 10A68E 10A68F
90  10A690 10A691 10A692 10A693 10A694 10A695 10A696 10A697 10A698 10A699 10A69A 10A69B 10A69C 10A69D 10A69E 10A69F
A0  10A6A0 10A6A1 10A6A2 10A6A3 10A6A4 10A6A5 10A6A6 10A6A7 10A6A8 10A6A9 10A6AA 10A6AB 10A6AC 10A6AD 10A6AE 10A6AF
B0  10A6B0 10A6B1 10A6B2 10A6B3 10A6B4 10A6B5 10A6B6 10A6B7 10A6B8 10A6B9 10A6BA 10A6BB 10A6BC 10A6BD 10A6BE 10A6BF

Rows 00 through 70 and C0 through F0 contain no entries.
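
The layout can be reproduced outside the demo. The sketch below is illustrative only: it uses Python's built-in UTF-8 codec rather than ICU, and the names `PREFIX`, `decode_cell`, and `decode_by_hand` are hypothetical helpers introduced here. It appends one trail byte to the prefix F4 8A 9A and decodes the four-byte sequence, once with the codec and once with the UTF-8 bit layout applied by hand.

```python
# Fixed three-byte prefix shown by the demo; each cell adds one trail byte.
PREFIX = bytes.fromhex("F48A9A")


def decode_cell(trail: int) -> int:
    """Return the code point encoded by PREFIX followed by one trail byte."""
    return ord((PREFIX + bytes([trail])).decode("utf-8"))


def decode_by_hand(seq: bytes) -> int:
    """Decode a four-byte UTF-8 sequence from its bit layout.

    The lead byte carries 3 payload bits, each continuation byte carries 6.
    No validation is done; the input is assumed to be well formed.
    """
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


if __name__ == "__main__":
    # Print the four populated rows of the layout (trail bytes 0x80..0xBF).
    for row in range(0x80, 0xC0, 0x10):
        cells = " ".join(f"{decode_cell(row + col):06X}" for col in range(16))
        print(f"{row:02X}  {cells}")
```

Only trail bytes 0x80 through 0xBF are attempted here, which matches the populated rows of the table; other final bytes would not form a valid continuation byte in UTF-8.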

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
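
Several of these properties can be checked with a few lines of code. The following is a rough sketch using Python's standard codecs, not ICU itself. It assumes that "UChar" means one 16-bit UTF-16 code unit, as in ICU's C API, and the helper name `bytes_per_utf16_unit` is made up for this example.

```python
# U+FFFD REPLACEMENT CHARACTER, encoded as UTF-8, gives the substitution bytes.
SUBSTITUTION = "\uFFFD".encode("utf-8")


def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte length of one character divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


def is_ascii_compatible() -> bool:
    """Check that bytes 0x20..0x7E decode to the code points with the same values."""
    return all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))


if __name__ == "__main__":
    print(SUBSTITUTION.hex(" ").upper())        # EF BF BD
    print(bytes_per_utf16_unit("A"))            # 1.0 (one byte, one code unit)
    print(bytes_per_utf16_unit("\uFFFD"))       # 3.0 (three bytes, one code unit)
    print(bytes_per_utf16_unit("\U0010A680"))   # 2.0 (four bytes, two code units)
    print(is_ascii_compatible())                # True
```

Under that reading, a supplementary character such as U+10A680 takes four UTF-8 bytes but also two UTF-16 code units, so the per-unit maximum stays at 3.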

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
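
The set above says that every code point except the surrogate range U+D800 through U+DFFF can be represented. As a small illustration, and again using Python's strict UTF-8 codec as a stand-in for the ICU converter, the sketch below scans the whole code space and collects the code points that fail to encode. The function name `representable` is hypothetical.

```python
def representable(cp: int) -> bool:
    """Return True if the code point encodes with Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Scan U+0000..U+10FFFF and keep the code points that cannot be encoded.
unrepresentable = [cp for cp in range(0x110000) if not representable(cp)]

if __name__ == "__main__":
    print(len(unrepresentable))                                  # 2048
    print(f"{unrepresentable[0]:04X}-{unrepresentable[-1]:04X}")  # D800-DFFF
```

The scan makes about 1.1 million encode calls, so it takes a moment to run; it is meant as a check, not as something to do in production code.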