International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F0B9A4

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
𹤀
39900
𹤁
39901
𹤂
39902
𹤃
39903
𹤄
39904
𹤅
39905
𹤆
39906
𹤇
39907
𹤈
39908
𹤉
39909
𹤊
3990A
𹤋
3990B
𹤌
3990C
𹤍
3990D
𹤎
3990E
𹤏
3990F
80
90
𹤐
39910
𹤑
39911
𹤒
39912
𹤓
39913
𹤔
39914
𹤕
39915
𹤖
39916
𹤗
39917
𹤘
39918
𹤙
39919
𹤚
3991A
𹤛
3991B
𹤜
3991C
𹤝
3991D
𹤞
3991E
𹤟
3991F
90
A0
𹤠
39920
𹤡
39921
𹤢
39922
𹤣
39923
𹤤
39924
𹤥
39925
𹤦
39926
𹤧
39927
𹤨
39928
𹤩
39929
𹤪
3992A
𹤫
3992B
𹤬
3992C
𹤭
3992D
𹤮
3992E
𹤯
3992F
A0
B0
𹤰
39930
𹤱
39931
𹤲
39932
𹤳
39933
𹤴
39934
𹤵
39935
𹤶
39936
𹤷
39937
𹤸
39938
𹤹
39939
𹤺
3993A
𹤻
3993B
𹤼
3993C
𹤽
3993D
𹤾
3993E
𹤿
3993F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]