International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38E82

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󎂀
CE080
󎂁
CE081
󎂂
CE082
󎂃
CE083
󎂄
CE084
󎂅
CE085
󎂆
CE086
󎂇
CE087
󎂈
CE088
󎂉
CE089
󎂊
CE08A
󎂋
CE08B
󎂌
CE08C
󎂍
CE08D
󎂎
CE08E
󎂏
CE08F
80
90
󎂐
CE090
󎂑
CE091
󎂒
CE092
󎂓
CE093
󎂔
CE094
󎂕
CE095
󎂖
CE096
󎂗
CE097
󎂘
CE098
󎂙
CE099
󎂚
CE09A
󎂛
CE09B
󎂜
CE09C
󎂝
CE09D
󎂞
CE09E
󎂟
CE09F
90
A0
󎂠
CE0A0
󎂡
CE0A1
󎂢
CE0A2
󎂣
CE0A3
󎂤
CE0A4
󎂥
CE0A5
󎂦
CE0A6
󎂧
CE0A7
󎂨
CE0A8
󎂩
CE0A9
󎂪
CE0AA
󎂫
CE0AB
󎂬
CE0AC
󎂭
CE0AD
󎂮
CE0AE
󎂯
CE0AF
A0
B0
󎂰
CE0B0
󎂱
CE0B1
󎂲
CE0B2
󎂳
CE0B3
󎂴
CE0B4
󎂵
CE0B5
󎂶
CE0B6
󎂷
CE0B7
󎂸
CE0B8
󎂹
CE0B9
󎂺
CE0BA
󎂻
CE0BB
󎂼
CE0BC
󎂽
CE0BD
󎂾
CE0BE
󎂿
CE0BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]