International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18E82

    00       01       02       03       04       05       06       07       08       09       0A       0B       0C       0D       0E       0F
00
10
20
30
40
50
60
70
80  񎂀 4E080  񎂁 4E081  񎂂 4E082  񎂃 4E083  񎂄 4E084  񎂅 4E085  񎂆 4E086  񎂇 4E087  񎂈 4E088  񎂉 4E089  񎂊 4E08A  񎂋 4E08B  񎂌 4E08C  񎂍 4E08D  񎂎 4E08E  񎂏 4E08F
90  񎂐 4E090  񎂑 4E091  񎂒 4E092  񎂓 4E093  񎂔 4E094  񎂕 4E095  񎂖 4E096  񎂗 4E097  񎂘 4E098  񎂙 4E099  񎂚 4E09A  񎂛 4E09B  񎂜 4E09C  񎂝 4E09D  񎂞 4E09E  񎂟 4E09F
A0  񎂠 4E0A0  񎂡 4E0A1  񎂢 4E0A2  񎂣 4E0A3  񎂤 4E0A4  񎂥 4E0A5  񎂦 4E0A6  񎂧 4E0A7  񎂨 4E0A8  񎂩 4E0A9  񎂪 4E0AA  񎂫 4E0AB  񎂬 4E0AC  񎂭 4E0AD  񎂮 4E0AE  񎂯 4E0AF
B0  񎂰 4E0B0  񎂱 4E0B1  񎂲 4E0B2  񎂳 4E0B3  񎂴 4E0B4  񎂵 4E0B5  񎂶 4E0B6  񎂷 4E0B7  񎂸 4E0B8  񎂹 4E0B9  񎂺 4E0BA  񎂻 4E0BB  񎂼 4E0BC  񎂽 4E0BD  񎂾 4E0BE  񎂿 4E0BF
C0
D0
E0
F0
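
As a cross-check on the layout above, the following Python sketch appends each possible final byte to the prefix F1 8E 82 and decodes the result. It uses Python's built-in strict UTF-8 codec as a stand-in for the ICU converter, which is an assumption made for illustration; the names PREFIX and decode_cell are hypothetical. Under that assumption, only final bytes 80 through BF should decode, which matches the four filled rows.

```python
# Prefix bytes taken from the page: "starting with the bytes F18E82".
PREFIX = bytes.fromhex("F18E82")


def decode_cell(final_byte):
    """Return the code point for PREFIX + final_byte, or None if invalid.

    Assumption: Python's strict UTF-8 decoder stands in for the ICU converter.
    """
    try:
        text = (PREFIX + bytes([final_byte])).decode("utf-8")
    except UnicodeDecodeError:
        return None
    return ord(text)


# Collect every final byte that produces a well-formed 4-byte sequence.
filled = {}
for final_byte in range(0x100):
    code_point = decode_cell(final_byte)
    if code_point is not None:
        filled[final_byte] = code_point

first, last = min(filled), max(filled)
print(f"{len(filled)} cells filled")
print(f"{first:02X} -> U+{filled[first]:05X}")
print(f"{last:02X} -> U+{filled[last]:05X}")
```

The row label is the high nibble of the final byte and the column label is the low nibble, so cell (A0, 05) corresponds to the byte sequence F1 8E 82 A5.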

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
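
The byte counts and the substitution character listed above can be sanity-checked with a short Python sketch. It assumes that a UChar is one UTF-16 code unit, as in ICU, and it uses Python's built-in codecs rather than ICU itself. The helper bytes_per_uchar and the sample characters are hypothetical, chosen only for illustration.

```python
from fractions import Fraction


def bytes_per_uchar(ch):
    """UTF-8 bytes divided by UTF-16 code units for one character.

    Assumption: "UChar" means a 16-bit UTF-16 code unit.
    """
    utf8_bytes = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return Fraction(utf8_bytes, utf16_units)


# One sample for each UTF-8 sequence length (1, 2, 3, and 4 bytes).
samples = ["A", "\u00E9", "\u20AC", "\U0004E080"]
ratios = [bytes_per_uchar(ch) for ch in samples]

for ch, ratio in zip(samples, ratios):
    print(f"U+{ord(ch):04X}: {ratio} byte(s) per UChar")

# The substitution bytes listed on the page decode to a single character.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"substitution character: U+{ord(substitution):04X}")
```

A 4-byte sequence encodes a supplementary character, which occupies two UTF-16 code units, so its ratio is 2 rather than 4. That is consistent with a per-UChar maximum of 3.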


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
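
The set [^\uD800-\uDFFF] excludes only the surrogate code points. A minimal Python sketch, again using the built-in strict UTF-8 codec as a stand-in for the ICU converter (an assumption), shows the same boundary. The helper is_representable is hypothetical.

```python
def is_representable(code_point):
    """Return True if the code point can be encoded by strict UTF-8.

    Assumption: Python's codec is used here in place of the ICU converter.
    """
    try:
        chr(code_point).encode("utf-8")
    except UnicodeEncodeError:
        return False
    return True


# Probe both edges of the surrogate range and a few other code points.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x4E080, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```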