International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2898E

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򉎀
89380
򉎁
89381
򉎂
89382
򉎃
89383
򉎄
89384
򉎅
89385
򉎆
89386
򉎇
89387
򉎈
89388
򉎉
89389
򉎊
8938A
򉎋
8938B
򉎌
8938C
򉎍
8938D
򉎎
8938E
򉎏
8938F
80
90
򉎐
89390
򉎑
89391
򉎒
89392
򉎓
89393
򉎔
89394
򉎕
89395
򉎖
89396
򉎗
89397
򉎘
89398
򉎙
89399
򉎚
8939A
򉎛
8939B
򉎜
8939C
򉎝
8939D
򉎞
8939E
򉎟
8939F
90
A0
򉎠
893A0
򉎡
893A1
򉎢
893A2
򉎣
893A3
򉎤
893A4
򉎥
893A5
򉎦
893A6
򉎧
893A7
򉎨
893A8
򉎩
893A9
򉎪
893AA
򉎫
893AB
򉎬
893AC
򉎭
893AD
򉎮
893AE
򉎯
893AF
A0
B0
򉎰
893B0
򉎱
893B1
򉎲
893B2
򉎳
893B3
򉎴
893B4
򉎵
893B5
򉎶
893B6
򉎷
893B7
򉎸
893B8
򉎹
893B9
򉎺
893BA
򉎻
893BB
򉎼
893BC
򉎽
893BD
򉎾
893BE
򉎿
893BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]