International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
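
Alias resolution for the list above can be sketched as a lookup table keyed on normalized names. This is a simplified illustration, not ICU's implementation: `normalize`, `ALIAS_TABLE`, and `canonical_name` are hypothetical names, and the normalization rule (lowercase, then drop `-`, `_`, and spaces) is an assumption that only approximates ICU's loose name matching.

```python
# Aliases copied from the list above; each resolves to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Lowercase and drop '-', '_' and spaces (assumed, simplified rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Hypothetical lookup table: normalized alias -> internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))


print(canonical_name("IBM_1208"))   # UTF-8
print(canonical_name("utf8"))       # UTF-8
print(canonical_name("latin-1"))    # None
```

Keying on a normalized form means that spelling variants such as `IBM_1208` and `ibm-1208` share a single table entry.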


Codepage Layout

Currently showing the codepage starting with the bytes F3908E

Rows give the high nibble of the final byte, columns give the low nibble. Each cell shows the Unicode code point (hex) for that byte sequence.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     D0380 D0381 D0382 D0383 D0384 D0385 D0386 D0387 D0388 D0389 D038A D038B D038C D038D D038E D038F
90     D0390 D0391 D0392 D0393 D0394 D0395 D0396 D0397 D0398 D0399 D039A D039B D039C D039D D039E D039F
A0     D03A0 D03A1 D03A2 D03A3 D03A4 D03A5 D03A6 D03A7 D03A8 D03A9 D03AA D03AB D03AC D03AD D03AE D03AF
B0     D03B0 D03B1 D03B2 D03B3 D03B4 D03B5 D03B6 D03B7 D03B8 D03B9 D03BA D03BB D03BC D03BD D03BE D03BF
C0-F0  (empty)
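
The mapping from the byte prefix F3 90 8E to code points D0380 through D03BF follows from the standard four-byte UTF-8 bit layout. The sketch below decodes it by hand; `decode_four_byte` is a hypothetical helper written for illustration, and it does not check for overlong forms or values above 10FFFF.

```python
def decode_four_byte(seq):
    """Decode one four-byte UTF-8 sequence to a code point.

    Illustrative only: overlong forms and out-of-range values are not rejected.
    """
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected four bytes with a lead byte of the form 11110xxx")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


prefix = bytes.fromhex("F3908E")
first = decode_four_byte(prefix + b"\x80")
last = decode_four_byte(prefix + b"\xBF")
print(f"{first:05X} {last:05X}")  # D0380 D03BF

# Cross-check against Python's built-in UTF-8 decoder.
assert (prefix + b"\x80").decode("utf-8") == chr(first)
```

Only final bytes 80 through BF are valid trail bytes, which is why rows 00 to 70 and C0 to F0 of the layout are empty.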

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
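
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per character. One plausible reading of the maximum of 3 (an interpretation, not something this page states) is that a four-byte UTF-8 sequence corresponds to a surrogate pair, that is, two UChars. The sketch below checks that arithmetic with Python's standard codecs; `utf8_bytes_per_uchar` is a hypothetical helper.

```python
def utf8_bytes_per_uchar(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


print(utf8_bytes_per_uchar("A"))           # 1.0  (1 byte, 1 code unit)
print(utf8_bytes_per_uchar("\u20ac"))      # 3.0  (3 bytes, 1 code unit)
print(utf8_bytes_per_uchar("\U000D0380"))  # 2.0  (4 bytes, 2 code units)

# The substitution character listed above is U+FFFD encoded in UTF-8.
print("\ufffd".encode("utf-8"))            # b'\xef\xbf\xbd'
```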


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
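
The set pattern above reads as "every code point except the surrogates D800 through DFFF". A minimal sketch of that membership test, cross-checked against Python's own UTF-8 encoder (which also rejects lone surrogates); `is_representable` and `python_can_encode` are hypothetical names:

```python
def is_representable(cp):
    """True if the code point is in [^\\uD800-\\uDFFF], within 0..10FFFF."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Sample the boundaries of the surrogate range and a few other points.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xD0380, 0x10FFFF):
    assert is_representable(cp) == python_can_encode(cp)
    print(f"U+{cp:04X} representable: {is_representable(cp)}")
```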