International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48D92

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􍒀
10D480
􍒁
10D481
􍒂
10D482
􍒃
10D483
􍒄
10D484
􍒅
10D485
􍒆
10D486
􍒇
10D487
􍒈
10D488
􍒉
10D489
􍒊
10D48A
􍒋
10D48B
􍒌
10D48C
􍒍
10D48D
􍒎
10D48E
􍒏
10D48F
80
90
􍒐
10D490
􍒑
10D491
􍒒
10D492
􍒓
10D493
􍒔
10D494
􍒕
10D495
􍒖
10D496
􍒗
10D497
􍒘
10D498
􍒙
10D499
􍒚
10D49A
􍒛
10D49B
􍒜
10D49C
􍒝
10D49D
􍒞
10D49E
􍒟
10D49F
90
A0
􍒠
10D4A0
􍒡
10D4A1
􍒢
10D4A2
􍒣
10D4A3
􍒤
10D4A4
􍒥
10D4A5
􍒦
10D4A6
􍒧
10D4A7
􍒨
10D4A8
􍒩
10D4A9
􍒪
10D4AA
􍒫
10D4AB
􍒬
10D4AC
􍒭
10D4AD
􍒮
10D4AE
􍒯
10D4AF
A0
B0
􍒰
10D4B0
􍒱
10D4B1
􍒲
10D4B2
􍒳
10D4B3
􍒴
10D4B4
􍒵
10D4B5
􍒶
10D4B6
􍒷
10D4B7
􍒸
10D4B8
􍒹
10D4B9
􍒺
10D4BA
􍒻
10D4BB
􍒼
10D4BC
􍒽
10D4BD
􍒾
10D4BE
􍒿
10D4BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]