International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F18E92

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񎒀
4E480
񎒁
4E481
񎒂
4E482
񎒃
4E483
񎒄
4E484
񎒅
4E485
񎒆
4E486
񎒇
4E487
񎒈
4E488
񎒉
4E489
񎒊
4E48A
񎒋
4E48B
񎒌
4E48C
񎒍
4E48D
񎒎
4E48E
񎒏
4E48F
80
90
񎒐
4E490
񎒑
4E491
񎒒
4E492
񎒓
4E493
񎒔
4E494
񎒕
4E495
񎒖
4E496
񎒗
4E497
񎒘
4E498
񎒙
4E499
񎒚
4E49A
񎒛
4E49B
񎒜
4E49C
񎒝
4E49D
񎒞
4E49E
񎒟
4E49F
90
A0
񎒠
4E4A0
񎒡
4E4A1
񎒢
4E4A2
񎒣
4E4A3
񎒤
4E4A4
񎒥
4E4A5
񎒦
4E4A6
񎒧
4E4A7
񎒨
4E4A8
񎒩
4E4A9
񎒪
4E4AA
񎒫
4E4AB
񎒬
4E4AC
񎒭
4E4AD
񎒮
4E4AE
񎒯
4E4AF
A0
B0
񎒰
4E4B0
񎒱
4E4B1
񎒲
4E4B2
񎒳
4E4B3
񎒴
4E4B4
񎒵
4E4B5
񎒶
4E4B6
񎒷
4E4B7
񎒸
4E4B8
񎒹
4E4B9
񎒺
4E4BA
񎒻
4E4BB
񎒼
4E4BC
񎒽
4E4BD
񎒾
4E4BE
񎒿
4E4BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]