International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
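
As a rough illustration of how an alias table like this can be used, the Python sketch below maps any listed alias back to the internal converter name. The helper names (`canonical_name`, `_normalize`) are hypothetical, and the matching rule (ignore case, hyphens, underscores, and spaces) is a simplifying assumption for this sketch, not a statement of ICU's exact comparison rules.

```python
from typing import Optional

# Aliases copied from the list above.
UTF8_ALIASES = (
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
)


def _normalize(label: str) -> str:
    """Lowercase the label and drop '-', '_' and ' ' (assumed matching rule)."""
    return "".join(ch for ch in label.lower() if ch not in "-_ ")


# Every normalized alias points at the internal converter name.
_ALIAS_INDEX = {_normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(label: str) -> Optional[str]:
    """Return the internal converter name for a label, or None if unknown."""
    return _ALIAS_INDEX.get(_normalize(label))
```

With this sketch, `canonical_name("IBM-1208")` and `canonical_name("windows-65001")` both return `"UTF-8"`, while a label that is not in the list returns `None`.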


Codepage Layout

Currently showing the codepage starting with the bytes F3 9D 92. Each cell gives the Unicode code point (hexadecimal) for the trailing byte at that row and column. Rows 00 through 70 and C0 through F0 are empty.

      00     01     02     03     04     05     06     07     08     09     0A     0B     0C     0D     0E     0F
80  DD480  DD481  DD482  DD483  DD484  DD485  DD486  DD487  DD488  DD489  DD48A  DD48B  DD48C  DD48D  DD48E  DD48F
90  DD490  DD491  DD492  DD493  DD494  DD495  DD496  DD497  DD498  DD499  DD49A  DD49B  DD49C  DD49D  DD49E  DD49F
A0  DD4A0  DD4A1  DD4A2  DD4A3  DD4A4  DD4A5  DD4A6  DD4A7  DD4A8  DD4A9  DD4AA  DD4AB  DD4AC  DD4AD  DD4AE  DD4AF
B0  DD4B0  DD4B1  DD4B2  DD4B3  DD4B4  DD4B5  DD4B6  DD4B7  DD4B8  DD4B9  DD4BA  DD4BB  DD4BC  DD4BD  DD4BE  DD4BF
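
The layout can be reproduced with a short Python sketch that appends each trailing byte to the lead bytes F3 9D 92 and decodes the result. It uses Python's built-in UTF-8 codec rather than ICU, and the function name `decode_cell` is hypothetical.

```python
LEAD = bytes.fromhex("F39D92")  # lead bytes shown on this page


def decode_cell(lead: bytes, trail: int) -> int:
    """Decode lead + one trailing byte as UTF-8 and return the code point.

    Raises UnicodeDecodeError when the trailing byte is not in 0x80-0xBF,
    which is why the other rows of the layout are empty.
    """
    text = (lead + bytes([trail])).decode("utf-8")
    return ord(text)


if __name__ == "__main__":
    # Print the four populated rows: 80, 90, A0 and B0.
    for row in (0x80, 0x90, 0xA0, 0xB0):
        cells = [format(decode_cell(LEAD, row + col), "X") for col in range(16)]
        print(format(row, "02X"), " ".join(cells))
```

The arithmetic behind the first cell: F3 contributes the bits 011, 9D contributes 011101, 92 contributes 010010, and 80 contributes 000000, which together give 0xDD480.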

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
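
Two of these properties can be checked with a small Python sketch. It assumes that a UChar is a 16-bit UTF-16 code unit, and it uses Python's own UTF-8 codec instead of ICU, so it only illustrates the numbers above. The name `bytes_per_uchar` is hypothetical.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution character bytes from the table


def bytes_per_uchar(ch: str) -> float:
    """Return UTF-8 bytes per UTF-16 code unit for a single character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


if __name__ == "__main__":
    # The substitution bytes are the UTF-8 form of U+FFFD.
    print(SUBSTITUTION.decode("utf-8") == "\uFFFD")
    # Python's "replace" error handler also substitutes U+FFFD for bad input.
    print(b"\xFF".decode("utf-8", errors="replace") == "\uFFFD")
    # ASCII: 1 byte per code unit; U+20AC: 3 bytes per code unit.
    print(bytes_per_uchar("A"), bytes_per_uchar("\u20AC"))
    # A supplementary character takes 4 bytes but 2 code units.
    print(bytes_per_uchar(chr(0xDD480)))
```

A supplementary character such as U+DD480 needs four UTF-8 bytes but also two UTF-16 code units, so the per-UChar maximum of 3 comes from characters in the range U+0800 to U+FFFF.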

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
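
The set above excludes only the surrogate range U+D800 to U+DFFF. A minimal Python sketch of that membership test follows, with a cross-check against Python's strict UTF-8 encoder. Both function names are hypothetical, and the upper bound of U+10FFFF is the Unicode code space limit rather than something stated on this page.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point is in [^\\uD800-\\uDFFF] within U+0000-U+10FFFF."""
    if not 0 <= code_point <= 0x10FFFF:
        return False
    return not 0xD800 <= code_point <= 0xDFFF


def python_can_encode(code_point: int) -> bool:
    """Cross-check: does Python's strict UTF-8 encoder accept this code point?"""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    # Boundary values around the surrogate block, plus one cell from the layout.
    for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xDD480, 0x10FFFF):
        print(format(cp, "X"), is_representable(cp), python_can_encode(cp))
```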