International Components for Unicode

UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
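
The alias list can be read as a many-to-one lookup onto the internal converter name. A minimal sketch of such a lookup follows, assuming names are compared without regard to case or to the separators '-', '_' and space. The helper names `normalize` and `resolve` are hypothetical, and this is not ICU's actual implementation, whose matching rules may differ in detail.

```python
import re

# Aliases copied from the list above; every one maps to the internal name "UTF-8".
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Hypothetical helper: lowercase the name and drop '-', '_' and spaces."""
    return re.sub(r"[-_ ]", "", name).lower()


# Lookup table keyed by the normalized alias (an assumption of this sketch).
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def resolve(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


if __name__ == "__main__":
    for candidate in ("IBM_1208", "x-utf-8j", "latin1"):
        print(candidate, "->", resolve(candidate))
```

Normalizing both the stored aliases and the query keeps the table small while still accepting spelling variants such as `IBM_1208`.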


Codepage Layout

Currently showing the part of the codepage whose byte sequences start with the bytes F1 98 92

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  58480 58481 58482 58483 58484 58485 58486 58487 58488 58489 5848A 5848B 5848C 5848D 5848E 5848F
90  58490 58491 58492 58493 58494 58495 58496 58497 58498 58499 5849A 5849B 5849C 5849D 5849E 5849F
A0  584A0 584A1 584A2 584A3 584A4 584A5 584A6 584A7 584A8 584A9 584AA 584AB 584AC 584AD 584AE 584AF
B0  584B0 584B1 584B2 584B3 584B4 584B5 584B6 584B7 584B8 584B9 584BA 584BB 584BC 584BD 584BE 584BF
C0
D0
E0
F0
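
The populated rows follow from the UTF-8 bit layout: after the three bytes F1 98 92, only a trailing byte in the range 80 to BF completes a valid four-byte sequence, which is why the other rows are empty. The sketch below recomputes the code points arithmetically and cross-checks them against Python's built-in UTF-8 decoder. The function name `decode_four_byte` is a hypothetical helper for illustration, not an ICU API.

```python
LEAD = bytes([0xF1, 0x98, 0x92])  # the three leading bytes shown on this page


def decode_four_byte(seq: bytes) -> int:
    """Combine the payload bits of a 4-byte UTF-8 sequence into a code point."""
    if len(seq) != 4 or seq[0] & 0xF8 != 0xF0:
        raise ValueError("expected a 4-byte sequence with a 11110xxx lead byte")
    if any(b & 0xC0 != 0x80 for b in seq[1:]):
        raise ValueError("trailing bytes must have the form 10xxxxxx")
    return (
        ((seq[0] & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((seq[1] & 0x3F) << 12)  # 6 payload bits from each trailing byte
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


# One entry per valid final byte (0x80..0xBF), matching rows 80 through B0.
grid = {last: decode_four_byte(LEAD + bytes([last])) for last in range(0x80, 0xC0)}

# Cross-check the arithmetic against Python's own UTF-8 decoder.
for last, code_point in grid.items():
    assert (LEAD + bytes([last])).decode("utf-8") == chr(code_point)

if __name__ == "__main__":
    print(f"{grid[0x80]:X} .. {grid[0xBF]:X}")  # prints "58480 .. 584BF"
```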

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
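
The byte counts above are per UChar, that is, per UTF-16 code unit, which is why the maximum is 3 even though a supplementary character takes 4 bytes in UTF-8: it also occupies two UTF-16 code units. The sketch below illustrates this with Python's standard codecs and shows that \xEF\xBF\xBD is the UTF-8 encoding of U+FFFD. Python's `errors="replace"` handling is used here only as an analogy and is not ICU's substitution mechanism; the helper names are hypothetical.

```python
def utf16_units(text: str) -> int:
    """Number of UTF-16 code units (UChars) needed for the text."""
    return len(text.encode("utf-16-le")) // 2


def bytes_per_uchar(text: str) -> float:
    """UTF-8 byte count divided by the UTF-16 code unit count."""
    return len(text.encode("utf-8")) / utf16_units(text)


samples = {
    "A": "A",                 # ASCII: 1 byte, 1 code unit
    "U+20AC": "\u20ac",       # BMP: 3 bytes, 1 code unit
    "U+58480": "\U00058480",  # supplementary: 4 bytes, 2 code units
}
ratios = {label: bytes_per_uchar(text) for label, text in samples.items()}

# The substitution character listed above is U+FFFD encoded in UTF-8.
SUBSTITUTION = "\ufffd".encode("utf-8")

# Python's decoder substitutes U+FFFD for an invalid byte when asked to replace.
replaced = b"\xff".decode("utf-8", errors="replace")

if __name__ == "__main__":
    print(ratios)
    print(SUBSTITUTION, repr(replaced))
```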


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
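
The set notation excludes only the surrogate range U+D800 to U+DFFF; every other code point can be represented. A minimal sketch of that membership test follows, compared against Python's own UTF-8 encoder, which also refuses lone surrogates. The function names are hypothetical and chosen for illustration.

```python
def in_utf8_set(code_point: int) -> bool:
    """True if the code point falls inside [^\\uD800-\\uDFFF]."""
    return 0 <= code_point <= 0x10FFFF and not (0xD800 <= code_point <= 0xDFFF)


def python_can_encode(code_point: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Code points on both sides of the surrogate range, plus one from the table above.
boundary = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x58480, 0x10FFFF]
results = {cp: in_utf8_set(cp) for cp in boundary}

if __name__ == "__main__":
    for cp, ok in results.items():
        print(f"U+{cp:04X}: {'representable' if ok else 'excluded'}")
```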