International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B392

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󳒀
F3480
󳒁
F3481
󳒂
F3482
󳒃
F3483
󳒄
F3484
󳒅
F3485
󳒆
F3486
󳒇
F3487
󳒈
F3488
󳒉
F3489
󳒊
F348A
󳒋
F348B
󳒌
F348C
󳒍
F348D
󳒎
F348E
󳒏
F348F
80
90
󳒐
F3490
󳒑
F3491
󳒒
F3492
󳒓
F3493
󳒔
F3494
󳒕
F3495
󳒖
F3496
󳒗
F3497
󳒘
F3498
󳒙
F3499
󳒚
F349A
󳒛
F349B
󳒜
F349C
󳒝
F349D
󳒞
F349E
󳒟
F349F
90
A0
󳒠
F34A0
󳒡
F34A1
󳒢
F34A2
󳒣
F34A3
󳒤
F34A4
󳒥
F34A5
󳒦
F34A6
󳒧
F34A7
󳒨
F34A8
󳒩
F34A9
󳒪
F34AA
󳒫
F34AB
󳒬
F34AC
󳒭
F34AD
󳒮
F34AE
󳒯
F34AF
A0
B0
󳒰
F34B0
󳒱
F34B1
󳒲
F34B2
󳒳
F34B3
󳒴
F34B4
󳒵
F34B5
󳒶
F34B6
󳒷
F34B7
󳒸
F34B8
󳒹
F34B9
󳒺
F34BA
󳒻
F34BB
󳒼
F34BC
󳒽
F34BD
󳒾
F34BE
󳒿
F34BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]