International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3938E

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󓎀
D3380
󓎁
D3381
󓎂
D3382
󓎃
D3383
󓎄
D3384
󓎅
D3385
󓎆
D3386
󓎇
D3387
󓎈
D3388
󓎉
D3389
󓎊
D338A
󓎋
D338B
󓎌
D338C
󓎍
D338D
󓎎
D338E
󓎏
D338F
80
90
󓎐
D3390
󓎑
D3391
󓎒
D3392
󓎓
D3393
󓎔
D3394
󓎕
D3395
󓎖
D3396
󓎗
D3397
󓎘
D3398
󓎙
D3399
󓎚
D339A
󓎛
D339B
󓎜
D339C
󓎝
D339D
󓎞
D339E
󓎟
D339F
90
A0
󓎠
D33A0
󓎡
D33A1
󓎢
D33A2
󓎣
D33A3
󓎤
D33A4
󓎥
D33A5
󓎦
D33A6
󓎧
D33A7
󓎨
D33A8
󓎩
D33A9
󓎪
D33AA
󓎫
D33AB
󓎬
D33AC
󓎭
D33AD
󓎮
D33AE
󓎯
D33AF
A0
B0
󓎰
D33B0
󓎱
D33B1
󓎲
D33B2
󓎳
D33B3
󓎴
D33B4
󓎵
D33B5
󓎶
D33B6
󓎷
D33B7
󓎸
D33B8
󓎹
D33B9
󓎺
D33BA
󓎻
D33BB
󓎼
D33BC
󓎽
D33BD
󓎾
D33BE
󓎿
D33BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]