International Components for Unicode


UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
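
Any of the names listed above selects the same converter. As a rough illustration of alias resolution, the sketch below maps each listed alias to the internal name using a relaxed comparison that ignores case and the separators '-', '_' and space. The helper names `normalize` and `resolve` and the matching rule are assumptions made for this example, not ICU's implementation; in ICU itself, alias queries go through functions such as ucnv_countAliases and ucnv_getAlias.

```python
# Aliases copied from the list above; the internal converter name is "UTF-8".
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase the name and drop '-', '_' and spaces (assumed matching rule)."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Every normalized alias points at the same internal converter name.
_LOOKUP = {normalize(alias): "UTF-8" for alias in ALIASES}


def resolve(name: str):
    """Return the internal converter name, or None if the alias is unknown."""
    return _LOOKUP.get(normalize(name))


print(resolve("IBM_1208"))   # UTF-8
print(resolve("utf8"))       # UTF-8
print(resolve("latin-1"))    # None
```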


Codepage Layout

Currently showing the codepage starting with the bytes F1 87 90. Each row is the high nibble of the final byte and each column is its low nibble; each cell gives the code point (hex) that the four-byte sequence maps to. Rows 00 through 70 and C0 through F0 are empty on this page.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  47400 47401 47402 47403 47404 47405 47406 47407 47408 47409 4740A 4740B 4740C 4740D 4740E 4740F
90  47410 47411 47412 47413 47414 47415 47416 47417 47418 47419 4741A 4741B 4741C 4741D 4741E 4741F
A0  47420 47421 47422 47423 47424 47425 47426 47427 47428 47429 4742A 4742B 4742C 4742D 4742E 4742F
B0  47430 47431 47432 47433 47434 47435 47436 47437 47438 47439 4743A 4743B 4743C 4743D 4743E 4743F
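
The code points in the grid follow from the UTF-8 bit layout: a four-byte sequence carries 3 payload bits in the lead byte and 6 in each trail byte. Below is a minimal sketch of that arithmetic, cross-checked against Python's built-in decoder rather than ICU. The function name `decode4` is hypothetical, and it validates only the lead and trail byte patterns, not overlong forms or the U+10FFFF limit.

```python
def decode4(b: bytes) -> int:
    """Decode one four-byte UTF-8 sequence to a code point (pattern checks only)."""
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("not a four-byte UTF-8 lead byte")
    if any((x & 0xC0) != 0x80 for x in b[1:]):
        raise ValueError("trail bytes must be in 80..BF")
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


prefix = bytes([0xF1, 0x87, 0x90])
first = decode4(prefix + b"\x80")
last = decode4(prefix + b"\xBF")
print(f"{first:X} {last:X}")  # 47400 4743F

# Cross-check every populated cell of the page against the standard decoder.
for trail in range(0x80, 0xC0):
    seq = prefix + bytes([trail])
    assert ord(seq.decode("utf-8")) == decode4(seq)
```

Only trail bytes 80 through BF are valid continuation bytes, which is why the other rows of the grid stay empty.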

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
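
Several of these properties can be checked at the encoding level. The sketch below uses Python's built-in UTF-8 codec, not ICU, so it illustrates the facts without calling any ucnv_* function. It assumes, as in ICU, that a UChar is one 16-bit UTF-16 code unit; under that assumption a four-byte supplementary character costs 2 bytes per UChar, which is consistent with the per-UChar maximum of 3. The helper name `bytes_per_uchar` is made up for this example.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by the number of UTF-16 code units for ch."""
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return len(ch.encode("utf-8")) / utf16_units


# One-, two-, three- and four-byte UTF-8 characters.
samples = ["A", "\u00e9", "\u20ac", "\U00047400"]
ratios = [bytes_per_uchar(c) for c in samples]
print(ratios)  # [1.0, 2.0, 3.0, 2.0]

# Substitution character: U+FFFD REPLACEMENT CHARACTER encodes to EF BF BD.
sub = "\ufffd".encode("utf-8")
print(" ".join(f"{b:02X}" for b in sub))  # EF BF BD

# ASCII compatibility: bytes 0x20..0x7E decode to the same code points.
ascii_ok = all(bytes([b]).decode("utf-8") == chr(b) for b in range(0x20, 0x7F))
print(ascii_ok)  # True
```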

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
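
The set excludes only the surrogate range U+D800 to U+DFFF; every other code point up to U+10FFFF is representable. A quick check of the boundaries with Python's strict UTF-8 codec (again not ICU), where the helper name `representable` is an assumption of this example:

```python
def representable(cp: int) -> bool:
    """True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict codec rejects lone surrogates.
        return False


for cp in (0x0000, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X} {representable(cp)}")
```

The boundary values U+D7FF and U+E000 encode normally, while U+D800 and U+DFFF are rejected.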