International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3B38E

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󳎀
F3380
󳎁
F3381
󳎂
F3382
󳎃
F3383
󳎄
F3384
󳎅
F3385
󳎆
F3386
󳎇
F3387
󳎈
F3388
󳎉
F3389
󳎊
F338A
󳎋
F338B
󳎌
F338C
󳎍
F338D
󳎎
F338E
󳎏
F338F
80
90
󳎐
F3390
󳎑
F3391
󳎒
F3392
󳎓
F3393
󳎔
F3394
󳎕
F3395
󳎖
F3396
󳎗
F3397
󳎘
F3398
󳎙
F3399
󳎚
F339A
󳎛
F339B
󳎜
F339C
󳎝
F339D
󳎞
F339E
󳎟
F339F
90
A0
󳎠
F33A0
󳎡
F33A1
󳎢
F33A2
󳎣
F33A3
󳎤
F33A4
󳎥
F33A5
󳎦
F33A6
󳎧
F33A7
󳎨
F33A8
󳎩
F33A9
󳎪
F33AA
󳎫
F33AB
󳎬
F33AC
󳎭
F33AD
󳎮
F33AE
󳎯
F33AF
A0
B0
󳎰
F33B0
󳎱
F33B1
󳎲
F33B2
󳎳
F33B3
󳎴
F33B4
󳎵
F33B5
󳎶
F33B6
󳎷
F33B7
󳎸
F33B8
󳎹
F33B9
󳎺
F33BA
󳎻
F33BB
󳎼
F33BC
󳎽
F33BD
󳎾
F33BE
󳎿
F33BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]