International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1908E

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񐎀
50380
񐎁
50381
񐎂
50382
񐎃
50383
񐎄
50384
񐎅
50385
񐎆
50386
񐎇
50387
񐎈
50388
񐎉
50389
񐎊
5038A
񐎋
5038B
񐎌
5038C
񐎍
5038D
񐎎
5038E
񐎏
5038F
80
90
񐎐
50390
񐎑
50391
񐎒
50392
񐎓
50393
񐎔
50394
񐎕
50395
񐎖
50396
񐎗
50397
񐎘
50398
񐎙
50399
񐎚
5039A
񐎛
5039B
񐎜
5039C
񐎝
5039D
񐎞
5039E
񐎟
5039F
90
A0
񐎠
503A0
񐎡
503A1
񐎢
503A2
񐎣
503A3
񐎤
503A4
񐎥
503A5
񐎦
503A6
񐎧
503A7
񐎨
503A8
񐎩
503A9
񐎪
503AA
񐎫
503AB
񐎬
503AC
񐎭
503AD
񐎮
503AE
񐎯
503AF
A0
B0
񐎰
503B0
񐎱
503B1
񐎲
503B2
񐎳
503B3
񐎴
503B4
񐎵
503B5
񐎶
503B6
񐎷
503B7
񐎸
503B8
񐎹
503B9
񐎺
503BA
񐎻
503BB
񐎼
503BC
񐎽
503BD
񐎾
503BE
񐎿
503BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]