International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8


Codepage Layout

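Currently showing the codepage starting with the bytes F3 84 90. Each cell is indexed by the final byte (row label plus column label) and shows the code point that the complete four-byte sequence decodes to.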

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80   C4400 C4401 C4402 C4403 C4404 C4405 C4406 C4407 C4408 C4409 C440A C440B C440C C440D C440E C440F
90   C4410 C4411 C4412 C4413 C4414 C4415 C4416 C4417 C4418 C4419 C441A C441B C441C C441D C441E C441F
A0   C4420 C4421 C4422 C4423 C4424 C4425 C4426 C4427 C4428 C4429 C442A C442B C442C C442D C442E C442F
B0   C4430 C4431 C4432 C4433 C4434 C4435 C4436 C4437 C4438 C4439 C443A C443B C443C C443D C443E C443F
C0
D0
E0
F0
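
The mapping in the layout above can be reproduced by hand. The following is a minimal sketch in Python, not ICU's own decoder: the helper name decode_utf8_4byte is made up for this illustration, and it only checks the shape of each byte (it does not reject overlong forms or out-of-range values). It shows why only final bytes 80 through BF are populated and how F3 84 90 80 becomes C4400.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Decode one 4-byte UTF-8 sequence to a code point.

    Hypothetical helper for illustration: checks byte shapes only.
    """
    # A 4-byte lead byte has the bit pattern 11110xxx.
    if b0 & 0xF8 != 0xF0:
        raise ValueError("not a 4-byte lead byte")
    # Every following byte must be a continuation byte, 10xxxxxx (0x80-0xBF).
    for b in (b1, b2, b3):
        if b & 0xC0 != 0x80:
            raise ValueError("not a continuation byte")
    # 3 payload bits from the lead byte, 6 from each continuation byte.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


prefix = (0xF3, 0x84, 0x90)
first = decode_utf8_4byte(*prefix, 0x80)
last = decode_utf8_4byte(*prefix, 0xBF)
print(f"{first:X} {last:X}")  # C4400 C443F

# Cross-check against Python's built-in UTF-8 codec.
assert bytes([*prefix, 0x80]).decode("utf-8") == chr(first)
```

Final bytes outside 80 to BF are not continuation bytes, which is why the other rows of the layout are empty.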

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
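
The figures of 1 and 3 bytes per UChar can look surprising given that UTF-8 sequences run up to four bytes. A rough sketch in Python, using the standard codecs rather than ICU and reading UChar as a 16-bit UTF-16 code unit (ICU's definition), shows the likely reasoning: a four-byte sequence corresponds to two UTF-16 code units, so the worst case per unit is three bytes. The helper name utf8_bytes_and_utf16_units is invented for this example.

```python
def utf8_bytes_and_utf16_units(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3 and 4 bytes.
for ch in ["A", "\u00e9", "\u20ac", "\U000C4400"]:
    n8, n16 = utf8_bytes_and_utf16_units(ch)
    print(f"U+{ord(ch):04X}: {n8} UTF-8 bytes / {n16} UTF-16 code units")

# The substitution character listed above is U+FFFD encoded in UTF-8.
substitution = "\ufffd".encode("utf-8")
print(substitution)  # b'\xef\xbf\xbd'
```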

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
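
The set above says that every code point except the surrogates U+D800 to U+DFFF is representable. As an illustration only, the sketch below probes the same boundary with Python's strict UTF-8 codec, which is assumed here to agree with the ICU converter on this point; utf8_encodable is a hypothetical helper name.

```python
def utf8_encodable(cp):
    """Return True if the code point encodes under Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates are rejected by the strict codec.
        return False


# Probe the edges of the surrogate range and the top of the code space.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}: {utf8_encodable(cp)}")
```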