International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48190

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􁐀
101400
􁐁
101401
􁐂
101402
􁐃
101403
􁐄
101404
􁐅
101405
􁐆
101406
􁐇
101407
􁐈
101408
􁐉
101409
􁐊
10140A
􁐋
10140B
􁐌
10140C
􁐍
10140D
􁐎
10140E
􁐏
10140F
80
90
􁐐
101410
􁐑
101411
􁐒
101412
􁐓
101413
􁐔
101414
􁐕
101415
􁐖
101416
􁐗
101417
􁐘
101418
􁐙
101419
􁐚
10141A
􁐛
10141B
􁐜
10141C
􁐝
10141D
􁐞
10141E
􁐟
10141F
90
A0
􁐠
101420
􁐡
101421
􁐢
101422
􁐣
101423
􁐤
101424
􁐥
101425
􁐦
101426
􁐧
101427
􁐨
101428
􁐩
101429
􁐪
10142A
􁐫
10142B
􁐬
10142C
􁐭
10142D
􁐮
10142E
􁐯
10142F
A0
B0
􁐰
101430
􁐱
101431
􁐲
101432
􁐳
101433
􁐴
101434
􁐵
101435
􁐶
101436
􁐷
101437
􁐸
101438
􁐹
101439
􁐺
10143A
􁐻
10143B
􁐼
10143C
􁐽
10143D
􁐾
10143E
􁐿
10143F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]