International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F48C94

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
􌔀
10C500
􌔁
10C501
􌔂
10C502
􌔃
10C503
􌔄
10C504
􌔅
10C505
􌔆
10C506
􌔇
10C507
􌔈
10C508
􌔉
10C509
􌔊
10C50A
􌔋
10C50B
􌔌
10C50C
􌔍
10C50D
􌔎
10C50E
􌔏
10C50F
80
90
􌔐
10C510
􌔑
10C511
􌔒
10C512
􌔓
10C513
􌔔
10C514
􌔕
10C515
􌔖
10C516
􌔗
10C517
􌔘
10C518
􌔙
10C519
􌔚
10C51A
􌔛
10C51B
􌔜
10C51C
􌔝
10C51D
􌔞
10C51E
􌔟
10C51F
90
A0
􌔠
10C520
􌔡
10C521
􌔢
10C522
􌔣
10C523
􌔤
10C524
􌔥
10C525
􌔦
10C526
􌔧
10C527
􌔨
10C528
􌔩
10C529
􌔪
10C52A
􌔫
10C52B
􌔬
10C52C
􌔭
10C52D
􌔮
10C52E
􌔯
10C52F
A0
B0
􌔰
10C530
􌔱
10C531
􌔲
10C532
􌔳
10C533
􌔴
10C534
􌔵
10C535
􌔶
10C536
􌔷
10C537
􌔸
10C538
􌔹
10C539
􌔺
10C53A
􌔻
10C53B
􌔼
10C53C
􌔽
10C53D
􌔾
10C53E
􌔿
10C53F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]