International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A994

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󩔀
E9500
󩔁
E9501
󩔂
E9502
󩔃
E9503
󩔄
E9504
󩔅
E9505
󩔆
E9506
󩔇
E9507
󩔈
E9508
󩔉
E9509
󩔊
E950A
󩔋
E950B
󩔌
E950C
󩔍
E950D
󩔎
E950E
󩔏
E950F
80
90
󩔐
E9510
󩔑
E9511
󩔒
E9512
󩔓
E9513
󩔔
E9514
󩔕
E9515
󩔖
E9516
󩔗
E9517
󩔘
E9518
󩔙
E9519
󩔚
E951A
󩔛
E951B
󩔜
E951C
󩔝
E951D
󩔞
E951E
󩔟
E951F
90
A0
󩔠
E9520
󩔡
E9521
󩔢
E9522
󩔣
E9523
󩔤
E9524
󩔥
E9525
󩔦
E9526
󩔧
E9527
󩔨
E9528
󩔩
E9529
󩔪
E952A
󩔫
E952B
󩔬
E952C
󩔭
E952D
󩔮
E952E
󩔯
E952F
A0
B0
󩔰
E9530
󩔱
E9531
󩔲
E9532
󩔳
E9533
󩔴
E9534
󩔵
E9535
󩔶
E9536
󩔷
E9537
󩔸
E9538
󩔹
E9539
󩔺
E953A
󩔻
E953B
󩔼
E953C
󩔽
E953D
󩔾
E953E
󩔿
E953F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]