International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage for four-byte sequences starting with the bytes F3 89 9C

Rows give the high nibble of the final byte and columns give its low nibble. Each cell is the Unicode code point (hexadecimal) encoded by F3 89 9C followed by that byte.

        00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70   (no characters)
80      C9700 C9701 C9702 C9703 C9704 C9705 C9706 C9707 C9708 C9709 C970A C970B C970C C970D C970E C970F
90      C9710 C9711 C9712 C9713 C9714 C9715 C9716 C9717 C9718 C9719 C971A C971B C971C C971D C971E C971F
A0      C9720 C9721 C9722 C9723 C9724 C9725 C9726 C9727 C9728 C9729 C972A C972B C972C C972D C972E C972F
B0      C9730 C9731 C9732 C9733 C9734 C9735 C9736 C9737 C9738 C9739 C973A C973B C973C C973D C973E C973F
C0-F0   (no characters)
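
The layout above can be reproduced by decoding the prefix F3 89 9C with each possible final byte. The following is a minimal sketch in Python, not ICU code; the names PREFIX, decode_cell, and manual_decode are hypothetical helpers introduced for illustration. It uses Python's built-in UTF-8 codec and, separately, the standard four-byte bit layout (3 + 6 + 6 + 6 payload bits).

```python
# First three bytes of the four-byte sequences shown in the layout.
PREFIX = bytes([0xF3, 0x89, 0x9C])


def decode_cell(last_byte: int) -> int:
    """Decode PREFIX + last_byte as UTF-8 and return the code point.

    Raises UnicodeDecodeError when last_byte is not a continuation
    byte (0x80..0xBF), which is why the other rows are empty.
    """
    text = (PREFIX + bytes([last_byte])).decode("utf-8")
    return ord(text)


def manual_decode(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine the payload bits of a four-byte UTF-8 sequence by hand."""
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def is_valid_final_byte(last_byte: int) -> bool:
    """Report whether PREFIX + last_byte is well-formed UTF-8."""
    try:
        decode_cell(last_byte)
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    for high in range(0x80, 0xC0, 0x10):
        row = " ".join(f"{decode_cell(high + low):X}" for low in range(16))
        print(f"{high:02X}  {row}")
```

Only final bytes in the range 80 through BF decode successfully, which matches the four populated rows.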

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
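
The byte counts and the substitution character can be checked outside ICU. This is a rough sketch in Python under the assumption that a UChar is one UTF-16 code unit; utf8_bytes_per_uchar and SAMPLES are hypothetical names chosen for illustration. A supplementary character takes four UTF-8 bytes but two UTF-16 code units, so it averages two bytes per UChar and the maximum of three comes from the Basic Multilingual Plane.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte length of ch divided by its UTF-16 code unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3, and 4 bytes.
SAMPLES = ["A", "\u00E9", "\u20AC", "\U000C9700"]

if __name__ == "__main__":
    ratios = [utf8_bytes_per_uchar(ch) for ch in SAMPLES]
    print("min bytes per UChar:", min(ratios))
    print("max bytes per UChar:", max(ratios))

    # The substitution bytes listed above are the UTF-8 form of U+FFFD.
    substitution = b"\xEF\xBF\xBD".decode("utf-8")
    print(f"substitution character: U+{ord(substitution):04X}")
```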


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
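
The set above excludes only the surrogate range U+D800 through U+DFFF. As a hedged illustration, the sketch below uses Python's built-in UTF-8 encoder rather than ICU, on the assumption that both reject lone surrogates in the same way; is_representable is a hypothetical helper name.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's strict UTF-8 encoder refuses lone surrogates.
        return False


if __name__ == "__main__":
    for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
        print(f"U+{cp:04X}: {is_representable(cp)}")
```

Every code point from U+0000 to U+10FFFF outside the surrogate block encodes successfully, including unassigned ones such as U+C9700.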