International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F382A5

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󂥀
C2940
󂥁
C2941
󂥂
C2942
󂥃
C2943
󂥄
C2944
󂥅
C2945
󂥆
C2946
󂥇
C2947
󂥈
C2948
󂥉
C2949
󂥊
C294A
󂥋
C294B
󂥌
C294C
󂥍
C294D
󂥎
C294E
󂥏
C294F
80
90
󂥐
C2950
󂥑
C2951
󂥒
C2952
󂥓
C2953
󂥔
C2954
󂥕
C2955
󂥖
C2956
󂥗
C2957
󂥘
C2958
󂥙
C2959
󂥚
C295A
󂥛
C295B
󂥜
C295C
󂥝
C295D
󂥞
C295E
󂥟
C295F
90
A0
󂥠
C2960
󂥡
C2961
󂥢
C2962
󂥣
C2963
󂥤
C2964
󂥥
C2965
󂥦
C2966
󂥧
C2967
󂥨
C2968
󂥩
C2969
󂥪
C296A
󂥫
C296B
󂥬
C296C
󂥭
C296D
󂥮
C296E
󂥯
C296F
A0
B0
󂥰
C2970
󂥱
C2971
󂥲
C2972
󂥳
C2973
󂥴
C2974
󂥵
C2975
󂥶
C2976
󂥷
C2977
󂥸
C2978
󂥹
C2979
󂥺
C297A
󂥻
C297B
󂥼
C297C
󂥽
C297D
󂥾
C297E
󂥿
C297F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]