International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38182

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󁂀
C1080
󁂁
C1081
󁂂
C1082
󁂃
C1083
󁂄
C1084
󁂅
C1085
󁂆
C1086
󁂇
C1087
󁂈
C1088
󁂉
C1089
󁂊
C108A
󁂋
C108B
󁂌
C108C
󁂍
C108D
󁂎
C108E
󁂏
C108F
80
90
󁂐
C1090
󁂑
C1091
󁂒
C1092
󁂓
C1093
󁂔
C1094
󁂕
C1095
󁂖
C1096
󁂗
C1097
󁂘
C1098
󁂙
C1099
󁂚
C109A
󁂛
C109B
󁂜
C109C
󁂝
C109D
󁂞
C109E
󁂟
C109F
90
A0
󁂠
C10A0
󁂡
C10A1
󁂢
C10A2
󁂣
C10A3
󁂤
C10A4
󁂥
C10A5
󁂦
C10A6
󁂧
C10A7
󁂨
C10A8
󁂩
C10A9
󁂪
C10AA
󁂫
C10AB
󁂬
C10AC
󁂭
C10AD
󁂮
C10AE
󁂯
C10AF
A0
B0
󁂰
C10B0
󁂱
C10B1
󁂲
C10B2
󁂳
C10B3
󁂴
C10B4
󁂵
C10B5
󁂶
C10B6
󁂷
C10B7
󁂸
C10B8
󁂹
C10B9
󁂺
C10BA
󁂻
C10BB
󁂼
C10BC
󁂽
C10BD
󁂾
C10BE
󁂿
C10BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]