International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38EA3

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󎣀
CE8C0
󎣁
CE8C1
󎣂
CE8C2
󎣃
CE8C3
󎣄
CE8C4
󎣅
CE8C5
󎣆
CE8C6
󎣇
CE8C7
󎣈
CE8C8
󎣉
CE8C9
󎣊
CE8CA
󎣋
CE8CB
󎣌
CE8CC
󎣍
CE8CD
󎣎
CE8CE
󎣏
CE8CF
80
90
󎣐
CE8D0
󎣑
CE8D1
󎣒
CE8D2
󎣓
CE8D3
󎣔
CE8D4
󎣕
CE8D5
󎣖
CE8D6
󎣗
CE8D7
󎣘
CE8D8
󎣙
CE8D9
󎣚
CE8DA
󎣛
CE8DB
󎣜
CE8DC
󎣝
CE8DD
󎣞
CE8DE
󎣟
CE8DF
90
A0
󎣠
CE8E0
󎣡
CE8E1
󎣢
CE8E2
󎣣
CE8E3
󎣤
CE8E4
󎣥
CE8E5
󎣦
CE8E6
󎣧
CE8E7
󎣨
CE8E8
󎣩
CE8E9
󎣪
CE8EA
󎣫
CE8EB
󎣬
CE8EC
󎣭
CE8ED
󎣮
CE8EE
󎣯
CE8EF
A0
B0
󎣰
CE8F0
󎣱
CE8F1
󎣲
CE8F2
󎣳
CE8F3
󎣴
CE8F4
󎣵
CE8F5
󎣶
CE8F6
󎣷
CE8F7
󎣸
CE8F8
󎣹
CE8F9
󎣺
CE8FA
󎣻
CE8FB
󎣼
CE8FC
󎣽
CE8FD
󎣾
CE8FE
󎣿
CE8FF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]