International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F381A3

Each cell gives the Unicode code point (hexadecimal) selected by the final byte. The row label is the high nibble of that byte and the column label is the low nibble. Rows 00-70 and C0-F0 are empty.

   00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80 C18C0 C18C1 C18C2 C18C3 C18C4 C18C5 C18C6 C18C7 C18C8 C18C9 C18CA C18CB C18CC C18CD C18CE C18CF
90 C18D0 C18D1 C18D2 C18D3 C18D4 C18D5 C18D6 C18D7 C18D8 C18D9 C18DA C18DB C18DC C18DD C18DE C18DF
A0 C18E0 C18E1 C18E2 C18E3 C18E4 C18E5 C18E6 C18E7 C18E8 C18E9 C18EA C18EB C18EC C18ED C18EE C18EF
B0 C18F0 C18F1 C18F2 C18F3 C18F4 C18F5 C18F6 C18F7 C18F8 C18F9 C18FA C18FB C18FC C18FD C18FE C18FF
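
The populated rows of the layout can be reproduced by appending each possible final byte to the lead bytes F3 81 A3 and decoding the result. The following is a minimal sketch that uses Python's built-in UTF-8 codec rather than ICU itself, which is an assumption of this example; the name `codepage_cells` is hypothetical and not part of any ICU API.

```python
from typing import Dict, Optional

PREFIX = bytes.fromhex("F381A3")  # lead bytes shown on this page


def codepage_cells(prefix: bytes = PREFIX) -> Dict[int, Optional[int]]:
    """Map each possible final byte (0x00-0xFF) to the code point it selects.

    The value is None when prefix + byte is not a well-formed UTF-8
    sequence for exactly one code point.
    """
    cells: Dict[int, Optional[int]] = {}
    for last in range(0x100):
        try:
            text = (prefix + bytes([last])).decode("utf-8")
        except UnicodeDecodeError:
            cells[last] = None
            continue
        cells[last] = ord(text) if len(text) == 1 else None
    return cells


if __name__ == "__main__":
    cells = codepage_cells()
    for row in range(0x80, 0xC0, 0x10):
        codes = " ".join(f"{cells[row + col]:05X}" for col in range(16))
        print(f"{row:02X} {codes}")
```

Only final bytes 80 through BF decode successfully, because a fourth byte in a four-byte UTF-8 sequence must be a continuation byte. That is why the remaining rows of the layout are blank.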

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
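
Several of the figures above can be checked against any conforming UTF-8 encoder. The sketch below uses Python's built-in codec, not ICU, and treats a UChar as one UTF-16 code unit, as ICU defines it. The helper name `utf8_bytes_per_uchar` is hypothetical.

```python
def utf8_bytes_per_uchar(ch: str) -> float:
    """Return UTF-8 bytes divided by UTF-16 code units for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len / uchars


# The substitution character is U+FFFD REPLACEMENT CHARACTER in UTF-8 form.
assert "\uFFFD".encode("utf-8") == b"\xEF\xBF\xBD"

# Printable ASCII bytes 0x20-0x7E decode to the same code points.
ascii_bytes = bytes(range(0x20, 0x7F))
assert ascii_bytes.decode("utf-8") == ascii_bytes.decode("ascii")

# Minimum: an ASCII character takes 1 byte for 1 UChar.
assert utf8_bytes_per_uchar("A") == 1
# Maximum: the highest BMP code point takes 3 bytes for 1 UChar.
assert utf8_bytes_per_uchar("\uFFFF") == 3
# A supplementary character takes 4 bytes but spans 2 UChars (a surrogate pair).
assert utf8_bytes_per_uchar("\U000C18C0") == 2
```

The maximum is 3 rather than 4 because a four-byte UTF-8 sequence always corresponds to a surrogate pair, that is, two UChars.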

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
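
The set above excludes only the surrogate code points U+D800 through U+DFFF. As a rough check, the sketch below tests membership with Python's strict UTF-8 encoder, which likewise rejects lone surrogates. This stands in for the ICU converter and is an assumption of the example; `is_representable` is a hypothetical helper name.

```python
def is_representable(code_point: int) -> bool:
    """Return True if the code point can be encoded as well-formed UTF-8."""
    try:
        chr(code_point).encode("utf-8")  # strict mode rejects surrogates
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    excluded = [cp for cp in range(0x110000) if not is_representable(cp)]
    print(f"{excluded[0]:04X}-{excluded[-1]:04X}, {len(excluded)} code points")
```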