International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F388A6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󈦀
C8980
󈦁
C8981
󈦂
C8982
󈦃
C8983
󈦄
C8984
󈦅
C8985
󈦆
C8986
󈦇
C8987
󈦈
C8988
󈦉
C8989
󈦊
C898A
󈦋
C898B
󈦌
C898C
󈦍
C898D
󈦎
C898E
󈦏
C898F
80
90
󈦐
C8990
󈦑
C8991
󈦒
C8992
󈦓
C8993
󈦔
C8994
󈦕
C8995
󈦖
C8996
󈦗
C8997
󈦘
C8998
󈦙
C8999
󈦚
C899A
󈦛
C899B
󈦜
C899C
󈦝
C899D
󈦞
C899E
󈦟
C899F
90
A0
󈦠
C89A0
󈦡
C89A1
󈦢
C89A2
󈦣
C89A3
󈦤
C89A4
󈦥
C89A5
󈦦
C89A6
󈦧
C89A7
󈦨
C89A8
󈦩
C89A9
󈦪
C89AA
󈦫
C89AB
󈦬
C89AC
󈦭
C89AD
󈦮
C89AE
󈦯
C89AF
A0
B0
󈦰
C89B0
󈦱
C89B1
󈦲
C89B2
󈦳
C89B3
󈦴
C89B4
󈦵
C89B5
󈦶
C89B6
󈦷
C89B7
󈦸
C89B8
󈦹
C89B9
󈦺
C89BA
󈦻
C89BB
󈦼
C89BC
󈦽
C89BD
󈦾
C89BE
󈦿
C89BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]