International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F19CA6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
񜦀
5C980
񜦁
5C981
񜦂
5C982
񜦃
5C983
񜦄
5C984
񜦅
5C985
񜦆
5C986
񜦇
5C987
񜦈
5C988
񜦉
5C989
񜦊
5C98A
񜦋
5C98B
񜦌
5C98C
񜦍
5C98D
񜦎
5C98E
񜦏
5C98F
80
90
񜦐
5C990
񜦑
5C991
񜦒
5C992
񜦓
5C993
񜦔
5C994
񜦕
5C995
񜦖
5C996
񜦗
5C997
񜦘
5C998
񜦙
5C999
񜦚
5C99A
񜦛
5C99B
񜦜
5C99C
񜦝
5C99D
񜦞
5C99E
񜦟
5C99F
90
A0
񜦠
5C9A0
񜦡
5C9A1
񜦢
5C9A2
񜦣
5C9A3
񜦤
5C9A4
񜦥
5C9A5
񜦦
5C9A6
񜦧
5C9A7
񜦨
5C9A8
񜦩
5C9A9
񜦪
5C9AA
񜦫
5C9AB
񜦬
5C9AC
񜦭
5C9AD
񜦮
5C9AE
񜦯
5C9AF
A0
B0
񜦰
5C9B0
񜦱
5C9B1
񜦲
5C9B2
񜦳
5C9B3
񜦴
5C9B4
񜦵
5C9B5
񜦶
5C9B6
񜦷
5C9B7
񜦸
5C9B8
񜦹
5C9B9
񜦺
5C9BA
񜦻
5C9BB
񜦼
5C9BC
񜦽
5C9BD
񜦾
5C9BE
񜦿
5C9BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]