International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
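
As a rough illustration of how an alias list like this can be used, the sketch below resolves any listed alias to the internal converter name. The alias strings come from the list above; the `normalize` and `canonical_name` helpers and the loose matching rule (ignore case, hyphens, underscores, and spaces) are assumptions made for this example and are not ICU's actual lookup code.

```python
# Aliases copied from the alias list above; everything else is illustrative.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name):
    """Hypothetical loose key: lowercase, then drop '-', '_' and spaces."""
    return "".join(ch for ch in name.lower() if ch not in "-_ ")


# Map every normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this table, `canonical_name("IBM-1208")` and `canonical_name("x-utf_8j")` both resolve to `"UTF-8"`, while a name that is not in the list yields `None`.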


Codepage Layout

Currently showing the codepage starting with the bytes F18FA6

Rows give the high nibble and columns the low nibble of the final byte. Each cell shows the Unicode code point (hexadecimal) that the full byte sequence maps to. Rows 00 through 70 and C0 through F0 contain no mappings.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  4F980 4F981 4F982 4F983 4F984 4F985 4F986 4F987 4F988 4F989 4F98A 4F98B 4F98C 4F98D 4F98E 4F98F
90  4F990 4F991 4F992 4F993 4F994 4F995 4F996 4F997 4F998 4F999 4F99A 4F99B 4F99C 4F99D 4F99E 4F99F
A0  4F9A0 4F9A1 4F9A2 4F9A3 4F9A4 4F9A5 4F9A6 4F9A7 4F9A8 4F9A9 4F9AA 4F9AB 4F9AC 4F9AD 4F9AE 4F9AF
B0  4F9B0 4F9B1 4F9B2 4F9B3 4F9B4 4F9B5 4F9B6 4F9B7 4F9B8 4F9B9 4F9BA 4F9BB 4F9BC 4F9BD 4F9BE 4F9BF
C0
D0
E0
F0
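
The populated rows of the layout can be reproduced by decoding the four-byte sequences by hand. The following is a minimal sketch, assuming the standard UTF-8 bit layout for four-byte sequences; the names `LEAD`, `decode_4byte`, and `layout_row` are made up for this example, and the decoder performs no validation.

```python
from typing import List

# Lead bytes of the page being displayed in the layout.
LEAD = bytes.fromhex("F18FA6")


def decode_4byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence by hand (no validation)."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each trail byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


def layout_row(high_nibble: int) -> List[str]:
    """Code points (hex strings) for one row, e.g. high_nibble=0x80."""
    return [
        f"{decode_4byte(LEAD + bytes([high_nibble | low])):X}"
        for low in range(16)
    ]


if __name__ == "__main__":
    for row in (0x80, 0x90, 0xA0, 0xB0):
        print(f"{row:02X}", " ".join(layout_row(row)))
```

Only final bytes in the range 80 to BF are valid trail bytes in UTF-8, which is why the other rows of the layout are empty.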

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
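
Two of the values above can be checked with a short sketch, assuming that a UChar is a 16-bit UTF-16 code unit. The helper names and the sample characters are chosen for this example only. The substitution bytes are the UTF-8 encoding of U+FFFD, and the number of UTF-8 bytes per UTF-16 code unit stays between 1 and 3 because a supplementary character uses 4 bytes but also 2 code units.

```python
def utf16_units(ch: str) -> int:
    """Number of 16-bit UTF-16 code units needed for one character."""
    return len(ch.encode("utf-16-le")) // 2


def utf8_bytes(ch: str) -> int:
    """Number of bytes needed for one character in UTF-8."""
    return len(ch.encode("utf-8"))


# U+FFFD REPLACEMENT CHARACTER encoded as UTF-8.
SUBSTITUTION = "\uFFFD".encode("utf-8")

# Illustrative samples: ASCII, Latin-1 range, BMP, supplementary plane.
SAMPLES = ["A", "\u00E9", "\u20AC", "\U0004F980"]

# Pairs of (UTF-8 bytes, UTF-16 code units) for each sample.
SIZES = [(utf8_bytes(c), utf16_units(c)) for c in SAMPLES]

# Bytes per code unit for each sample.
RATIOS = [b / u for b, u in SIZES]
```

For these samples the ratio never drops below 1 or exceeds 3, which is consistent with the minimum and maximum listed above.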

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
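
The set above covers every code point except the surrogate range U+D800 to U+DFFF. As a hedged illustration, the sketch below uses Python's strict UTF-8 codec as a stand-in for the converter (an assumption; it is not ICU) to test whether a given code point is representable. The function name `representable` is invented for this example.

```python
def representable(code_point: int) -> bool:
    """True if Python's strict UTF-8 codec can encode this code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Raised for lone surrogates, U+D800 through U+DFFF.
        return False


# Count the code points rejected across the whole Unicode range.
EXCLUDED = sum(1 for cp in range(0x110000) if not representable(cp))
```

Under this stand-in, the boundary code points U+D7FF and U+E000 are accepted, while U+D800 and U+DFFF are rejected.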