International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F28DA6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򍦀
8D980
򍦁
8D981
򍦂
8D982
򍦃
8D983
򍦄
8D984
򍦅
8D985
򍦆
8D986
򍦇
8D987
򍦈
8D988
򍦉
8D989
򍦊
8D98A
򍦋
8D98B
򍦌
8D98C
򍦍
8D98D
򍦎
8D98E
򍦏
8D98F
80
90
򍦐
8D990
򍦑
8D991
򍦒
8D992
򍦓
8D993
򍦔
8D994
򍦕
8D995
򍦖
8D996
򍦗
8D997
򍦘
8D998
򍦙
8D999
򍦚
8D99A
򍦛
8D99B
򍦜
8D99C
򍦝
8D99D
򍦞
8D99E
򍦟
8D99F
90
A0
򍦠
8D9A0
򍦡
8D9A1
򍦢
8D9A2
򍦣
8D9A3
򍦤
8D9A4
򍦥
8D9A5
򍦦
8D9A6
򍦧
8D9A7
򍦨
8D9A8
򍦩
8D9A9
򍦪
8D9AA
򍦫
8D9AB
򍦬
8D9AC
򍦭
8D9AD
򍦮
8D9AE
򍦯
8D9AF
A0
B0
򍦰
8D9B0
򍦱
8D9B1
򍦲
8D9B2
򍦳
8D9B3
򍦴
8D9B4
򍦵
8D9B5
򍦶
8D9B6
򍦷
8D9B7
򍦸
8D9B8
򍦹
8D9B9
򍦺
8D9BA
򍦻
8D9BB
򍦼
8D9BC
򍦽
8D9BD
򍦾
8D9BE
򍦿
8D9BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]