International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
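
As a rough illustration of how an alias list like this can be used, the sketch below maps each alias back to the internal converter name. The helper `canonical_name` and the table `ALIAS_TO_CANONICAL` are hypothetical names, and the matching only folds case; alias matching in ICU itself may follow different rules.

```python
# Aliases copied from the list above. The lookup helper is hypothetical
# and is not part of any ICU API.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Case-folded alias -> internal converter name.
ALIAS_TO_CANONICAL = {alias.lower(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name):
    """Return the internal converter name for a known alias, else None."""
    return ALIAS_TO_CANONICAL.get(name.strip().lower())


print(canonical_name("IBM-1208"))
print(canonical_name("x-utf_8j"))
```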


Codepage Layout

Currently showing the codepage starting with the bytes F18DA6

Each row covers sixteen values of the final byte. Code points are given in hexadecimal.

Final byte   Code points    Characters
00-7F        (none)
80-8F        4D980-4D98F    񍦀 񍦁 񍦂 񍦃 񍦄 񍦅 񍦆 񍦇 񍦈 񍦉 񍦊 񍦋 񍦌 񍦍 񍦎 񍦏
90-9F        4D990-4D99F    񍦐 񍦑 񍦒 񍦓 񍦔 񍦕 񍦖 񍦗 񍦘 񍦙 񍦚 񍦛 񍦜 񍦝 񍦞 񍦟
A0-AF        4D9A0-4D9AF    񍦠 񍦡 񍦢 񍦣 񍦤 񍦥 񍦦 񍦧 񍦨 񍦩 񍦪 񍦫 񍦬 񍦭 񍦮 񍦯
B0-BF        4D9B0-4D9BF    񍦰 񍦱 񍦲 񍦳 񍦴 񍦵 񍦶 񍦷 񍦸 񍦹 񍦺 񍦻 񍦼 񍦽 񍦾 񍦿
C0-FF        (none)
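
The layout above follows from how a four-byte UTF-8 sequence packs its bits. The sketch below is not taken from ICU; it uses a hypothetical helper `decode_utf8_4byte` plus Python's built-in codec to recompute the code point range for the prefix F18DA6 and to check which final bytes form a valid sequence.

```python
def decode_utf8_4byte(b0, b1, b2, b3):
    """Combine the payload bits of a four-byte UTF-8 sequence into a code point."""
    # Lead byte carries 3 payload bits, each continuation byte carries 6.
    return ((b0 & 0x07) << 18) | ((b1 & 0x3F) << 12) | ((b2 & 0x3F) << 6) | (b3 & 0x3F)


prefix = bytes.fromhex("F18DA6")  # the three leading bytes shown on this page

first = decode_utf8_4byte(*prefix, 0x80)
last = decode_utf8_4byte(*prefix, 0xBF)

# Final bytes that Python's strict UTF-8 decoder accepts after this prefix.
valid_final_bytes = []
for b in range(256):
    try:
        (prefix + bytes([b])).decode("utf-8")
    except UnicodeDecodeError:
        continue
    valid_final_bytes.append(b)

print(f"{first:X}-{last:X}")                                        # 4D980-4D9BF
print(f"{valid_final_bytes[0]:02X}-{valid_final_bytes[-1]:02X}")    # 80-BF
```

Only continuation bytes (80 through BF) can complete the sequence, which is why the rows for 00-7F and C0-FF are empty.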

Information About This Converter

Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
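
Two of these values can be cross-checked with a short sketch. It assumes that a UChar is one 16-bit UTF-16 code unit, so a character outside the BMP counts as two UChars; the helper `bytes_per_uchar` is a hypothetical name, and the sample characters are chosen only for illustration.

```python
def bytes_per_uchar(ch):
    """UTF-8 bytes produced per UTF-16 code unit for a single character."""
    utf8_bytes = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_bytes / utf16_units


# The substitution bytes from the table decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(substitution):04X}")  # U+FFFD

# ASCII, a two-byte character, a three-byte BMP character, and a
# supplementary character (four bytes, but two UTF-16 code units).
samples = ["A", "\u00E9", "\uFFFD", "\U0004D980"]
ratios = [bytes_per_uchar(ch) for ch in samples]
print(min(ratios), max(ratios))  # 1.0 3.0
```

Under that assumption, a four-byte sequence spreads over a surrogate pair, so it contributes only two bytes per UChar, and the three-byte BMP case sets the maximum.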

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
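
The set above excludes only the surrogate code points. As a minimal sketch, and using Python's strict UTF-8 codec as a stand-in for the converter (an assumption, since the two implementations are independent), the excluded range can be recomputed by trying to encode every code point. The helper `is_representable` is a hypothetical name.

```python
def is_representable(cp):
    """True if the code point can be encoded by Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Scan the whole Unicode code space, U+0000 through U+10FFFF.
unrepresentable = [cp for cp in range(0x110000) if not is_representable(cp)]

print(f"{unrepresentable[0]:04X}-{unrepresentable[-1]:04X}")  # D800-DFFF
print(len(unrepresentable))                                   # 2048
```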