International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F1 9D A6. Rows give the high nibble of the final byte and columns the low nibble; each cell is the resulting code point in hexadecimal.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  5D980 5D981 5D982 5D983 5D984 5D985 5D986 5D987 5D988 5D989 5D98A 5D98B 5D98C 5D98D 5D98E 5D98F
90  5D990 5D991 5D992 5D993 5D994 5D995 5D996 5D997 5D998 5D999 5D99A 5D99B 5D99C 5D99D 5D99E 5D99F
A0  5D9A0 5D9A1 5D9A2 5D9A3 5D9A4 5D9A5 5D9A6 5D9A7 5D9A8 5D9A9 5D9AA 5D9AB 5D9AC 5D9AD 5D9AE 5D9AF
B0  5D9B0 5D9B1 5D9B2 5D9B3 5D9B4 5D9B5 5D9B6 5D9B7 5D9B8 5D9B9 5D9BA 5D9BB 5D9BC 5D9BD 5D9BE 5D9BF
C0
D0
E0
F0
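
The mapping from the byte prefix F1 9D A6 plus a final byte in 80-BF to the code points 5D980-5D9BF follows from the UTF-8 bit layout for four-byte sequences. The following is a minimal Python sketch of that arithmetic, not ICU code; the function name `decode_utf8_4byte` is hypothetical, and the sketch deliberately skips the overlong and out-of-range checks a real decoder needs.

```python
def decode_utf8_4byte(b: bytes) -> int:
    """Decode one four-byte UTF-8 sequence to a code point by hand.

    Illustrative only: does not reject overlong forms or values above 10FFFF.
    """
    if len(b) != 4 or (b[0] & 0xF8) != 0xF0:
        raise ValueError("expected four bytes with a lead byte of the form 11110xxx")
    if any((t & 0xC0) != 0x80 for t in b[1:]):
        raise ValueError("trail bytes must be in the range 80-BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((b[0] & 0x07) << 18)
        | ((b[1] & 0x3F) << 12)
        | ((b[2] & 0x3F) << 6)
        | (b[3] & 0x3F)
    )


prefix = bytes.fromhex("F19DA6")
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"{first:X} {last:X}")  # prints "5D980 5D9BF"

# Cross-check against Python's built-in UTF-8 codec.
assert ord((prefix + b"\x80").decode("utf-8")) == first
```

Only the final byte varies within one page, so each page covers 64 consecutive code points, which is why only rows 80 through B0 are populated.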

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
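
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, so a four-byte UTF-8 sequence counts as two bytes per unit because it occupies a surrogate pair. The sketch below illustrates this with Python's built-in codecs as a stand-in for ICU (an assumption; ICU's behaviour may differ in detail). The helper name `utf8_bytes_per_utf16_unit` is hypothetical. It also shows that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD.

```python
def utf8_bytes_per_utf16_unit(ch: str) -> float:
    """Return UTF-8 byte length divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# One sample per UTF-8 sequence length: 1, 2, 3 and 4 bytes.
samples = ["A", "\u00e9", "\u20ac", "\U0005D980"]
ratios = [utf8_bytes_per_utf16_unit(c) for c in samples]
print(ratios)  # prints [1.0, 2.0, 3.0, 2.0]

# The substitution character bytes decode to U+FFFD REPLACEMENT CHARACTER.
substitution = b"\xEF\xBF\xBD".decode("utf-8")

# Python's decoder (not ICU) inserts the same character for an invalid byte.
replaced = b"abc\xFFdef".decode("utf-8", errors="replace")
```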

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
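
The set above excludes only the surrogate code points D800-DFFF, which have no valid UTF-8 encoding. As a rough check, the following Python sketch compares that rule with what Python's own UTF-8 encoder accepts at the boundaries; it does not use ICU, and the names `in_representable_set` and `python_can_encode` are hypothetical.

```python
def in_representable_set(cp: int) -> bool:
    # Mirrors the set [^\uD800-\uDFFF], limited to the Unicode range 0..10FFFF.
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    # Python's strict UTF-8 encoder refuses lone surrogates.
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Code points on either side of the surrogate block, plus two others.
boundary = [0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x5D980, 0x10FFFF]
for cp in boundary:
    print(f"{cp:X}", in_representable_set(cp), python_can_encode(cp))
```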