International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F39EA6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󞦀
DE980
󞦁
DE981
󞦂
DE982
󞦃
DE983
󞦄
DE984
󞦅
DE985
󞦆
DE986
󞦇
DE987
󞦈
DE988
󞦉
DE989
󞦊
DE98A
󞦋
DE98B
󞦌
DE98C
󞦍
DE98D
󞦎
DE98E
󞦏
DE98F
80
90
󞦐
DE990
󞦑
DE991
󞦒
DE992
󞦓
DE993
󞦔
DE994
󞦕
DE995
󞦖
DE996
󞦗
DE997
󞦘
DE998
󞦙
DE999
󞦚
DE99A
󞦛
DE99B
󞦜
DE99C
󞦝
DE99D
󞦞
DE99E
󞦟
DE99F
90
A0
󞦠
DE9A0
󞦡
DE9A1
󞦢
DE9A2
󞦣
DE9A3
󞦤
DE9A4
󞦥
DE9A5
󞦦
DE9A6
󞦧
DE9A7
󞦨
DE9A8
󞦩
DE9A9
󞦪
DE9AA
󞦫
DE9AB
󞦬
DE9AC
󞦭
DE9AD
󞦮
DE9AE
󞦯
DE9AF
A0
B0
󞦰
DE9B0
󞦱
DE9B1
󞦲
DE9B2
󞦳
DE9B3
󞦴
DE9B4
󞦵
DE9B5
󞦶
DE9B6
󞦷
DE9B7
󞦸
DE9B8
󞦹
DE9B9
󞦺
DE9BA
󞦻
DE9BB
󞦼
DE9BC
󞦽
DE9BD
󞦾
DE9BE
󞦿
DE9BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]