International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38980

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󉀀
C9000
󉀁
C9001
󉀂
C9002
󉀃
C9003
󉀄
C9004
󉀅
C9005
󉀆
C9006
󉀇
C9007
󉀈
C9008
󉀉
C9009
󉀊
C900A
󉀋
C900B
󉀌
C900C
󉀍
C900D
󉀎
C900E
󉀏
C900F
80
90
󉀐
C9010
󉀑
C9011
󉀒
C9012
󉀓
C9013
󉀔
C9014
󉀕
C9015
󉀖
C9016
󉀗
C9017
󉀘
C9018
󉀙
C9019
󉀚
C901A
󉀛
C901B
󉀜
C901C
󉀝
C901D
󉀞
C901E
󉀟
C901F
90
A0
󉀠
C9020
󉀡
C9021
󉀢
C9022
󉀣
C9023
󉀤
C9024
󉀥
C9025
󉀦
C9026
󉀧
C9027
󉀨
C9028
󉀩
C9029
󉀪
C902A
󉀫
C902B
󉀬
C902C
󉀭
C902D
󉀮
C902E
󉀯
C902F
A0
B0
󉀰
C9030
󉀱
C9031
󉀲
C9032
󉀳
C9033
󉀴
C9034
󉀵
C9035
󉀶
C9036
󉀷
C9037
󉀸
C9038
󉀹
C9039
󉀺
C903A
󉀻
C903B
󉀼
C903C
󉀽
C903D
󉀾
C903E
󉀿
C903F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]