International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F2A382

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
򣂀
A3080
򣂁
A3081
򣂂
A3082
򣂃
A3083
򣂄
A3084
򣂅
A3085
򣂆
A3086
򣂇
A3087
򣂈
A3088
򣂉
A3089
򣂊
A308A
򣂋
A308B
򣂌
A308C
򣂍
A308D
򣂎
A308E
򣂏
A308F
80
90
򣂐
A3090
򣂑
A3091
򣂒
A3092
򣂓
A3093
򣂔
A3094
򣂕
A3095
򣂖
A3096
򣂗
A3097
򣂘
A3098
򣂙
A3099
򣂚
A309A
򣂛
A309B
򣂜
A309C
򣂝
A309D
򣂞
A309E
򣂟
A309F
90
A0
򣂠
A30A0
򣂡
A30A1
򣂢
A30A2
򣂣
A30A3
򣂤
A30A4
򣂥
A30A5
򣂦
A30A6
򣂧
A30A7
򣂨
A30A8
򣂩
A30A9
򣂪
A30AA
򣂫
A30AB
򣂬
A30AC
򣂭
A30AD
򣂮
A30AE
򣂯
A30AF
A0
B0
򣂰
A30B0
򣂱
A30B1
򣂲
A30B2
򣂳
A30B3
򣂴
A30B4
򣂵
A30B5
򣂶
A30B6
򣂷
A30B7
򣂸
A30B8
򣂹
A30B9
򣂺
A30BA
򣂻
A30BB
򣂼
A30BC
򣂽
A30BD
򣂾
A30BE
򣂿
A30BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]