International Components for Unicode

UTF-8

List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8
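
One way to use the alias list above is a small lookup table that maps any listed alias back to the internal converter name. The sketch below is a minimal illustration and does not call ICU: the helper `canonical_converter_name` is hypothetical, and the plain case-insensitive comparison is an assumption rather than ICU's own alias-matching rules.

```python
# Aliases copied from the list above. The lookup helper is hypothetical
# and only illustrates the alias-to-name relationship; it is not ICU code.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: aliases are compared case-insensitively and nothing else
# (no stripping of hyphens or underscores).
_CANONICAL = {alias.casefold(): "UTF-8" for alias in UTF8_ALIASES}


def canonical_converter_name(name):
    """Return 'UTF-8' if name is one of the listed aliases, else None."""
    return _CANONICAL.get(name.strip().casefold())


if __name__ == "__main__":
    for candidate in ("CP1208", "windows-65001", "latin-1"):
        print(candidate, "->", canonical_converter_name(candidate))
```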


Codepage Layout

Currently showing the codepage starting with the bytes F384AB

Rows give the high nibble and columns the low nibble of the final byte. Each cell is the Unicode code point (hex) that the byte sequence maps to.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (no mappings)
80     C4AC0 C4AC1 C4AC2 C4AC3 C4AC4 C4AC5 C4AC6 C4AC7 C4AC8 C4AC9 C4ACA C4ACB C4ACC C4ACD C4ACE C4ACF
90     C4AD0 C4AD1 C4AD2 C4AD3 C4AD4 C4AD5 C4AD6 C4AD7 C4AD8 C4AD9 C4ADA C4ADB C4ADC C4ADD C4ADE C4ADF
A0     C4AE0 C4AE1 C4AE2 C4AE3 C4AE4 C4AE5 C4AE6 C4AE7 C4AE8 C4AE9 C4AEA C4AEB C4AEC C4AED C4AEE C4AEF
B0     C4AF0 C4AF1 C4AF2 C4AF3 C4AF4 C4AF5 C4AF6 C4AF7 C4AF8 C4AF9 C4AFA C4AFB C4AFC C4AFD C4AFE C4AFF
C0-F0  (no mappings)
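
The code points in this layout follow directly from the UTF-8 bit layout: the lead byte of a 4-byte sequence contributes 3 payload bits and each trail byte contributes 6. The sketch below recomputes a row of the layout by hand and can be cross-checked against Python's built-in decoder. The helper names `decode_four_byte` and `layout_row` are illustrative, and the decoder deliberately skips the overlong and range checks a real decoder needs.

```python
PREFIX = bytes.fromhex("F384AB")  # the three leading bytes shown on this page


def decode_four_byte(seq: bytes) -> int:
    """Decode one 4-byte UTF-8 sequence to a code point by bit arithmetic.

    Simplified sketch: it checks only the byte markers, not overlong
    forms or values above U+10FFFF.
    """
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    return (
        ((seq[0] & 0x07) << 18)
        | ((seq[1] & 0x3F) << 12)
        | ((seq[2] & 0x3F) << 6)
        | (seq[3] & 0x3F)
    )


def layout_row(high_nibble: int) -> list:
    """Return the 16 code points of one layout row, or [] for an empty row."""
    if high_nibble not in (0x80, 0x90, 0xA0, 0xB0):
        return []  # final bytes outside 80..BF are not valid trail bytes
    return [decode_four_byte(PREFIX + bytes([high_nibble + col])) for col in range(16)]


if __name__ == "__main__":
    for row in (0x70, 0x80, 0x90, 0xA0, 0xB0, 0xC0):
        cells = " ".join(f"{cp:05X}" for cp in layout_row(row))
        print(f"{row:02X}  {cells or '(no mappings)'}")
```

For example, F3 84 AB 80 yields (3 << 18) | (4 << 12) | (0x2B << 6) | 0 = 0xC4AC0, the first cell of row 80.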

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
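
The minimum and maximum above are counted per UChar, which in ICU is a 16-bit UTF-16 code unit. A code point outside the BMP takes 4 UTF-8 bytes but also 2 UChars, so the per-UChar maximum of 3 is reached by BMP characters. The sketch below illustrates this with Python's standard codecs instead of ICU; the helper name `bytes_per_uchar` is illustrative only.

```python
SUBSTITUTION = b"\xEF\xBF\xBD"  # substitution character bytes listed above


def bytes_per_uchar(ch: str):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    uchars = len(ch.encode("utf-16-le")) // 2  # 2 bytes per UTF-16 code unit
    return utf8_len, uchars


if __name__ == "__main__":
    for ch in ("A", "\u20ac", "\U000C4AC0"):
        utf8_len, uchars = bytes_per_uchar(ch)
        print(f"U+{ord(ch):04X}: {utf8_len} bytes / {uchars} UChar(s)"
              f" = {utf8_len / uchars} bytes per UChar")
    # The substitution bytes are the UTF-8 encoding of U+FFFD.
    print(f"substitution -> U+{ord(SUBSTITUTION.decode('utf-8')):04X}")
```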

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
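
The set above says that every Unicode code point except the surrogates U+D800 through U+DFFF can be represented. A minimal sketch of that membership test follows, cross-checked against Python's strict UTF-8 codec rather than ICU. Both helper names are hypothetical.

```python
def in_representable_set(cp: int) -> bool:
    """Membership test for [^\\uD800-\\uDFFF] over the Unicode code space."""
    if not 0 <= cp <= 0x10FFFF:
        raise ValueError("not a Unicode code point")
    return not (0xD800 <= cp <= 0xDFFF)


def python_can_encode(cp: int) -> bool:
    """Check whether Python's strict UTF-8 codec accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False  # lone surrogates are rejected


if __name__ == "__main__":
    for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC4AC0, 0x10FFFF):
        print(f"U+{cp:04X}", in_representable_set(cp), python_can_encode(cp))
```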