International Components for Unicode


UTF-8


List of Converter Aliases

Internal converter name: UTF-8
All aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8
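
As an illustration of how an alias table like this can be used, the sketch below maps any listed alias back to the internal converter name. The helper names (`normalize`, `canonical_name`) are hypothetical, and the loose matching rule (ignore case, hyphens, underscores and spaces) is a simplifying assumption, not ICU's actual lookup code.

```python
import re

# Aliases copied from the list above; the lookup helpers are hypothetical.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Lowercase and drop '-', '_' and spaces (a simplified matching rule)."""
    return re.sub(r"[-_ ]", "", name).lower()


# Every normalized alias points at the internal converter name.
ALIAS_TO_CANONICAL = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return 'UTF-8' if name matches a listed alias, otherwise None."""
    return ALIAS_TO_CANONICAL.get(normalize(name))


print(canonical_name("IBM_1208"))
print(canonical_name("latin-1"))
```

A real application would ask the converter library itself to resolve aliases; the table-driven version only shows the idea.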


Codepage Layout

Currently showing the codepage starting with the bytes F3 80 AB. Rows give the high nibble and columns the low nibble of the fourth byte; each cell is the resulting code point in hex.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     C0AC0 C0AC1 C0AC2 C0AC3 C0AC4 C0AC5 C0AC6 C0AC7 C0AC8 C0AC9 C0ACA C0ACB C0ACC C0ACD C0ACE C0ACF
90     C0AD0 C0AD1 C0AD2 C0AD3 C0AD4 C0AD5 C0AD6 C0AD7 C0AD8 C0AD9 C0ADA C0ADB C0ADC C0ADD C0ADE C0ADF
A0     C0AE0 C0AE1 C0AE2 C0AE3 C0AE4 C0AE5 C0AE6 C0AE7 C0AE8 C0AE9 C0AEA C0AEB C0AEC C0AED C0AEE C0AEF
B0     C0AF0 C0AF1 C0AF2 C0AF3 C0AF4 C0AF5 C0AF6 C0AF7 C0AF8 C0AF9 C0AFA C0AFB C0AFC C0AFD C0AFE C0AFF
C0-F0  (empty)
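
The layout can be reproduced with a short sketch that appends each possible fourth byte to the prefix F3 80 AB and decodes the result. It uses Python's built-in UTF-8 codec rather than ICU, and the function names are illustrative only. `manual_decode` spells out the bit arithmetic for a four-byte sequence: 3 payload bits from the lead byte and 6 from each continuation byte.

```python
PREFIX = bytes.fromhex("F380AB")  # the three lead bytes shown on this page


def decode_cell(trail: int) -> int:
    """Decode PREFIX + one trailing byte and return the code point."""
    return ord((PREFIX + bytes([trail])).decode("utf-8"))


def manual_decode(seq: bytes) -> int:
    """Decode a 4-byte UTF-8 sequence by hand (no validation)."""
    b0, b1, b2, b3 = seq
    return (
        ((b0 & 0x07) << 18)
        | ((b1 & 0x3F) << 12)
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# Only continuation bytes 0x80-0xBF are valid in the fourth position,
# which is why the other rows of the table are empty.
for row in range(0x80, 0xC0, 0x10):
    cells = [f"{decode_cell(row + col):05X}" for col in range(16)]
    print(f"{row:02X}  " + " ".join(cells))
```

Any fourth byte outside 80-BF makes the sequence ill-formed, so the built-in codec raises `UnicodeDecodeError` for it.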

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
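
The minimum and maximum above are counted per UChar, which in ICU is a 16-bit UTF-16 code unit. A supplementary character takes four UTF-8 bytes but also two UChars, so the ratio never exceeds 3. The sketch below checks this with Python's built-in codecs (not ICU); `bytes_per_uchar` is a hypothetical helper. It also confirms that the substitution bytes EF BF BD are the UTF-8 form of U+FFFD.

```python
def bytes_per_uchar(ch: str) -> float:
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


print(bytes_per_uchar("A"))           # ASCII: 1 byte, 1 code unit
print(bytes_per_uchar("\u20ac"))      # BMP: 3 bytes, 1 code unit
print(bytes_per_uchar("\U000C0AC0"))  # supplementary: 4 bytes, 2 code units

# The substitution character listed above is U+FFFD encoded in UTF-8.
print("\ufffd".encode("utf-8") == b"\xef\xbf\xbd")

# Python's 'replace' error handler is an analogous mechanism, not ICU's:
# an invalid byte decodes to U+FFFD.
print(b"\xff".decode("utf-8", errors="replace") == "\ufffd")
```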


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
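
The set above says that every code point except the surrogate range U+D800 to U+DFFF can be encoded. A minimal sketch of that membership test, compared against Python's own UTF-8 encoder (the function names are illustrative, and Python stands in for ICU here):

```python
def is_representable(cp: int) -> bool:
    """True if cp is a Unicode code point outside the surrogate range."""
    return 0 <= cp <= 0x10FFFF and not (0xD800 <= cp <= 0xDFFF)


def encodes(cp: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Probe both edges of the surrogate range and the top of the code space.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    print(f"U+{cp:04X}", is_representable(cp), encodes(cp))
```

Both checks agree at the boundaries: U+D7FF and U+E000 encode, while U+D800 and U+DFFF are rejected.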