International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38582

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󅂀
C5080
󅂁
C5081
󅂂
C5082
󅂃
C5083
󅂄
C5084
󅂅
C5085
󅂆
C5086
󅂇
C5087
󅂈
C5088
󅂉
C5089
󅂊
C508A
󅂋
C508B
󅂌
C508C
󅂍
C508D
󅂎
C508E
󅂏
C508F
80
90
󅂐
C5090
󅂑
C5091
󅂒
C5092
󅂓
C5093
󅂔
C5094
󅂕
C5095
󅂖
C5096
󅂗
C5097
󅂘
C5098
󅂙
C5099
󅂚
C509A
󅂛
C509B
󅂜
C509C
󅂝
C509D
󅂞
C509E
󅂟
C509F
90
A0
󅂠
C50A0
󅂡
C50A1
󅂢
C50A2
󅂣
C50A3
󅂤
C50A4
󅂥
C50A5
󅂦
C50A6
󅂧
C50A7
󅂨
C50A8
󅂩
C50A9
󅂪
C50AA
󅂫
C50AB
󅂬
C50AC
󅂭
C50AD
󅂮
C50AE
󅂯
C50AF
A0
B0
󅂰
C50B0
󅂱
C50B1
󅂲
C50B2
󅂳
C50B3
󅂴
C50B4
󅂵
C50B5
󅂶
C50B6
󅂷
C50B7
󅂸
C50B8
󅂹
C50B9
󅂺
C50BA
󅂻
C50BB
󅂼
C50BC
󅂽
C50BD
󅂾
C50BE
󅂿
C50BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]