International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38482

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󄂀
C4080
󄂁
C4081
󄂂
C4082
󄂃
C4083
󄂄
C4084
󄂅
C4085
󄂆
C4086
󄂇
C4087
󄂈
C4088
󄂉
C4089
󄂊
C408A
󄂋
C408B
󄂌
C408C
󄂍
C408D
󄂎
C408E
󄂏
C408F
80
90
󄂐
C4090
󄂑
C4091
󄂒
C4092
󄂓
C4093
󄂔
C4094
󄂕
C4095
󄂖
C4096
󄂗
C4097
󄂘
C4098
󄂙
C4099
󄂚
C409A
󄂛
C409B
󄂜
C409C
󄂝
C409D
󄂞
C409E
󄂟
C409F
90
A0
󄂠
C40A0
󄂡
C40A1
󄂢
C40A2
󄂣
C40A3
󄂤
C40A4
󄂥
C40A5
󄂦
C40A6
󄂧
C40A7
󄂨
C40A8
󄂩
C40A9
󄂪
C40AA
󄂫
C40AB
󄂬
C40AC
󄂭
C40AD
󄂮
C40AE
󄂯
C40AF
A0
B0
󄂰
C40B0
󄂱
C40B1
󄂲
C40B2
󄂳
C40B3
󄂴
C40B4
󄂵
C40B5
󄂶
C40B6
󄂷
C40B7
󄂸
C40B8
󄂹
C40B9
󄂺
C40BA
󄂻
C40BB
󄂼
C40BC
󄂽
C40BD
󄂾
C40BE
󄂿
C40BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]