International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases: UTF-8, ibm-1208, ibm-1209, ibm-5304, ibm-5305, ibm-13496, ibm-13497, ibm-17592, ibm-17593, windows-65001, cp1208, x-UTF_8J, unicode-1-1-utf-8, unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F38982

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00
10
20
30
40
50
60
70
80  C9080 C9081 C9082 C9083 C9084 C9085 C9086 C9087 C9088 C9089 C908A C908B C908C C908D C908E C908F
90  C9090 C9091 C9092 C9093 C9094 C9095 C9096 C9097 C9098 C9099 C909A C909B C909C C909D C909E C909F
A0  C90A0 C90A1 C90A2 C90A3 C90A4 C90A5 C90A6 C90A7 C90A8 C90A9 C90AA C90AB C90AC C90AD C90AE C90AF
B0  C90B0 C90B1 C90B2 C90B3 C90B4 C90B5 C90B6 C90B7 C90B8 C90B9 C90BA C90BB C90BC C90BD C90BE C90BF
C0
D0
E0
F0
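
The mapping from the byte prefix F38982 plus a final byte to the code points C9080 through C90BF can be checked by hand. The following is a minimal sketch, assuming the standard 4-byte UTF-8 bit layout; the function name `decode_utf8_4byte` is hypothetical and is not part of ICU.

```python
def decode_utf8_4byte(data: bytes) -> int:
    """Decode exactly one 4-byte UTF-8 sequence to its code point by hand."""
    if len(data) != 4:
        raise ValueError("expected exactly 4 bytes")
    lead, t1, t2, t3 = data
    # A 4-byte lead byte has the bit pattern 11110xxx.
    if lead & 0xF8 != 0xF0:
        raise ValueError("not a 4-byte lead byte")
    # Every trail byte has the bit pattern 10xxxxxx (80..BF).
    for trail in (t1, t2, t3):
        if trail & 0xC0 != 0x80:
            raise ValueError("invalid trail byte")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    cp = ((lead & 0x07) << 18) | ((t1 & 0x3F) << 12) | ((t2 & 0x3F) << 6) | (t3 & 0x3F)
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("overlong or out-of-range sequence")
    return cp


PREFIX = bytes.fromhex("F38982")  # the prefix shown in the layout above

first = decode_utf8_4byte(PREFIX + b"\x80")
last = decode_utf8_4byte(PREFIX + b"\xBF")
print(f"{first:X}..{last:X}")  # C9080..C90BF

# Cross-check against Python's built-in UTF-8 decoder.
assert ord((PREFIX + b"\x80").decode("utf-8")) == first
```

Only final bytes 80 through BF decode successfully, which is consistent with rows 00 through 70 and C0 through F0 being empty in the layout.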

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
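
The minimum and maximum of 1 and 3 bytes are counted per UChar, which this sketch assumes to be a 16-bit UTF-16 code unit. A supplementary character takes 4 bytes in UTF-8 but occupies 2 UChars, so it averages 2 bytes per UChar and does not raise the maximum. The helper name `utf8_bytes_and_utf16_units` and the sample characters are illustrative choices, not taken from ICU.

```python
def utf8_bytes_and_utf16_units(ch: str) -> tuple[int, int]:
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len, utf16_units


samples = ["A", "\u00e9", "\u20ac", "\U0001F600"]
for ch in samples:
    n_bytes, n_units = utf8_bytes_and_utf16_units(ch)
    print(f"U+{ord(ch):04X}: {n_bytes} bytes / {n_units} UChar(s)")

# The substitution character listed above is U+FFFD encoded in UTF-8.
assert "\ufffd".encode("utf-8") == b"\xef\xbf\xbd"
```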

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
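
The set above excludes only the surrogate range D800 through DFFF. As a rough check, the sketch below uses Python's strict UTF-8 codec as a stand-in for the ICU converter (an assumption; the two are separate implementations), and the function name `is_representable` is hypothetical.

```python
def is_representable(cp: int) -> bool:
    """Return True if code point cp encodes under Python's strict UTF-8 codec."""
    try:
        chr(cp).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Lone surrogates (D800..DFFF) are rejected by the strict codec.
        return False


# Boundary code points around the excluded surrogate range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC9080, 0x10FFFF):
    print(f"U+{cp:04X}: {is_representable(cp)}")
```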