International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3A99E

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󩞀
E9780
󩞁
E9781
󩞂
E9782
󩞃
E9783
󩞄
E9784
󩞅
E9785
󩞆
E9786
󩞇
E9787
󩞈
E9788
󩞉
E9789
󩞊
E978A
󩞋
E978B
󩞌
E978C
󩞍
E978D
󩞎
E978E
󩞏
E978F
80
90
󩞐
E9790
󩞑
E9791
󩞒
E9792
󩞓
E9793
󩞔
E9794
󩞕
E9795
󩞖
E9796
󩞗
E9797
󩞘
E9798
󩞙
E9799
󩞚
E979A
󩞛
E979B
󩞜
E979C
󩞝
E979D
󩞞
E979E
󩞟
E979F
90
A0
󩞠
E97A0
󩞡
E97A1
󩞢
E97A2
󩞣
E97A3
󩞤
E97A4
󩞥
E97A5
󩞦
E97A6
󩞧
E97A7
󩞨
E97A8
󩞩
E97A9
󩞪
E97AA
󩞫
E97AB
󩞬
E97AC
󩞭
E97AD
󩞮
E97AE
󩞯
E97AF
A0
B0
󩞰
E97B0
󩞱
E97B1
󩞲
E97B2
󩞳
E97B3
󩞴
E97B4
󩞵
E97B5
󩞶
E97B6
󩞷
E97B7
󩞸
E97B8
󩞹
E97B9
󩞺
E97BA
󩞻
E97BB
󩞼
E97BC
󩞽
E97BD
󩞾
E97BE
󩞿
E97BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]