International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F3 89 96. Each cell gives the Unicode code point (hex) for the final byte formed from the row and column labels. Rows 00-70 and C0-F0 contain no entries.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  C9580 C9581 C9582 C9583 C9584 C9585 C9586 C9587 C9588 C9589 C958A C958B C958C C958D C958E C958F
90  C9590 C9591 C9592 C9593 C9594 C9595 C9596 C9597 C9598 C9599 C959A C959B C959C C959D C959E C959F
A0  C95A0 C95A1 C95A2 C95A3 C95A4 C95A5 C95A6 C95A7 C95A8 C95A9 C95AA C95AB C95AC C95AD C95AE C95AF
B0  C95B0 C95B1 C95B2 C95B3 C95B4 C95B5 C95B6 C95B7 C95B8 C95B9 C95BA C95BB C95BC C95BD C95BE C95BF
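
The code points in this layout can be checked by decoding each four-byte sequence directly. The following is a minimal sketch in Python, assuming the standard library's UTF-8 codec agrees with the ICU converter for these well-formed sequences. The helper name `codepage_row` is hypothetical and not part of ICU.

```python
def codepage_row(lead, row):
    """Decode lead + trail byte for the 16 trail bytes in one table row.

    Returns the decoded code points as uppercase hex strings.
    """
    cells = []
    for col in range(16):
        seq = lead + bytes([row + col])   # e.g. F3 89 96 80
        ch = seq.decode("utf-8")          # one code point per sequence
        cells.append(f"{ord(ch):04X}")
    return cells


LEAD = bytes.fromhex("F38996")  # the three leading bytes shown on this page

# Only trail bytes 80-BF complete a well-formed sequence after this prefix.
for row in (0x80, 0x90, 0xA0, 0xB0):
    print(f"{row:02X}", " ".join(codepage_row(LEAD, row)))
```

Trail bytes outside 80-BF raise `UnicodeDecodeError` in this sketch, which matches the empty rows in the layout.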

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
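
Two of these properties can be illustrated with a short sketch: the substitution bytes and the bytes-per-UChar bounds. It uses Python's standard codecs rather than ICU, and assumes that a UChar is one 16-bit UTF-16 code unit, as in ICU4C. The helper name `bytes_per_uchar` is hypothetical.

```python
def bytes_per_uchar(ch):
    """UTF-8 byte count divided by the UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# The substitution bytes listed above decode to a single code point.
SUBSTITUTION = b"\xEF\xBF\xBD".decode("utf-8")
print(f"U+{ord(SUBSTITUTION):04X}")

print(bytes_per_uchar("A"))            # ASCII: 1 byte for 1 code unit
print(bytes_per_uchar("\uFFFD"))       # BMP: 3 bytes for 1 code unit
print(bytes_per_uchar("\U000C9580"))   # supplementary: 4 bytes for 2 code units

# Python's "replace" error handler also substitutes U+FFFD for malformed input.
print(b"\xFF".decode("utf-8", errors="replace") == SUBSTITUTION)
```

A supplementary character takes four UTF-8 bytes but occupies two UTF-16 code units, so the per-UChar maximum stays at 3.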


Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
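
The set above excludes only the surrogate range U+D800-U+DFFF. As a rough check, the sketch below compares that set expression with what Python's own UTF-8 encoder accepts. It does not call ICU, and the helper names `is_representable` and `encodes_to_utf8` are hypothetical.

```python
import re

# The same set expression as above, written as a Python regular expression.
REPRESENTABLE = re.compile(r"[^\uD800-\uDFFF]")


def is_representable(ch):
    """True if the single character ch falls inside the set."""
    return REPRESENTABLE.fullmatch(ch) is not None


def encodes_to_utf8(ch):
    """True if Python's strict UTF-8 encoder accepts the character."""
    try:
        ch.encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# Boundary code points around the surrogate range, plus both ends of Unicode.
for cp in (0x0041, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    ch = chr(cp)
    print(f"U+{cp:04X}", is_representable(ch), encodes_to_utf8(ch))
```

For each code point sampled here, the two checks agree: only the lone surrogates are rejected.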