International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F380AC

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󀬀
C0B00
󀬁
C0B01
󀬂
C0B02
󀬃
C0B03
󀬄
C0B04
󀬅
C0B05
󀬆
C0B06
󀬇
C0B07
󀬈
C0B08
󀬉
C0B09
󀬊
C0B0A
󀬋
C0B0B
󀬌
C0B0C
󀬍
C0B0D
󀬎
C0B0E
󀬏
C0B0F
80
90
󀬐
C0B10
󀬑
C0B11
󀬒
C0B12
󀬓
C0B13
󀬔
C0B14
󀬕
C0B15
󀬖
C0B16
󀬗
C0B17
󀬘
C0B18
󀬙
C0B19
󀬚
C0B1A
󀬛
C0B1B
󀬜
C0B1C
󀬝
C0B1D
󀬞
C0B1E
󀬟
C0B1F
90
A0
󀬠
C0B20
󀬡
C0B21
󀬢
C0B22
󀬣
C0B23
󀬤
C0B24
󀬥
C0B25
󀬦
C0B26
󀬧
C0B27
󀬨
C0B28
󀬩
C0B29
󀬪
C0B2A
󀬫
C0B2B
󀬬
C0B2C
󀬭
C0B2D
󀬮
C0B2E
󀬯
C0B2F
A0
B0
󀬰
C0B30
󀬱
C0B31
󀬲
C0B32
󀬳
C0B33
󀬴
C0B34
󀬵
C0B35
󀬶
C0B36
󀬷
C0B37
󀬸
C0B38
󀬹
C0B39
󀬺
C0B3A
󀬻
C0B3B
󀬼
C0B3C
󀬽
C0B3D
󀬾
C0B3E
󀬿
C0B3F
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]