International Components for Unicode

UTF-8

List of Converter Aliases

Internal Converter Name: UTF-8

All Aliases:
  UTF-8
  ibm-1208
  ibm-1209
  ibm-5304
  ibm-5305
  ibm-13496
  ibm-13497
  ibm-17592
  ibm-17593
  windows-65001
  cp1208
  x-UTF_8J
  unicode-1-1-utf-8
  unicode-2-0-utf-8
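
An alias list like the one above is essentially a many-to-one lookup table. The following is a minimal sketch of such a table in Python; the names `ALIASES`, `CANONICAL`, and `canonical_name` are hypothetical, and plain case-insensitive matching is a simplifying assumption, not necessarily the matching rule ICU itself applies.

```python
# Aliases copied from the list above; all of them resolve to one converter.
CANONICAL = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Assumption: compare names case-insensitively and nothing more.
_LOOKUP = {alias.lower(): CANONICAL for alias in ALIASES}


def canonical_name(name):
    """Return the internal converter name for an alias, or None if unknown."""
    return _LOOKUP.get(name.strip().lower())


print(canonical_name("CP1208"))   # UTF-8
print(canonical_name("latin-1"))  # None
```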


Codepage Layout

Currently showing the codepage starting with the bytes F189AC

Rows give the high nibble of the final byte, columns the low nibble. Each cell is the Unicode code point (hexadecimal) of the character encoded by F1 89 AC followed by that byte.

       00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80     49B00 49B01 49B02 49B03 49B04 49B05 49B06 49B07 49B08 49B09 49B0A 49B0B 49B0C 49B0D 49B0E 49B0F
90     49B10 49B11 49B12 49B13 49B14 49B15 49B16 49B17 49B18 49B19 49B1A 49B1B 49B1C 49B1D 49B1E 49B1F
A0     49B20 49B21 49B22 49B23 49B24 49B25 49B26 49B27 49B28 49B29 49B2A 49B2B 49B2C 49B2D 49B2E 49B2F
B0     49B30 49B31 49B32 49B33 49B34 49B35 49B36 49B37 49B38 49B39 49B3A 49B3B 49B3C 49B3D 49B3E 49B3F
C0-F0  (empty)
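
The code points in the layout follow directly from the UTF-8 bit layout for four-byte sequences. As a rough sketch (the function name `decode_utf8_4byte` is made up for this example, and the check is deliberately incomplete: it does not reject overlong forms or values above 10FFFF), the arithmetic can be reproduced in Python:

```python
def decode_utf8_4byte(seq):
    """Decode one 4-byte UTF-8 sequence to a code point using bit arithmetic."""
    if len(seq) != 4 or (seq[0] & 0xF8) != 0xF0:
        raise ValueError("expected 4 bytes with a lead byte of the form 11110xxx")
    if any((b & 0xC0) != 0x80 for b in seq[1:]):
        raise ValueError("trail bytes must be in the range 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (((seq[0] & 0x07) << 18) | ((seq[1] & 0x3F) << 12)
            | ((seq[2] & 0x3F) << 6) | (seq[3] & 0x3F))


prefix = bytes.fromhex("F189AC")
first = decode_utf8_4byte(prefix + b"\x80")
last = decode_utf8_4byte(prefix + b"\xBF")
print(f"{first:X}..{last:X}")  # 49B00..49B3F

# Cross-check against Python's built-in UTF-8 codec.
assert (prefix + b"\x80").decode("utf-8") == chr(first)
```

Only final bytes 80 through BF are valid trail bytes, which is why the other rows of the layout are empty.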

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
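
Several of these properties can be checked independently with Python's standard UTF-8 codec rather than ICU. The sketch below assumes that a UChar is one UTF-16 code unit (ICU's definition) and uses a hypothetical helper, `utf8_bytes_per_utf16_unit`, to relate UTF-8 length to UTF-16 length for a few sample characters.

```python
# The substitution bytes are the UTF-8 form of U+FFFD REPLACEMENT CHARACTER.
substitution = "\uFFFD".encode("utf-8")
print(" ".join(f"{b:02X}" for b in substitution))  # EF BF BD


def utf8_bytes_per_utf16_unit(ch):
    """UTF-8 byte count divided by UTF-16 code unit count for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len / utf16_units


# ASCII letter, a BMP character (euro sign), and a supplementary code point.
ratios = [utf8_bytes_per_utf16_unit(c) for c in ("A", "\u20AC", "\U00049B00")]
print(ratios)  # [1.0, 3.0, 2.0]

# ASCII compatibility: every byte 20..7E encodes to itself.
ascii_compatible = all(
    chr(c).encode("utf-8") == bytes([c]) for c in range(0x20, 0x7F)
)
print(ascii_compatible)  # True
```

A supplementary character takes four UTF-8 bytes but two UTF-16 code units, so the per-UChar maximum of 3 is reached by BMP characters such as the euro sign.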

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
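
The set above excludes only the surrogate range D800 through DFFF. As an illustration using Python's strict UTF-8 codec rather than ICU (the helper name `representable` is hypothetical), the same boundary can be observed by trying to encode each code point:

```python
def representable(code_point):
    """Return True if the code point can be encoded as strict UTF-8."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        # Python's UTF-8 codec rejects lone surrogates.
        return False


# Scan the full Unicode code space, 0 through 10FFFF.
rejected = [cp for cp in range(0x110000) if not representable(cp)]
print(f"{rejected[0]:X}..{rejected[-1]:X}, {len(rejected)} code points")
# D800..DFFF, 2048 code points
```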