International Components for Unicode

ICU Home
  · ICU Home
ICU4C Demos
  · Converter Explorer
  · Collation Demo
  · Segments
  · IDNA
  · Locale Explorer
  · Normalization Browser
  · Regular Expressions
  · String Compare
  · Transforms
  · Unicode Browser
ICU4J Demos
  · Demo Page
Tools
 

Related Websites

Unicode Consortium

Common Locale Data

 

 
ICU  >  Demo  >  Converter Explorer  > 

UTF-8

Select a standard to view:











Related Topics
 
 · Converter Explorer Help 
 · ICU Charset Information 
 

List of Converter Aliases
Internal
Converter Name
All Aliases
UTF-8 UTF-8
ibm-1208
ibm-1209
ibm-5304
ibm-5305
ibm-13496
ibm-13497
ibm-17592
ibm-17593
windows-65001
cp1208
x-UTF_8J
unicode-1-1-utf-8
unicode-2-0-utf-8


Codepage Layout

Currently showing the codepage starting with the bytes F381A6

  000102030405060708090A0B0C0D0E0F 
00                                 00
10                                 10
20                                 20
30                                 30
40                                 40
50                                 50
60                                 60
70                                 70
80
󁦀
C1980
󁦁
C1981
󁦂
C1982
󁦃
C1983
󁦄
C1984
󁦅
C1985
󁦆
C1986
󁦇
C1987
󁦈
C1988
󁦉
C1989
󁦊
C198A
󁦋
C198B
󁦌
C198C
󁦍
C198D
󁦎
C198E
󁦏
C198F
80
90
󁦐
C1990
󁦑
C1991
󁦒
C1992
󁦓
C1993
󁦔
C1994
󁦕
C1995
󁦖
C1996
󁦗
C1997
󁦘
C1998
󁦙
C1999
󁦚
C199A
󁦛
C199B
󁦜
C199C
󁦝
C199D
󁦞
C199E
󁦟
C199F
90
A0
󁦠
C19A0
󁦡
C19A1
󁦢
C19A2
󁦣
C19A3
󁦤
C19A4
󁦥
C19A5
󁦦
C19A6
󁦧
C19A7
󁦨
C19A8
󁦩
C19A9
󁦪
C19AA
󁦫
C19AB
󁦬
C19AC
󁦭
C19AD
󁦮
C19AE
󁦯
C19AF
A0
B0
󁦰
C19B0
󁦱
C19B1
󁦲
C19B2
󁦳
C19B3
󁦴
C19B4
󁦵
C19B5
󁦶
C19B6
󁦷
C19B7
󁦸
C19B8
󁦹
C19B9
󁦺
C19BA
󁦻
C19BB
󁦼
C19BC
󁦽
C19BD
󁦾
C19BE
󁦿
C19BF
B0
C0                                 C0
D0                                 D0
E0                                 E0
F0                                 F0
  000102030405060708090A0B0C0D0E0F 

Information About This Converter
Type of converterUCNV_UTF8
Minimum number of bytes per UChar1
Maximum number of bytes per UChar3
Substitution character\xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible?TRUE
Is ASCII [\u0020-\u007E] ambiguous?FALSE
Contains ambiguous aliases?FALSE
Always generates Unicode NFC?UNKNOWN
Contains BiDi characters?TRUE

List of Languages Representable By This Codepage
View Complete Set...

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]