International Components for Unicode


UTF-8


List of Converter Aliases

Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
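
As a rough illustration of how the alias list is used, the sketch below maps any listed alias back to the internal converter name. The helper `resolve_alias` and the case-insensitive exact-match rule are assumptions made for this example; they are not ICU's own alias-matching logic, which may normalize names differently.

```python
# Aliases copied from the list above; the lookup helper is hypothetical.
CANONICAL_NAME = "UTF-8"
ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]

# Case-insensitive index from alias to internal converter name
# (assumed matching rule, chosen only to keep the sketch small).
_ALIAS_INDEX = {alias.lower(): CANONICAL_NAME for alias in ALIASES}


def resolve_alias(name):
    """Return the internal converter name for a listed alias, or None."""
    return _ALIAS_INDEX.get(name.strip().lower())


if __name__ == "__main__":
    print(resolve_alias("IBM-1208"))
    print(resolve_alias("latin-1"))
```

A name that is not in the list resolves to `None` here, so callers can tell an unknown alias apart from a known one.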


Codepage Layout

Currently showing the codepage starting with the bytes F3 87 8E

Rows and columns together give the final byte (row + column); each cell is the Unicode code point in hexadecimal. Rows 00 through 70 and C0 through F0 are empty.

    00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
80  C7380 C7381 C7382 C7383 C7384 C7385 C7386 C7387 C7388 C7389 C738A C738B C738C C738D C738E C738F
90  C7390 C7391 C7392 C7393 C7394 C7395 C7396 C7397 C7398 C7399 C739A C739B C739C C739D C739E C739F
A0  C73A0 C73A1 C73A2 C73A3 C73A4 C73A5 C73A6 C73A7 C73A8 C73A9 C73AA C73AB C73AC C73AD C73AE C73AF
B0  C73B0 C73B1 C73B2 C73B3 C73B4 C73B5 C73B6 C73B7 C73B8 C73B9 C73BA C73BB C73BC C73BD C73BE C73BF
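
The mapping in the layout follows from the standard UTF-8 bit pattern for four-byte sequences: 3 payload bits in the lead byte and 6 in each trail byte. The sketch below recomputes one row of the layout from the prefix F3 87 8E. The function names are hypothetical, and the decoder is deliberately minimal: it checks only the byte patterns, not overlong forms or the U+10FFFF upper limit.

```python
def decode_utf8_4byte(data):
    """Decode one four-byte UTF-8 sequence into a code point (int)."""
    if len(data) != 4:
        raise ValueError("expected exactly four bytes")
    lead, t1, t2, t3 = data
    if lead & 0xF8 != 0xF0:
        raise ValueError("not a four-byte lead byte")
    for trail in (t1, t2, t3):
        if trail & 0xC0 != 0x80:
            raise ValueError("trail byte outside 80..BF")
    # 3 payload bits from the lead byte, 6 from each trail byte.
    return (
        ((lead & 0x07) << 18)
        | ((t1 & 0x3F) << 12)
        | ((t2 & 0x3F) << 6)
        | (t3 & 0x3F)
    )


def layout_row(prefix, row):
    """Code points for final bytes row..row+0x0F after a three-byte prefix."""
    return [decode_utf8_4byte(prefix + bytes([row + col])) for col in range(16)]


if __name__ == "__main__":
    prefix = bytes.fromhex("F3878E")
    for row in (0x80, 0x90, 0xA0, 0xB0):
        print(f"{row:02X}", " ".join(f"{cp:05X}" for cp in layout_row(prefix, row)))
```

Only final bytes 80 through BF are valid trail bytes, which is why the other rows of the layout are empty.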

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
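
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit, not per character. A supplementary character takes four UTF-8 bytes but also two UChars, so the per-UChar maximum stays at 3. The sketch below checks this with Python's built-in codecs rather than ICU itself; `bytes_per_uchar` is a hypothetical helper, and the replacement behavior shown is Python's decoder, used here only to illustrate the substitution bytes.

```python
def bytes_per_uchar(ch):
    """Return (UTF-8 byte count, UTF-16 code unit count) for one character."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2
    return utf8_len, utf16_units


# The substitution character bytes listed above are U+FFFD encoded in UTF-8.
SUBSTITUTION = b"\xEF\xBF\xBD"

if __name__ == "__main__":
    for ch in ("A", "\u20ac", chr(0xC7380)):
        utf8_len, units = bytes_per_uchar(ch)
        print(f"U+{ord(ch):04X}: {utf8_len} bytes, {units} UChar(s)")
    print(SUBSTITUTION.decode("utf-8") == "\ufffd")
```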

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
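
The set above reads as "every code point except the surrogate range D800 to DFFF". A minimal sketch of that membership test follows, cross-checked against Python's own UTF-8 encoder as a stand-in for the converter. Both function names are hypothetical, and the 0 to 10FFFF bound is the Unicode code space limit, added here as an assumption since the set notation leaves it implicit.

```python
def is_representable(code_point):
    """True if the code point is a valid non-surrogate Unicode code point."""
    in_code_space = 0 <= code_point <= 0x10FFFF
    is_surrogate = 0xD800 <= code_point <= 0xDFFF
    return in_code_space and not is_surrogate


def python_can_encode(code_point):
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


if __name__ == "__main__":
    for cp in (0x41, 0xD7FF, 0xD800, 0xDFFF, 0xE000, 0xC7380, 0x10FFFF):
        print(f"U+{cp:04X}", is_representable(cp), python_can_encode(cp))
```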