International Components for Unicode


UTF-8


List of Converter Aliases
Internal Converter Name: UTF-8
All Aliases:
  · UTF-8
  · ibm-1208
  · ibm-1209
  · ibm-5304
  · ibm-5305
  · ibm-13496
  · ibm-13497
  · ibm-17592
  · ibm-17593
  · windows-65001
  · cp1208
  · x-UTF_8J
  · unicode-1-1-utf-8
  · unicode-2-0-utf-8
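
Since every alias above names the same converter, a lookup table can map any of them back to the internal name. The following is a minimal Python sketch, not ICU's own lookup code. The helper names (`normalize`, `canonical_name`) are hypothetical, and the loose matching rule (ignore case, hyphens, underscores, and spaces) is an assumption made for illustration.

```python
import re

# Aliases copied from the list above; they all name the same converter.
UTF8_ALIASES = [
    "UTF-8", "ibm-1208", "ibm-1209", "ibm-5304", "ibm-5305",
    "ibm-13496", "ibm-13497", "ibm-17592", "ibm-17593",
    "windows-65001", "cp1208", "x-UTF_8J",
    "unicode-1-1-utf-8", "unicode-2-0-utf-8",
]


def normalize(name: str) -> str:
    """Assumed loose matching: drop '-', '_' and spaces, then lowercase."""
    return re.sub(r"[-_ ]", "", name).lower()


# Map each normalized alias to the internal converter name.
ALIAS_TABLE = {normalize(alias): "UTF-8" for alias in UTF8_ALIASES}


def canonical_name(name: str):
    """Return the internal converter name for an alias, or None if unknown."""
    return ALIAS_TABLE.get(normalize(name))
```

With this table, `canonical_name("IBM_1208")` and `canonical_name("cp1208")` both resolve to `"UTF-8"`, while a name that is not in the list returns `None`.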


Codepage Layout

Currently showing the codepage starting with the bytes F3 85 97

Rows give the high nibble of the final byte and columns the low nibble. Each cell shows the Unicode code point (hex) for that four-byte sequence.

     00    01    02    03    04    05    06    07    08    09    0A    0B    0C    0D    0E    0F
00-70  (empty)
80  C55C0 C55C1 C55C2 C55C3 C55C4 C55C5 C55C6 C55C7 C55C8 C55C9 C55CA C55CB C55CC C55CD C55CE C55CF
90  C55D0 C55D1 C55D2 C55D3 C55D4 C55D5 C55D6 C55D7 C55D8 C55D9 C55DA C55DB C55DC C55DD C55DE C55DF
A0  C55E0 C55E1 C55E2 C55E3 C55E4 C55E5 C55E6 C55E7 C55E8 C55E9 C55EA C55EB C55EC C55ED C55EE C55EF
B0  C55F0 C55F1 C55F2 C55F3 C55F4 C55F5 C55F6 C55F7 C55F8 C55F9 C55FA C55FB C55FC C55FD C55FE C55FF
C0-F0  (empty)
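
The code points in the layout follow directly from the UTF-8 bit packing: the lead byte contributes 3 payload bits and each of the three following bytes contributes 6. The sketch below reproduces one row of the layout in Python; the helper names `decode_four_byte` and `row` are hypothetical, and the function deliberately skips the validation a real decoder performs.

```python
def decode_four_byte(b0: int, b1: int, b2: int, b3: int) -> int:
    """Combine a 4-byte UTF-8 sequence into a code point (no validation)."""
    return (
        ((b0 & 0x07) << 18)    # 3 payload bits from the lead byte
        | ((b1 & 0x3F) << 12)  # 6 payload bits from each continuation byte
        | ((b2 & 0x3F) << 6)
        | (b3 & 0x3F)
    )


# The three leading bytes this codepage view starts with.
LEAD = (0xF3, 0x85, 0x97)


def row(first_trail: int) -> list:
    """Code points for the 16 final bytes first_trail .. first_trail + 15."""
    return [decode_four_byte(*LEAD, first_trail + i) for i in range(16)]


# Cross-check one cell against Python's built-in UTF-8 decoder.
assert bytes([0xF3, 0x85, 0x97, 0x80]).decode("utf-8") == chr(0xC55C0)
```

Calling `row(0x80)` yields the code points C55C0 through C55CF. Final bytes outside 80-BF are not valid continuation bytes, which is why the other rows of the layout are empty.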

Information About This Converter
Type of converter: UCNV_UTF8
Minimum number of bytes per UChar: 1
Maximum number of bytes per UChar: 3
Substitution character: \xEF\xBF\xBD
Is ASCII [\x20-\x7E] compatible? TRUE
Is ASCII [\u0020-\u007E] ambiguous? FALSE
Contains ambiguous aliases? FALSE
Always generates Unicode NFC? UNKNOWN
Contains BiDi characters? TRUE
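
The byte counts above are per UChar, which in ICU is a 16-bit UTF-16 code unit. A supplementary character takes 4 bytes in UTF-8 but occupies 2 UChars, so the ratio never exceeds 3. The Python sketch below checks these figures with the standard codecs; `bytes_per_utf16_unit` is a hypothetical helper written for this illustration, not an ICU function.

```python
def bytes_per_utf16_unit(ch: str) -> float:
    """UTF-8 byte count of one character divided by its UTF-16 unit count."""
    utf8_len = len(ch.encode("utf-8"))
    utf16_units = len(ch.encode("utf-16-le")) // 2  # 2 bytes per code unit
    return utf8_len / utf16_units


# The substitution character bytes are the UTF-8 form of U+FFFD.
SUBSTITUTION = "\ufffd".encode("utf-8")
assert SUBSTITUTION == b"\xef\xbf\xbd"
```

For example, `bytes_per_utf16_unit("A")` gives the minimum of 1, a BMP character such as U+20AC gives the maximum of 3, and a supplementary character such as U+C55C0 gives 2 (4 bytes over 2 code units).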

Set of Unicode Characters Representable By This Codepage

[^\uD800-\uDFFF]
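
The set above says that every code point except the surrogates U+D800 through U+DFFF can be represented. A short Python sketch of that rule follows, with a cross-check against Python's strict UTF-8 encoder; both function names are hypothetical and this is not how ICU computes the set.

```python
def is_representable(code_point: int) -> bool:
    """True if the code point lies outside the surrogate range."""
    return not (0xD800 <= code_point <= 0xDFFF)


def encodes_cleanly(code_point: int) -> bool:
    """True if Python's strict UTF-8 encoder accepts the code point."""
    try:
        chr(code_point).encode("utf-8")
        return True
    except UnicodeEncodeError:
        return False


# The two checks agree on the boundaries of the excluded range.
for cp in (0xD7FF, 0xD800, 0xDFFF, 0xE000, 0x10FFFF):
    assert is_representable(cp) == encodes_cleanly(cp)
```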