Unicode: Difference between revisions

From Rosetta Code
Content added Content deleted
(Short description of UNICODE)
 
No edit summary
 
(2 intermediate revisions by 2 users not shown)
Line 1: Line 1:
[[Category:Encyclopedia]]'''Unicode''' is a mapping from characters in a ''very'' large set of languages to code points, together with a set of descriptive metadata about those code points (so that you can know whether they are alphabetic, numeric, symbolic, white-space, etc.)
{{stub}}
[[Category:Encyclopedia]]'''UNICODE''' is a mapping from characters in a ''very'' large set of languages to code points, together with a set of descriptive metadata about those code points (so that you can know whether they are alphabetic, numeric, symbolic, white-space, etc.)


A number of different mechanisms are used for mapping this to a sequence of bytes. The single most important one (because it embeds [[ASCII]] and so is easy to deploy on existing systems) is [[UTF-8]].
A number of different mechanisms are used for mapping this to a sequence of bytes. The single most important one (because it embeds [[ASCII]] and so is easy to deploy on existing systems) is [[UTF-8]].

* [[wikt:Appendix:Unicode|Appendix:Unicode]] from Wiktionary

Latest revision as of 12:42, 18 April 2018

Unicode is a mapping from characters in a very large set of languages to code points, together with a set of descriptive metadata about those code points (so that you can know whether they are alphabetic, numeric, symbolic, white-space, etc.)

A number of different mechanisms are used for mapping this to a sequence of bytes. The single most important one (because it embeds ASCII and so is easy to deploy on existing systems) is UTF-8.