Unicode: Difference between revisions

Content added Content deleted

Inline

Revision as of 18:14, 3 August 2011

This page is a stub. It needs more information! You can help Rosetta Code by filling it in!

Unicode is a mapping from characters in a very large set of languages to code points, together with a set of descriptive metadata about those code points (so that you can know whether they are alphabetic, numeric, symbolic, white-space, etc.)

A number of different mechanisms are used for mapping this to a sequence of bytes. The single most important one (because it embeds ASCII and so is easy to deploy on existing systems) is UTF-8.

Appendix:Unicode from Wiktionary

@@ Line 1: / Line 1: @@
 {{stub}}
-[[Category:Encyclopedia]]'''UNICODE''' is a mapping from characters in a ''very'' large set of languages to code points, together with a set of descriptive metadata about those code points (so that you can know whether they are alphabetic, numeric, symbolic, white-space, etc.)
+[[Category:Encyclopedia]]'''Unicode''' is a mapping from characters in a ''very'' large set of languages to code points, together with a set of descriptive metadata about those code points (so that you can know whether they are alphabetic, numeric, symbolic, white-space, etc.)
 A number of different mechanisms are used for mapping this to a sequence of bytes. The single most important one (because it embeds [[ASCII]] and so is easy to deploy on existing systems) is [[UTF-8]].