Over a thousand characters from the Latin script are encoded in the Unicode Standard, grouped in several basic and extended Latin blocks. The extended ranges contain mainly precomposed letters plus diacritics that are equivalently encoded with combining diacritics, as well as some ligatures and distinct letters, used for example in the orthographies of various African languages (including click ...
HTML and XML provide ways to reference Unicode characters when the characters themselves either cannot or should not be used. A numeric character reference refers to a character by its Universal Character Set/Unicode code point, and a character entity reference refers to a character by a predefined name.
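The two kinds of reference described above can be sketched with Python's standard `html` module. This is a minimal illustration, not a complete serializer: the helper names `numeric_reference` and `entity_reference` are hypothetical, and `html.entities.codepoint2name` only covers the entity names defined by HTML 4.

```python
import html
from html.entities import codepoint2name


def numeric_reference(ch: str, hexadecimal: bool = True) -> str:
    """Build a numeric character reference from the character's code point."""
    cp = ord(ch)
    return f"&#x{cp:X};" if hexadecimal else f"&#{cp};"


def entity_reference(ch: str):
    """Build a character entity reference, or return None if the
    character has no predefined HTML 4 entity name."""
    name = codepoint2name.get(ord(ch))
    return f"&{name};" if name else None


e_acute = "\u00e9"  # LATIN SMALL LETTER E WITH ACUTE

print(numeric_reference(e_acute))                      # &#xE9;
print(numeric_reference(e_acute, hexadecimal=False))   # &#233;
print(entity_reference(e_acute))                       # &eacute;

# All three spellings decode back to the same character.
assert html.unescape("&#xE9;") == html.unescape("&#233;") == html.unescape("&eacute;") == e_acute
```

A numeric reference works for any code point, whereas an entity reference exists only for characters that have been given a name, which is why `entity_reference` can return `None`.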
The definition of a Latin-script letter for this list is a character encoded in the Unicode Standard that has a script property of 'Latin' and the general category of 'Letter'. An overview of the distribution of Latin-script letters in Unicode is given in Latin script in Unicode.
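As a rough illustration of that definition: Python's standard `unicodedata` module exposes the general category but not the script property, so the sketch below substitutes a heuristic and checks whether the character's name contains the word "LATIN". That substitution is an assumption made for this example, not the Unicode definition; it misses Latin-script letters whose names do not mention "LATIN" (for example U+212A KELVIN SIGN). The function name `is_latin_letter_approx` is hypothetical.

```python
import unicodedata


def is_latin_letter_approx(ch: str) -> bool:
    """Approximate test for a Latin-script letter.

    General category: exact (any category starting with 'L').
    Script property: approximated via the character name, because
    the standard library does not expose Unicode script data.
    """
    is_letter = unicodedata.category(ch).startswith("L")
    name = unicodedata.name(ch, "")  # "" for unnamed code points
    return is_letter and "LATIN" in name


for ch in ["A", "\u00e9", "\u03b1", "1"]:
    print(f"U+{ord(ch):04X}", is_latin_letter_approx(ch))
```

An exact test would need real script data, for instance from the Unicode Character Database file `Scripts.txt`.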
The Basic Latin Unicode block,[3] sometimes informally called C0 Controls and Basic Latin,[4] is the first block of the Unicode Standard, and the only block whose characters are each encoded in one byte in UTF-8. The block contains all the letters and control codes of the ASCII encoding.
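The one-byte property can be checked directly. The sketch below assumes the Basic Latin block spans U+0000 to U+007F (the ASCII range) and uses only Python's built-in string encoding; the helper name `utf8_length` is hypothetical.

```python
def utf8_length(code_point: int) -> int:
    """Number of bytes UTF-8 uses to encode a single code point."""
    return len(chr(code_point).encode("utf-8"))


BASIC_LATIN = range(0x0000, 0x0080)  # assumed block range, U+0000..U+007F

# Every Basic Latin code point takes exactly one byte in UTF-8 ...
assert all(utf8_length(cp) == 1 for cp in BASIC_LATIN)

# ... and that byte is identical to the ASCII encoding.
assert all(chr(cp).encode("utf-8") == chr(cp).encode("ascii") for cp in BASIC_LATIN)

# The first code point after the block already needs two bytes.
print(utf8_length(0x0080))  # 2
```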
The most common combining characters in the Latin script are the combining diacritical marks (including combining accents). Unicode also contains many precomposed characters, so that in many cases it is possible to use both combining diacritics and precomposed characters, at the user's or application's choice.
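The equivalence between the two spellings can be seen with Unicode normalization. The following sketch uses `unicodedata.normalize` from the Python standard library; the choice of "é" as the sample letter is only for illustration.

```python
import unicodedata

precomposed = "\u00e9"   # LATIN SMALL LETTER E WITH ACUTE
combining = "e\u0301"    # "e" followed by COMBINING ACUTE ACCENT

# The two spellings are different code point sequences ...
assert precomposed != combining
assert len(precomposed) == 1 and len(combining) == 2

# ... but they are canonically equivalent: NFC composes, NFD decomposes.
assert unicodedata.normalize("NFC", combining) == precomposed
assert unicodedata.normalize("NFD", precomposed) == combining

# A nonzero combining class marks U+0301 as a combining character.
print(unicodedata.combining("\u0301"))  # 230
```

Normalizing both sides to the same form before comparing strings avoids treating the two spellings as different text.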
The Latin-1 Supplement (also called C1 Controls and Latin-1 Supplement) is the second block of the Unicode Standard. It encodes the upper range of ISO 8859-1: 80 (U+0080) to FF (U+00FF). The C1 Controls (U+0080–U+009F) are not graphic characters.
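The correspondence with ISO 8859-1 and the non-graphic status of the C1 range can be checked with a short sketch. It relies on Python's `latin-1` codec (its name for ISO 8859-1) and on `unicodedata.category`, where `Cc` denotes a control character.

```python
import unicodedata

# Each ISO 8859-1 byte in the upper range maps to the code point
# with the same numeric value.
for byte in range(0x80, 0x100):
    assert bytes([byte]).decode("latin-1") == chr(byte)

# U+0080..U+009F are the C1 controls: general category Cc.
assert all(unicodedata.category(chr(cp)) == "Cc" for cp in range(0x80, 0xA0))

# From U+00A0 onward the block holds no control characters.
assert all(unicodedata.category(chr(cp)) != "Cc" for cp in range(0xA0, 0x100))

print("\u00e9".encode("latin-1"))  # b'\xe9'
```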
The lists and tables below summarize and compare the letter inventories of some of the Latin-script alphabets. In this article, the scope of the word "alphabet" is broadened to include letters with tone marks, and other diacritics used to represent a wide range of orthographic traditions, without regard to whether or how they are sequenced in their alphabet or the table.
The term Latin alphabet may refer to either the alphabet used to write Latin (as described in this article) or other alphabets based on the Latin script, which is the basic set of letters common to the various alphabets descended from the classical Latin alphabet, such as the English alphabet.