Arikah Map

Diacritic

Diacritical marks

accent

acute accent ( ˊ )
double acute accent ( ˝ )
grave accent ( ˋ )

breve ( ˘ )
caron / háček ( ˇ )
cedilla ( ¸ )
circumflex ( ˆ )
diaeresis / umlaut ( ¨ )
dot ( · )

anunaasika ( ˙ )
anusvaara (  ̣ )

hook / dấu hỏi (  ̉ )
macron ( ˉ )
ogonek ( ˛ )
ring / kroužek ( ˚ )
rough breathing / spiritus asper (  ῾ )
smooth breathing / spiritus lenis (  ᾿ )

Marks sometimes used as diacritics

apostrophe ( )
bar ( | )
colon ( : )
comma ( , )
hyphen ( ˗ )
tilde ( ˜ )
titlo (  ҃ )

A diacritical mark or diacritic, in some cases also called an accent mark, is a small sign added to a letter to alter pronunciation or to distinguish between similar words. The term derives from Greek διακριτικός (diakritikos, distinguishing). Note that diacritic is a noun and diacritical is the corresponding adjective.

A diacritical mark can appear above or below a letter, or in some other position. Its main usage is to change the phonetic value of the letter to which it is added, but it may also be used to modify the pronunciation of a whole word or syllable, like the tone marks of tonal languages, to distinguish between homographs, to make abbreviations, such as the titlo in old Slavic texts, or to change the meaning of a letter, such as denoting numerals in numeral systems like early Greek numerals.

A letter which has been modified by a diacritic may be treated as a new, individual letter, or simply as a letter-diacritic combination, in orthography and collation. This varies from language to language, and in some cases from symbol to symbol within a single language.


Contents

Types of diacritic

Marks that are sometimes diacritics, but also have other uses, are:

See also Category:Diacritics and Category:Uncommon Latin letters.

Alphabetization or collation

Main article: Collation

Different languages use different rules to put diacritic characters in alphabetical order. French treats letters with diacritical marks the same as the underlying letter for purposes of ordering and dictionaries.

The Scandinavian languages, by contrast, treat the characters with diacritics ä, ö and å as new and separate letters of the alphabet, and sort them after z. Usually ä is sorted as equal to æ (ash) and ö is sorted as equal to ø (o-slash). Also, aa, when used as an alternative spelling to å, is sorted as such. Other letters modified by diacritics are treated as variants of the underlying letter, with the exception that ü is frequently sorted as y.

Languages that treat accented letters as variants of the underlying letter usually alphabetize words with such symbols immediately after similar unmarked words. For instance, in German where two words differ only by an umlaut, the word without it is sorted first in German dictionaries (eg schon and then schön, or fallen and then fällen). However, when names are concerned (eg in phone books or in author catalogues in libraries), umlauts are often treated as combinations of the vowel with a suffixed e; Austrian phone books now treat umlauts as separate letters (immediately following the underlying letter). In Spanish, although ñ is considered a new letter different from n and placed between n and o, acute accents and diaereses are ignored in alphabetization.

For a comprehensive list of the collating orders in various languages, see Alphabets derived from the Latin.

Non-alphabetic scripts

Some non-alphabetic scripts also employ symbols that function essentially as diacritics.

Generation with computers

Modern computer technology was developed mostly in the English speaking countries, so data formats, keyboard layouts, etc. were developed with an English bias; a "simple" alphabet without diacritical marks. This has led to fears internationally that the marks and accents may become obsolete to facilitate the worldwide exchange of data. Efforts have been made to create domain names that extend further than the English alphabet: the internationalized domain names, example: "pokémon.com".

Depending on the keyboard layout, which differs amongst countries, it is more or less easy to enter letters with diacritics on computers and typewriters. Some have their own keys, some are created by first pressing the key with the diacritic mark followed by the letter to place it on. Such a key is sometimes referred to as a dead key, as it produces no output of its own, but modifies the output of the key pressed after it.

In modern Microsoft Windows operating systems, the keyboard layout US International allows one to type almost all diacritics directly: "+e gives ë, ~+o gives õ, etc. On Apple Macintosh computers, there are keyboard shortcuts for the most common diacritics; Option-e followed by a vowel places an acute accent, Option-u folowed by a vowel gives an umlat, option-c gives a cedilla, etc. Diacritics can be composed in most X Window System keyboard layouts.

On computers it is also a matter of available code pages, whether you can use certain diacritics. Unicode solves this problem by assigning every known character its own code; if this code is known most modern computer systems provide a method to input it. With Unicode it is also possible to combine diacritical marks with most characters.

Languages with letters containing diacritics

The following languages have letters which contain or are made up of diacritics.

Languages with diacritics that do not produce new letters

The following is a list of languages with letter-diacritic combinations that are not considered independent letters.

See also

The OSI basic Latin alphabet
Aa Bb Cc Dd Ee Ff Gg Hh Ii Jj Kk Ll Mm Nn Oo Pp Qq Rr Ss Tt Uu Vv Ww Xx Yy Zz
historypalaeographyderivationsdiacriticspunctuationnumeralsUnicodelist of letters

Categories


Diacritics | Punctuation | Typography

Find

Find

Find