ISO 639-3
ISO 639-3 is in process of development as an international standard for language codes. It extends the ISO 639-2 alpha-3 codes with an aim to cover all known languages. The draft version is already used by the 15th edition Ethnologue.
It is a superset of ISO 639-1 and of the individual languages in ISO 639-2. Since ISO 639-2 also includes language collections, whereas Part 3 does not, ISO 639-3 is not a superset of ISO 639-2. Where B and T codes exist in ISO 639-2, it uses the T-codes.
Examples:
| language | 639-1 | 639-2 (B/T) | type | 639-3 |
| English | en | eng | individual | eng |
| German | de | ger/deu | individual | deu |
| Arabic | ar | ara | macro | arb + several others |
| Minnan | (zh-min-nan) | individual | nan |
The draft from 2005-07-30 contains 7602 entries. The inventory of languages is based on three sources: the individual languages contained in 639-2 are the basis, this was extended by modern languages from the Ethnologue 15th edition, and by historic varieties, ancient languages and artificial languages from the Linguist List.
The status of this project is that of Final Draft International Standard (FDIS). The current draft is referred to as ISO/FDIS 639-3. As of August 2006, it is in ballot for approval as standard; in theory, it could be approved and published before the end of the year.
A transition from ISO 639-1 could be done with List of ISO 639-1 codes.
Contents |
Code space
Since the code is three-letter alphabetic, one upper bound for the number of languages that can be represented is 26 × 26 × 26 = 17576. Since ISO 639-2 defines special codes (2), a reserved range (520) and B-only codes (23), 545 codes cannot be used in part 3. Therefore a lower upper bound is 17576 - 545 = 17032.
The upper bound gets even lower if one subtracts the language collections defined in 639-2.
Macrolanguages
There are 56 languages in ISO 639-2 which the SIL considers to be “macrolanguages” in 639-3 [1].
Some of these macrolanguages had no individual language as defined by 639-3 in ISO 639-2, e.g. 'ara'. Others like 'nor' had their two individual parts ('nno', 'nob') already in 639-2.
That means some languages (e.g. 'arb') that were considered by ISO 639-2 to be dialects of one language ('ara') are now in ISO 639-3 in certain contexts considered to be individual languages themselves.
This is an attempt to deal with varieties that may be linguistically distinct from each other, but are treated by their speakers as two forms of the same language, e.g. in cases of diglossia.
For example,
- http://www.sil.org/iso639-3/documentation.asp?id=ara (Generic Arabic, 639-2)
- http://www.sil.org/iso639-3/documentation.asp?id=arb (Standard Arabic, 639-3)
See http://www.sil.org/iso639-3/macrolanguages.asp for the complete list.
Collective languages
Some ISO 639-2 codes that are commonly used for languages do not precisely represent a particular language or some related languages (as the above macrolanguages). They are regarded as collective languages (or collectives) and are excluded from ISO 639-3.
See also: ISO_639-2#Collective_languages
History
Stages:
- 2006-07-14 FDIS 50.00
See also
External links
- ISO/DIS 639-3 Registration Authority
- Linguist List - List of Ancient and Extinct Languages
- explanation by Håvard Hjulstad
- ISO 639-3 status from the ISO Website
Categories
ISO standards | ISO 639
