Language Hierarchy
Glottolog organizes languages in a hierarchical tree structure:nucl1643 for Japanese).
Language Object
Fields
| Field | Type | Description |
|---|---|---|
id | integer | Internal LangDex ID |
glottocode | string | Unique Glottolog identifier (8 chars) |
iso639_3 | string | ISO 639-3 code (3 chars, if exists) |
name | string | Primary English name |
level | enum | family, language, or dialect |
parent_glottocode | string | Parent node in hierarchy |
family | object | Top-level language family |
scripts | array | ISO 15924 script codes |
speakers | integer | Estimated speaker count |
latitude / longitude | float | Geographic center |
Language Varieties (Langvars)
LangDex also includes PanLex language varieties (langvar), which provide finer-grained distinctions than Glottolog. These are used for precise translation mapping.
language_id, enabling translations to specify exactly which variety of a language is being used.
Language Names
Language names are stored in multiple locales via CLDR data:Scripts
Languages can use multiple writing systems. LangDex tracks this via thelanguage_script table:
| Language | Scripts |
|---|---|
| Japanese | Jpan (Japanese), Hira (Hiragana), Kana (Katakana), Latn (Latin) |
| Chinese | Hans (Simplified), Hant (Traditional), Latn (Pinyin) |
| Serbian | Cyrl (Cyrillic), Latn (Latin) |
Common Queries
List all languages in a family
Search languages by name
Get language with full metadata
Data Coverage
| Level | Count |
|---|---|
| Language families | ~450 |
| Languages | ~7,000 |
| Dialects | ~20,000 |
| Language names | ~5.4M (200+ locales) |