A dataset containes the original catalogue of languages of the world involving genealogical affiliation, macro-area, country, iso code, and coordinates.
glottolog
A data frame with 25900 rows and 10 variables:
languoid code from Glottolog 4.4
name of the language
code based on ISO 639--3 https://iso639-3.sil.org/
languoid type: dialect or language (possible values are dialect, language, family, bookkeeping, pseudo family, sign language, unclassifiable, pidgin, unattested, artificial language, speech register, mixed language)
have six values Africa, Australia, Eurasia, North America, Papunesia, South America
latitude
longitude
list of countries, where the language is spoken
genealogical affiliation
subclassification in a Newick format
Hammarstr<U+00F6>m, Harald & Forkel, Robert & Haspelmath, Martin & Bank, Sebastian. 2021.Glottolog 4.4. Leipzig: Max Planck Institute for Evolutionary Anthropology. https://doi.org/10.5281/zenodo.4761960 (Available online at http://glottolog.org, Accessed on 2021-05-15.)