Skip to contents

A long-form data set scraped from wikipedia containing an information about 599 languages. The data were scraped from: https://en.wikipedia.org/wiki/List_of_language_names

Usage

language_names

Format

A data frame with 599 rows and 4 columns:

language_en

Name of the language (in English)

language

Name of language (in native language, characters)

notes

Additional information regarding official status, where spoken/used, etc.

is_extinct

Logical vector (TRUE = extinct)