Skip to contents

A long-form data set scraped from wikipedia containing a list of the languages spoken in 212 different countries. The data set distinguishes between official, regional, minority, and national status, as well as widely spoken languages. The data were scraped from: https://en.wikipedia.org/wiki/List_of_official_languages_by_country_and_territory

Usage

languages_by_country

Format

A data frame with 212 rows and 6 columns:

country_region

The country (or region)

official_language

Character vector of official language(s)

regional_language

Character vector of regional language(s)

minority_language

Character vector of minority language(s)

national_language

Character vector of national language(s)

widely_spoken

Character vector of widely spoken language(s)