A long-form data set scraped from Wikipedia containing the number of L1 (first-language) and L2 (second-language) speakers of 45 languages. The data set also records each language's family and branch, along with miscellaneous notes. The table was scraped from: https://en.wikipedia.org/wiki/List_of_languages_by_total_number_of_speakers

Usage

total_speakers

Format

A data frame with 45 rows and 7 columns:

language

Language name in English

family

Language family

branch

Language branch

notes

Miscellaneous notes about the language

l1_speakers

Number of L1 speakers

l2_speakers

Number of L2 speakers

total_speakers

Total number of speakers (L1 + L2)
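
Examples

The column layout above can be explored with base R alone. The sketch below is illustrative only: it builds a small stand-in data frame named `toy` with the same seven columns and made-up placeholder values (not real speaker counts), since the package that provides `total_speakers` is not named here. With the package attached, the same expressions should apply to `total_speakers` in place of `toy`; note that the data set and its last column share a name, so the column would be `total_speakers$total_speakers`.

```r
# Toy stand-in for the packaged data: same seven columns, made-up values.
toy <- data.frame(
  language       = c("Language A", "Language B", "Language C"),
  family         = c("Family X", "Family X", "Family Y"),
  branch         = c("Branch 1", "Branch 2", "Branch 3"),
  notes          = c(NA_character_, "placeholder note", NA_character_),
  l1_speakers    = c(300, 120, 40),
  l2_speakers    = c(100, 30, 10),
  total_speakers = c(400, 150, 50),
  stringsAsFactors = FALSE
)

# Check the documented relationship: total_speakers = L1 + L2.
totals_match <- toy$total_speakers == toy$l1_speakers + toy$l2_speakers

# Share of each language's speakers who are L1 speakers.
toy$l1_share <- toy$l1_speakers / toy$total_speakers

# Total speakers per language family, largest first.
by_family <- aggregate(total_speakers ~ family, data = toy, FUN = sum)
by_family <- by_family[order(-by_family$total_speakers), ]
```

Summing `total_speakers` by family counts multilingual speakers once per language they speak, so the family totals are best read as an upper bound and not as a head count.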