Chapter 2 Data sources

This chapter gives an overview of all data sources used in this project. The multivariate analysis in the rest of the book is based solely on these data sets.


Population

Data description: Total population is based on the de facto definition of population, which counts all residents regardless of legal status or citizenship.

Link to data.


GDP

Data description: Measure of a country’s economic output, computed by adding up the monetary value (in current US$) of all finished goods and services produced within the country in a given year.

Link to data.


Human Development Index

Data description: Composite index measuring average achievement in three basic dimensions of human development: a long and healthy life, knowledge, and a decent standard of living.

Link to data.
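
As a point of reference, UNDP's methodology since the 2010 report combines the three dimensions as a geometric mean of normalized dimension indices. A sketch of that aggregation follows; the symbols $I_{\text{health}}$, $I_{\text{education}}$ and $I_{\text{income}}$ are notation assumed here, not names taken from the data set:

$$
\mathrm{HDI} = \left( I_{\text{health}} \cdot I_{\text{education}} \cdot I_{\text{income}} \right)^{1/3}
$$

Each index lies between 0 and 1, so a low score in one dimension pulls the composite down more than it would under an arithmetic mean.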

Life expectancy

Data description: Mean length of life of an actual birth cohort (all individuals born in a given year).

Link to data.

Years of schooling

Data description: Number of years of schooling that a child of school entrance age can expect to receive if prevailing patterns of age-specific enrollment rates persist throughout the child’s life.

Link to data.

Energy consumption

Data description: Energy consumption by source, country and year.

Link to data.

Country info

Data description: ISO code, region, and income level for all World Bank-recognized regions. The .csv file is in the same zip archive as the GDP data (first source).

Link to data.
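
Because the country info file ships inside the same zip archive as the GDP data, it can be read without unpacking the archive first. The following is a minimal sketch using only Python's standard library. The member names `gdp.csv` and `country_info.csv`, the column names, and the demo rows are all hypothetical stand-ins, since the actual file names in the download are not given here.

```python
import csv
import io
import zipfile


def read_csv_from_zip(zip_source, member_name):
    """Return the rows of one CSV member of a zip archive as a list of dicts."""
    with zipfile.ZipFile(zip_source) as archive:
        with archive.open(member_name) as raw:
            # utf-8-sig drops a byte-order mark if the file starts with one
            text = io.TextIOWrapper(raw, encoding="utf-8-sig", newline="")
            return list(csv.DictReader(text))


def build_demo_zip():
    """Build a tiny in-memory archive standing in for the real download.

    File names, columns, and values are made up for illustration only.
    """
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("gdp.csv", "Country Code,2020\nAAA,1.0\n")
        archive.writestr(
            "country_info.csv",
            "Country Code,Region,IncomeGroup\nAAA,Region A,High income\n",
        )
    buffer.seek(0)
    return buffer


# Read the (hypothetical) country info member from the demo archive
countries = read_csv_from_zip(build_demo_zip(), "country_info.csv")
print(countries[0]["Region"])
```

With the real download, `zip_source` would be the path to the zip file and `member_name` the actual name of the country info file, which `zipfile.ZipFile(path).namelist()` can list.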