Skip to content

blakeisaac1993/energy-data

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data on Energy by Our World in Data

Our complete Energy dataset is a collection of key metrics maintained by Our World in Data. It is updated regularly and includes data on energy consumption (primary energy, per capita, and growth rates), energy mix, electricity mix and other relevant metrics.

The complete Our World in Data Energy dataset

🗂️ Download our complete Energy dataset : CSV | XLSX | JSON

The CSV and XLSX files follow a format of 1 row per location and year. The JSON version is split by country, with an array of yearly records.

The variables represent all of our main data related to energy consumption, energy mix, electricity mix as well as other variables of potential interest.

We will continue to publish updated data on energy as it becomes available. Most metrics are published on an annual basis.

A full codebook is made available, with a description and source for each variable in the dataset. This codebook is also included as an additional sheet in the XLSX file.

Our source data and code

The dataset is built upon a number of datasets and processing steps:

Additionally, to construct region aggregates and variables per capita and per GDP, we use the following datasets and processing steps:

Changelog

  • On January 24, 2024:
    • Improved codebook, to clarify whether indicators refer to electricity generation or primary energy consumption.
    • Improved the calculation of the share of electricity in primary energy. Previously, electricity generation was calculated as a share of input-equivalent primary energy consumption. Now it is calculated as a share of direct primary energy consumption.
  • On December 12, 2023:
    • Updated Ember's Yearly electricity data and EIA's International energy data.
    • Enhanced codebook (improved descriptions, added units, updated sources).
    • Fixed various minor issues.
  • On July 7, 2023:
    • Replaced BP's data by the new Energy Institute Statistical Review of World Energy 2023.
    • Updated Ember's yearly electricity data.
    • Updated all datasets accordingly.
  • On June 1, 2023:
    • Updated Ember's yearly electricity data.
    • Renamed countries 'East Timor' and 'Faroe Islands', and added 'Middle East (Ember)'.
    • Population and per capita variables are now calculated using an updated version of our population dataset.
  • On March 1, 2023:
    • Updated Ember's yearly electricity data and fixed some minor issues.
  • On December 30, 2022:
    • Fixed some minor issues with BP's dataset. Regions like "Other North America (BP)" have been removed from the data, since, in the original Statistical Review of World Energy, these regions represented different sets of countries for different variables.
  • On December 16, 2022:
    • The column electricity_share_energy (electricity as a share of primary energy) was added to the dataset.
    • Fixed some minor inconsistencies in electricity data between Ember and BP, by prioritizing data from Ember.
    • Updated Ember's yearly electricity data.
  • On August 9, 2022:
    • All inconsistencies due to different definitions of regions among different datasets (especially Europe) have been fixed.
      • Now all regions follow Our World in Data's definitions.
      • We also include data for regions as defined in the original datasets; for example, Europe (BP) corresponds to Europe as defined by BP.
    • All data processing now occurs outside this repository; the code has been migrated to be part of the etl repository.
    • Variable fossil_cons_per_capita has been renamed fossil_elec_per_capita for consistency, since it corresponds to electricity generation.
    • The codebook has been updated following these changes.
  • On April 8, 2022:
    • Electricity data from Ember was updated (using the Global Electricity Review 2022).
    • Data on greenhouse-gas emissions in electricity generation was added (greenhouse_gas_emissions).
    • Data on emissions intensity is now provided for most countries in the world.
  • On March 25, 2022:
    • Data on net electricity imports and electricity demand was added.
    • BP data was updated (using the Statistical Review of the World Energy 2021).
    • Maddison data on GDP was updated (using the Maddison Project Database 2020).
    • EIA data on primary energy consumption was included in the dataset.
    • Some issues in the dataset were corrected (for example some missing data in production by fossil fuels).
  • On February 14, 2022:
    • Some issues were corrected in the electricity data, and the energy dataset was updated accordingly.
    • The json and xlsx dataset files were removed from GitHub in favor of an external storage service, to keep this repository at a reasonable size.
    • The carbon_intensity_elec column was added back into the energy dataset.
  • On February 3, 2022, we updated the Ember global electricity data, combined with the European Electricity Review from Ember.
    • The carbon_intensity_elec column was removed from the energy dataset (since no updated data was available).
    • Columns for electricity from other renewable sources excluding bioenergy were added (namely other_renewables_elec_per_capita_exc_biofuel, and other_renewables_share_elec_exc_biofuel).
    • Certain countries and regions have been removed from the dataset, because we identified significant inconsistencies in the original data.
  • On March 31, 2021, we updated 2020 electricity mix data.
  • On September 9, 2020, the first version of this dataset was made available.

Data alterations

  • We standardize names of countries and regions. Since the names of countries and regions are different in different data sources, we harmonize all names to the Our World in Data standard entity names.
  • We create aggregate data for regions (e.g. Africa, Europe, etc.). Since regions are defined differently by our sources, we create our own aggregates following Our World in Data region definitions.
    • We also include data for regions as defined in the original datasets; for example, Europe (EI) corresponds to Europe as defined by the Energy Institute.
  • We recalculate primary energy in terawatt-hours. The primary data sources on energy—the Energy Institute Statistical review of world energy, for example—typically report consumption in terms of exajoules. We have recalculated these figures as terawatt-hours using a conversion factor of 277.8.
  • We calculate per capita figures. All of our per capita figures are calculated from our population metric, which is included in the complete dataset.
    • We also calculate energy consumption per gdp, and include the corresponding gdp metric used in the calculation as part of the dataset.
  • We remove inconsistent data. Certain data points have been removed because their original data presented anomalies. They may be included again in further data releases if the anomalies are amended.

License

All visualizations, data, and code produced by Our World in Data are completely open access under the Creative Commons BY license. You have the permission to use, distribute, and reproduce these in any medium, provided the source and authors are credited.

The data produced by third parties and made available by Our World in Data is subject to the license terms from the original third-party authors. We will always indicate the original source of the data in our database, and you should always check the license of any such third-party data before use.

Authors

This data has been collected, aggregated, and documented by Hannah Ritchie, Pablo Rosado, Edouard Mathieu, Max Roser.

Our World in Data makes data and research on the world’s largest problems understandable and accessible. Read more about our mission.

How to cite this data?

If you are using this dataset, please cite both Our World in Data and the underlying data source(s).

Please follow the guidelines in our FAQ on how to cite our work.

About

Data on energy by Our World in Data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%