6 great websites with free data sets

18 June 2020 By Gyana

Data is everywhere. It’s not surprising, considering the amount generated each and every day. 

In 2020, it’s estimated that the digital sphere consists of 44 zettabytes of data, so there’s certainly no shortage of free and interesting data.

There are plenty of repositories curating data sets to suit all your needs, and many of these sites also filter out the not-so-great ones, meaning you don’t have to waste time downloading useless CSV files. 

If you want to learn how to analyze data, improve your data literacy skills, or learn how to create data visualizations, readily available data sets are a great palace to start.

In this blog post, we’ll take a look at some of our favourite places to find free data sets, so you can spend less time searching and more time uncovering insights.


1. FiveThirtyEight 

Link: https://data.fivethirtyeight.com/

FiveThirtyEight is an independent collection of data on US politics, US sport and other general interest datasets. It specialises in the collation and ranking of reliable political and opinion polls. We’ve used them in a number of projects, finding out some interesting things along the way, like when Donald Trump is most active on Twitter (Sign up to VAYU for free to view the template).

2. Google Trends

Link: https://trends.google.com/trends/

Google provides readily accessible data sets on search trends, and you can customize the parameters to easily find whatever it is you’re interested in. We recommend exporting the dataset and running it through VAYU for one-click visualizations and advanced analysis.

3. ProPublica Data Store

Link: https://www.propublica.org/datastore/ 

ProPublica, probably best known for their award-winning investigative journalism, collects data pertaining to the US economy, finance, health, industry, politics and more. They have both free and premium datasets, should you need to delve deeper into whatever it is you’re exploring.

4. Centers for Disease Control and Prevention

Link: https://www.cdc.gov/datastatistics/index.html

The CDC collects the abundance of health data provided by US government research and sources, including data and research on alcohol, life expectancy, obesity and chronic diseases. This is a great resource for analyzing and understanding public health.

5. Office for National Statistics

Link: https://www.ons.gov.uk/

The ONS is a centralized repository holding data related to the United Kingdom. They have datasets on crime, the economy, public health and policy data, and it’s all readily available.

6. Johns Hopkins COVID-19 data

Link: https://github.com/CSSEGISandData/COVID-19

Johns Hopkins, a renowned private research university, provides extensive and reliable data on all facets of COVID-19 statistics and research. It is publicly funded and 100% independent, bias-free and, most importantly, pinpoint accurate. 


If you need help analyzing your findings or turning them into visualizations, get in touch. We’re on hand to demo VAYU.

Follow our blog for more tutorials and free resources.

Gyana

The Gyana team post business insight, product demos, team interviews and resources.