Published on November 25th, 2019
Most of people believe that collecting big data would be a rough thing, but it’s simply not true. There are thousands of free data sets available online, ready to be analyzed and visualized by anyone. We all like free things right?
The web is practically swimming in free information. But these 30 sources of free data are widely considered to be quite reputable.
1. Google Dataset Search
Dataset Search enables users to find datasets stored across the Web through a simple keyword search. The tool surfaces information about datasets hosted in thousands of repositories across the Web, making these datasets universally accessible and useful.
UNdata is a web-based data service for the global user community. It brings international statistical databases within easy reach of users through a single-entry point.
Users can search and download a variety of statistical resources compiled by the United Nations (UN) statistical system and other international agencies.
The numerous databases or tables collectively known as “datamarts” contain over 60 million data points and cover a wide range of statistical themes including agriculture, crime, communication, development assistance, education, energy, environment, finance, gender, health, labour market, manufacturing, national accounts, population and migration, science and technology, tourism, transport and trade.
3. Amazon Public Data Sets
This registry exists to help people discover and share datasets that are available via AWS resources.
4. Google Trends
With this tool, you can search for keywords and see a variety of information about them, including:
- Interest over time.
- Interest by region.
- Related topics.
- Related queries.
5. Pew Research
Pew Research Center is a nonpartisan fact tank that informs the public about the issues, attitudes and trends shaping the world. We conduct public opinion polling, demographic research, content analysis and other data-driven social science research.
6. Google Scholar
A wide array of information, including articles, theses, books, abstracts, white papers, and court opinions.
7. EU Open Data Portal
The European Union Open Data Portal (EU ODP) provides access to an expanding range of data from the European Union (EU) institutions and other EU bodies. You can use and reuse these data for commercial or non-commercial purposes.
By providing easy access to data — free of charge — we aim to help you put them to innovative use and unlock their economic potential. The portal is also designed to make the EU institutions and other bodies more open and accountable.
8. Data.gov U.S.
Here you will find data, tools, and resources to conduct research, develop web and mobile applications, design data visualizations, and more.
9. Datasets Subreddit
A dive into anything and everything, from English grain prices of the 14th Century to U.S. homelessness rates.
Statistical analysis that tells compelling stories about elections, politics, sports, science, economics, and more.
11. Health Data
This site is dedicated to making high value health data more accessible to entrepreneurs, researchers, and policy makers in the hopes of better health outcomes for all.
12. The World Factbook
The World Factbook provides information on the history, people and society, government, economy, energy, geography, communications, transportation, military, and transnational issues for 267 world entities. The Reference tab includes: a variety of world, regional, country, ocean, and time zone maps; Flags of the World; and a Country Comparison function that ranks the country information and data in more than 75 Factbook fields.
Altmetric has tracked over 25 million mentions of 2.8 million research outputs. This page highlights the top 100 most-mentioned scholarly articles published in the past year – those which have truly captured the public imagination.
14. Qlik DataMarket
Make smarter business decisions. The Qlik DataMarket helps you find, connect and manage data from external sources. A place to check out data related to economics, healthcare, food, agriculture, and the automotive industry.
15. FBI Crime Statistics
The Uniform Crime Reporting program compiles official data on crime in the United States, published by the Federal Bureau of Investigation.
Statistical crime reports and publications detailing specific offenses and outlining trends to understand crime threats at both local and national levels.
16. National Institute on Drug Abuse
There are a variety of sources of information NIDA uses to monitor the prevalence and trends regarding drug abuse in the United States. The resources below cover a variety of drug related issues, including information on drug usage, emergency room data, prevention and treatment programs, and other research findings.
17. Drug War Facts
rug War Facts is a project of Common Sense for Drug Policy, a nonprofit 501(c)(3) organization dedicated to reforming drug policy and expanding harm reduction. CSDP disseminates factual information and comments on existing laws, policies and practices.
18. Open Corporates
OpenCorporates is the largest open database of companies and company data in the world, with in excess of 100 million companies in a similarly large number of jurisdictions. Our primary goal is to make information on companies more usable and more widely available for the public benefit, particularly to tackle the use of companies for criminal or anti-social purposes, for example corruption, money laundering and organised crime.
19. Kaggle Data Sets
Kaggle currently has close to 16,000 datasets. You can find data on anything from sports team player stats to Los Angeles parking citations.
20. Earth Data By NASA
The Earth Observing System Data and Information System contain NASA’s earth observation data which includes things like land surface temperature in NC and carbon flux.
21. Education Data By The World Bank
Comprehensive data and analysis source for key topics in education, such as literacy rates and government expenditures.
22. Pew Internet
Typically, survey data are released two years after the reports are issued. See this post for information on how to access and download our datasets. Pew Research Center staff are available to answer questions and to provide limited assistance in importing and analyzing the data.
23. Bureau Of Labor Statistics
he Bureau of Labor Statistics (BLS) of the U.S. Department of Labor is the principal federal agency responsible for measuring labor market activity, working conditions, and price changes in the economy. Its mission is to collect, analyze, and disseminate essential economic information to support public and private decision making. As an independent statistical agency, BLS serves its diverse user communities by providing products and services that are accurate, objective, relevant, timely, and accessible.
IFPI is a not-for-profit international organisation registered in Switzerland. We have offices in London, Brussels, Hong Kong and Miami plus a representative office in Beijing.
25. Academic Rights Press
Music ID is an academic research tool that utilizes powerful analytics to compile global music industry data into three, easy-to-use platforms: Music ID Data, Music ID Revenue, and Music ID Impact.
Incorporating 5,452 different charts spanning 74 countries, Music ID Data provides access to perpetually-updated, week-to-week information on iTunes downloads, Spotify and Apple Music streams, and Shazam searches, as well as contemporary and historical data from Billboard, GfK, and more.
26. Group Lens
GroupLens Research has collected and made available several datasets. Choose the one you’re interested in from the menu on the right. Before using these data sets, please review their README files for the usage licenses and other details.
27. GitHub’s BuzzFeed News
Come help us make collaboration even better. We’ve built a company we truly love working for, and we think you will too.
Developers from all around the world are building amazing things together. Their story is our story.
28. Data Catalogs
The aim of the Open Government Data Working Group is to support development of open government data catalogues around the world, and ensure different platforms are technically interoperable. Work will include:
- Technical support for setting up instances of CKAN in countries around the world
- Introductory guide to open government data catalogues
DataCatalogs.org aims to be the most comprehensive list of open data catalogs in the world. It is curated by a group of leading open data experts from around the world – including representatives from local, regional and national governments, international organisations such as the World Bank, and numerous NGOs.
29. Harvard Law School
A collection of links that cover a variety of topics, including everything from international relations and human rights data, from political institution databases.
AidData aggregates information from multiple sources to provide a searchable database of global development assistance flows and projects.
30. Department Of Labor
Few economic data are as closely watched as measures of employment and unemployment. The Bureau of Labor Statistics (BLS) provides a broad range of statistics covering jobs and joblessness.