A place to find cool datasets.
Follow us on Twitter for updates! @cooldatasets
This file contains salaries for the City Of Chicago
The Toxics Release Inventory (TRI) makes available information for more than 600 toxic chemicals
2016 National popular vote tracker compiled by David Wasserman
by Volume and Rate per 100,000 Inhabitants, 1994–2013. Includes Violent Crimes, Murders, Rapes, Bu
This data set includes provider data for the hip/knee complication measure, and the Agency for Healthcare Research and Quality (AHRQ) measures of serious complications.
This data set includes provider data for the payment measures and value of care displays associated with a 30-day episode of care for heart attack, heart failure, and pneumonia patients.
City Official's salaries for the City of Phoenix, Arizona.
United States Department of Commerce dataset of total value of construction currently put in place.
11,000+ proposed amendments to the United States Constitution from 1787-2014
Crime in Louisville, Kentucky from 2003 to 2016
Dataset of police cruiser district locations in Columbus Ohio
List of informal consumer complaint calls regarding unwanted robocalls and telemarketing calls.
Information on the salaries of staff at the White House
This dataset contains a number of climate change mitigation policies and measures (PAM) implemented or planned by European countries to reduce greenhouse gas emissions.
Officer Involved Shootings in Austin Texas from 2000-2014
Adjusted gross income and taxes owed by Hillary are included for each year from 2000-2015.
2000 tweets immediately following the first Presidential Debate in September 2016
A dataset containing the Open Data Portals of 100 of America's largest cities
800+ White House nominations and appointments
The Ames Electric Arc Shock Tube (EAST) Facility is the only shock tube...
NASA Financial Budget Documents, Strategic Plans and Performance Reports for fiscal year 1997.
A comprehensive list of research resources, including catalogues, metadata, archives and records, which can be accessed both online
60 East Coast data sets from 1906 to 2009, with over 260,000 records of seabird observations.
90,000+ entries, the Complete PLANTS Checklist is nearly 7 MB and includes Symbol, Synonym Symbol, Scientific Name with Authors, National Common Name, and Family.
45,000 recorded NASA meteorite landings.
Orbital elements of near earth Comets
50k+ digitally-constructed and downloadable neurons
Chronological data summary of fireball and bolide events provided by U.S. Government sensors.
Activities performed by an astronaut or cosmonaut outside a spacecraft beyond the Earth's appreciable atmosphere dating back to 1965.
Detailed tracking and info for tropical storms and hurricanes in the North Atlantic since 1851.
Global Carbon emission from 1751 to 2013 by Carbon Dioxide Information Analysis Center, Oak Ridge National Laboratory, U.S. Department of Energy
Monthly numbers of sunspots, as from the World Data Center, aka SIDC from 1749-2013
Ten rats are randomized to each of the four treatments. The question of interest is how diet affects weight gain. source source of protein given, a factor with levels Beef and Cereal. type amount of protein given, a factor with levels High and Low. weightgain weigt gain in grams.
Nitrogen pollution from contributing sources in Bay watershed, pounds per year.
Between May 1934 and July 1935, the National Bureau of Standards in Washington D.C. conducted a series of experiments to estimate the acceleration due to gravity, g, at Washington.
Also known as the GW150914 event, this observation from LIGO proved Einstein's prediction of general relativity
Movies with 40 or more critic reviews vie for their place in history at Rotten Tomatoes. Eligible movies are ranked based on their Adjusted Scores.
50 Most Streamed Spotify Songs
Weekly updated football datasets.
Master list of 2,600+ Ted Talks and descriptions
Dataset of Rolling Stone's 500 greatest albums of all time
Images and videos of various types of agents (not just pedestrians, but also bicyclists, skateboarders, cars, buses, and golf carts) that navigate in a real world outdoor environment
This data set consists of 20000 messages taken from 20 Usenet newsgroups.
A sampling of Twitter posts that have been judged based on whether they are offensive or contain hate speech, as a training set for text analysis.
The aim of this data is to predict the burned area of forest fires, in the northeast region of Portugal, by using meteorological and other data.
Curated datasets from Computer Vision Online
The largest human created question answer dataset for natural language processing
A reading comprehension dataset for the AI research
2000+ positive words used for sentiment analysis
8Million video URLs, 500K hours of video
7 hours of self-driving training data from Comma.ai
Anonymized data from over 2 billion Uber trips.
200,000 standard reimbursement rates for travel among various U.S. destinations
Francis Galton introduced the correlation coefficient with an analysis of the similarities of the parent and child generation of 700 sweet peas.
Sample dataset of 350+ diamonds, their color, size, clarity, and price
Categorized database of 800,000+ fasion images
Wells Fargo branch deposits by US states and counties
Passenger information from the Titanic
United States patent information dating from 1790-2015
Name, City, Country, and Lat/Lon of 5000+ Airports Around the World.
Historical dataset for nominal and inflation adjusted oil prices since 1918
This dataset provides restaurant inspections, violations, grades and adjudication information
A dataset containing all NYC High Schools average SAT scores in reading, writing and math
San Francisco International Airport Report on Monthly Passenger Traffic Statistics by Airline.
Fuel economy data are the result of vehicle testing done at the Environmental Protection Agency's National Vehicle and Fuel Emissions Laboratory in Ann Arbor, Michigan, and by vehicle manufacturers with oversight by EPA.
A crowd sourced database of how well beer styles (Stout, Pale Ale, etc) and additions (chocolate, bacon, cherry) go with each other.
Monthly residential water usage use by zip code. Numbers represent Hundered Cubic Feet (HCF) usage. Records from 2005-2013
Birth Rates, by Age of Mother in the United States from 1940
Top 25 boy names, each year from 1980-2013 including frequency.
Most common food recalls by brand since 2009.
Population of homeless in New York City Neighborhoods by year
This dataset covers euro-denominated deposits with an agreed maturity from euro area households (percentages per annum, rates on new business).
3,000+ Barbershop locations in Texas.
Miami-Dade Corrections jail bookings from May 29, 2015 to current.
Valet Parking by District, Facility, and Locations in Philadelphia
Listing of 470,000+ business names and locations in Los Angeles
List of San FranciscoDepartment of Public Works (dpw) maintained street trees including: Planting date, species, and location
Dataset of all public libraries in the United States
historical and projected probabilities of death by single year of age, gender, and year for the period 1900 through 2010. Death Probabilities for Male.
U.S. states ranked by cases of Chlamydia, gonorrhea, and primary and secondary syphilis reported.
The world's telephones by continent in the years 1951, 1956, 1957, 1958, 1959, 1960, 1961
These data list total primary energy consumption by country and region in Quadrillion Btu. Figures are annual totals for the years 1980 through 2008
1900+ New York City subway entrance locations
Dataset by trip, dates, ports, ships, and passengers.
Data on sectorial holdings of sovereign bonds for 12 countries
Not necessarily a dataset but still cool
Monthly datasets of all campaigns from Kickstarter.com