Data Sets

These data sets are provided for teachers of statistics to use with their students. Each comes with information about the data and an indication of the statistical techniques that may be useful when working with it. If anyone else wishes to use the data sets for any other purpose, or to publish them elsewhere, please contact MEI so that permission can be sought from the original provider of the data.

For guidance on the Large Data Sets in A level Mathematics see: Working with the Large Data Sets in A level Mathematics.

Natural history data sets

These data sets have been made available through The Royal Statistical Society as part of its support for statistics education in schools.

(source EURING http://www.euring.org/)
Thanks are due to all European ringing schemes which have agreed to release these ringing recovery data. Thanks are also due to the very many ringers, throughout more than a century, who have contributed to this data set.

Statistical techniques: Averages, Spread, Graphical summaries, Dealing with missing data, Time series, Categorical data.
Data set (Excel)
Data set (CSV)

(source Chris du Feu)

Statistical techniques: Averages, Spread, Graphical summaries, Dealing with missing data, General linear model.
Data set (Excel)
Data set (CSV)

(source Chris du Feu)

Statistical techniques: Grouping data, Contingency tables, Dealing with missing data.
Data set (Excel)
Data set (CSV)
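
The techniques listed for the data sets above (averages, spread, dealing with missing data) can be tried on any of the CSV files. What follows is only a minimal sketch in Python, not a prescribed method: the sample rows, the column names (species, wing_length_mm) and the missing-value markers (an empty cell or NA) are invented for illustration and are not taken from the actual files.

```python
import csv
import io
import statistics

# Hypothetical sample standing in for a downloaded CSV file.
# The column names and values are invented for illustration only.
SAMPLE_CSV = """species,wing_length_mm
Blue Tit,64
Blue Tit,
Great Tit,75
Great Tit,NA
Blue Tit,66
"""

# Assumed markers for a missing value; real files may use others.
MISSING = {"", "NA"}


def summarise(csv_text, column):
    """Summarise one numeric column, counting and skipping missing values."""
    reader = csv.DictReader(io.StringIO(csv_text))
    values = []
    missing = 0
    for row in reader:
        cell = (row[column] or "").strip()
        if cell in MISSING:
            missing += 1  # record the gap rather than silently dropping it
        else:
            values.append(float(cell))
    return {
        "n": len(values),
        "missing": missing,
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample standard deviation
    }


summary = summarise(SAMPLE_CSV, "wing_length_mm")
print(summary)
```

Reporting the number of missing values alongside the averages keeps the gaps visible, which can be a useful starting point for discussing with students how missing data should be handled.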


Links to other sources of data sets

There are many sources of data on the internet. The following sources each offer a number of data sets, with some information about the data and guidance on which statistical techniques are useful for each one.

Statistical data sets - from University of Massachusetts Amherst. Most of these data sets use techniques beyond A level but it is easy to find the ones suitable for GCSE and A level.

Environmental science data sets - Some data sets use techniques beyond A level but it is easy to find suitable data for KS4 and A level.

Radical Math - Statistics lesson suggestions using data for social justice – mainly US data.

EEPS data zoo - Mainly scientific data for modelling relationships in bivariate data.

Statistics Video Presentations - Video introductions, data sets and descriptions suitable for sixth form and beyond.

Data and Story Library - Data and descriptions searchable by topic or statistical technique – the data are generally coded and need to be copied to put them in a spreadsheet.

Useful websites for working with real data

StatTutor and Statistical Reasoning Course - Downloadable data sets and related questions for students to work through with feedback on answers.

Census at School Data Tool - Samples of real data from Census at School and Experiments at School which can be explored with online graphs.

Gapminder World - Explore real data through interactive graphs – online or offline. The data can also be downloaded.

UK Census 2011 (1% sample) - A teaching data set from Census 2011, consisting of categorical variables.

UK Maritime and Shipping statistics

Humanitarian data exchange - Data sets, many as spreadsheets and largely connected to world development.

World Bank Data - Most of this is open data which can be used for commercial and non-commercial purposes.

Data from the International Space Station - Background information and downloadable data for use by schools; other data are available at https://www.raspberrypi.org/learning/astro-pi-flight-data-analysis/worksheet/