Education Endowment Foundation:Evaluation data archive

Evaluation data archive

Find out more about the data we’ll collect, how we’ll store it and how we’ll keep it safe.

A unique feature of the EEF approach to evidence generation is that we archive the data from the impact evaluations we commission. Through this, we want to support open science by facilitating lawful access to data in the EEF Archive for secondary analysis.


Since the EEF was established in 2011, data from almost 140 EEF-funded evaluations has been deposited in the EEF archive, comprising data from more than 1.8 million children across England. This data repository will continue to grow as more evaluations are completed and their data archived.

The archive holds the data of pupils in state-funded schools in England and, occasionally, of teachers and parents. Most evaluations we commission include randomised control trials (RCTs) to assess the impact of education interventions on pupils’ attainment and a range of secondary outcome measures. The archive also contains data from some QEDs, and we intend to extend the types of data held within as EEF expands to different types of evaluation and methodologies.


The type of data archived after project completion varies between projects, but for most evaluations, this includes primary outcomes, secondary outcomes, treatment allocation and any other variables used in statistical analysis (e.g., compliance, missingness, sensitivity, mediation analysis). Primary outcome measures are most often attainment measures, which vary depending on age group and type of intervention, but could be statutory assessment, like SATs and GCSE scores or commercial, validated tests, for example GL assessments. Secondary outcome measures may include proximal outcomes, or other attainment areas of interest.

You can find more information about the content of the EEF data archive in our new EEF data catalogue. The data catalogue contains information and metadata about the datasets and fields available for secondary research analysis. By clicking the Resources” tab on each individual project page, you can also find low-fidelity synthetic datasets for most trials. Low-fidelity synthetic data can be used to facilitate understanding of what the data looks like, as well as to begin prepping analysis code. Find more about the creation of synthetic data here. We welcome feedback that could help improve the usefulness of these resources.

The EEF Archive is hosted in the Secure Research Service (SRS) at the Office of National Statistics (ONS) and is managed by the Fischer Family Trust (FFT), the EEF Archive data manager. Thus, to be able to access the data in the EEF’s archive, researchers will need to be accredited under the ONS-approved researcher scheme.


Researchers who wish to access data extracts from the EEF archive will need to have applications processed and approved by the EEF and other relevant stakeholders, such as FFT, the ONS and the Department for Education (DfE).We are working closely with the ONS, DfE and FFT to streamline the process for third parties to access data extracts from the EEF archive. Currently, we are only providing extracts of EEF archive data via projects we commission. See here for our open tenders.

We aim to provide unfunded opportunities to access archive data in the future. We will update this webpage on the process for this in due course.


  • Privacy notice for EEF archive

    This privacy notice for the EEF data archive aligns with the Information Commissioner’s Office ICO requirements. It lays out the purpose for archiving our evaluation data, describes where it comes from and its category, along with our lawful basis for processing data. It also describes people’s information rights regarding the data we hold.
  • EEF data archive data protection impact assessment (DPIA)

    Also, in line with ICO requirements, our DPIA lays out the process for archiving data from the evaluations we fund. It also assesses the data protection risks involved in the management and sharing of the EEF data archive, along with identifying measures to reduce those risks.
  • EEF data archive legitimate interests assessment (LIA)

    This document is also aligned with ICO requirements. It provides an assessment of the archive lawful basis for data processing as legitimate interest, explaining how and why we collect data from the evaluations we fund.

More information about the process and guidance for archiving evaluation data can be found here: Archiving evaluation data