TEDS Data Dictionary

16 Year Study Data Files

Introduction

This page relates specifically to data collected in the 16 Year study. More general issues relating to the storage and organisation of TEDS data files are discussed on another page.

Raw Data files

These are currently stored in the \System\Rawdata\16yr\ folder, and the list below refers to files and sub-directories within this folder.

The Access database has been periodically updated to keep it in the most recent Access format, to avoid software compatibility issues. Nevertheless, the essential data from the tables are also stored as exported "copies" in the form of the csv files in the Export folder.

  • 16yr.accdb.
    This is the Access database file (2007 format) containing aggregated and cleaned 16 Year raw data from the booklets/questionnaires and from administrative sources. (Note that it does not contain the web study data; the files of web activity data are generally very large, and contain far too many columns to be accommodated in Access tables.)
    This Access database is now the master copy of the data extracted from all paper questionnaires, and is the source of such data for the analysis dataset. The important tables in the database are:
    • Parent1, Parent2 - data from the parent Behaviour/LEAP Study booklet
    • Child1, Child2, Child3 - data from the child Behaviour/LEAP Study booklet (the Child3 table contains item data from new measures that were not in the cohort 1 Behaviour study, but were added for the cohort 2/3/4 LEAP study)
    • Leap2Parent1, Leap2Parent2 - data from the parent LEAP-2 Study booklet
    • Leap2Child1, Leap2Child2 - data from the child LEAP-2 Study booklet
    • GCSEresults, OtherResults - data from the paper GCSE form. The former table contains results for short-course and full GCSEs; the latter contains results from other types of qualification.
    • yr16Progress - admin data relating to the 16 Year study, such as return dates.
    Other tables in the database do not have any direct bearing on the analysis dataset, so they are not listed here. They include lookup tables for GCSE and other qualifications.
  • \Export\ subdirectory, containing exported 16 Year raw data files. These csv files are exported from the Access database described above, and are used directly to construct the analysis dataset. Each exported csv file originates from a table in the Access database. The files are called:
    • Parent1.csv, Parent2.csv (parent Behaviour/LEAP Study booklet data)
    • Child1.csv, Child2.csv, Child3.csv (twin Behaviour/LEAP Study booklet data)
    • Leap2Parent1.csv, Leap2Parent2.csv (parent LEAP-2 Study booklet data)
    • Leap2Child1.csv, Leap2Child2.csv (twin LEAP-2 Study booklet data)
    • GCSEresults.csv, OtherExamResults.csv (twin GCSE and other examination results)
    • 16yrAdmin.csv (admin data from table yr16Progress, including return dates)
  • \web data files\ subdirectory, containing aggregated web test data files. There is one file for each of the twin web activities, although the four questionnaire activities have been merged into a single file. There is also a file containing web family data, including the brief parent questionnaire data, dates, and status flags for each twin activity. Each file contains data aggregated from the two waves of data collection, with identifying fields (like names) removed. These files were aggregated, with some cleaning, from the raw analysis files that were originally downloaded from the web server. The web data files are as follows:
    • 16yr_families_status.csv - web study progress data for each family, plus the parent web SES questionnaire data
    • 16yr_corsi_block.csv - Corsi Block test data
    • 16yr_dot_number.csv - Dot Number test data
    • 16yr_number_line.csv - Number Line test data
    • 16yr_number_sense.csv - Number Sense (Dot Task) test data, including derived Weber fraction scores
    • 16yr_PVT.csv - Problem Verification Task test data
    • 16yr_reaction_times.csv - Reaction Times test data
    • 16yr_understanding_number.csv - Understanding Number test data
    • 16yr_raven.csv - Raven's Matrices test data
    • 16yr_mill_hill.csv - Mill Hill Vocabulary test data
    • 16yr_expressions.csv - Expressions (Figurative Language) test data
    • 16yr_yes_no.csv - Yes/No (Reading Fluency) test data
    • 16yr_passages.csv - Passages (RC-16) test data
    • 16yr_qnrABCD.csv - combined data from the four twin "Environment and Wellbeing" web questionnaires: Part A, About You; Part B, Your School; Part C, You and Your Friends; Part D, More About You
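
The file layout listed above can be checked mechanically. The following is a minimal Python sketch, not part of the TEDS scripts, that builds the list of expected raw data files and reports any that are missing. The function names and the raw_root argument are hypothetical; raw_root should point at wherever the \System\Rawdata\16yr\ folder is mounted, which is not specified on this page.

```python
from pathlib import Path

# File names exactly as listed on this page.
EXPORT_FILES = [
    "Parent1.csv", "Parent2.csv",
    "Child1.csv", "Child2.csv", "Child3.csv",
    "Leap2Parent1.csv", "Leap2Parent2.csv",
    "Leap2Child1.csv", "Leap2Child2.csv",
    "GCSEresults.csv", "OtherExamResults.csv",
    "16yrAdmin.csv",
]
WEB_FILES = [
    "16yr_families_status.csv", "16yr_corsi_block.csv", "16yr_dot_number.csv",
    "16yr_number_line.csv", "16yr_number_sense.csv", "16yr_PVT.csv",
    "16yr_reaction_times.csv", "16yr_understanding_number.csv", "16yr_raven.csv",
    "16yr_mill_hill.csv", "16yr_expressions.csv", "16yr_yes_no.csv",
    "16yr_passages.csv", "16yr_qnrABCD.csv",
]


def expected_raw_files(raw_root):
    """Return the paths of all raw data files described on this page.

    raw_root is an assumed location for the 16yr raw data folder.
    """
    root = Path(raw_root)
    paths = [root / "16yr.accdb"]
    paths += [root / "Export" / name for name in EXPORT_FILES]
    paths += [root / "web data files" / name for name in WEB_FILES]
    return paths


def missing_raw_files(raw_root):
    """Return the expected paths that do not exist as files under raw_root."""
    return [p for p in expected_raw_files(raw_root) if not p.is_file()]
```

Calling missing_raw_files on the raw data folder should return an empty list if every file named above is present.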

Dataset files

These files are currently stored in the \System\Datasets\16yr\ folder. The following list refers to items within this folder.

  • Pdb9456.sav - the SPSS version of the full 16 Year dataset
  • \working files\ - this subdirectory contains various intermediate files, saved during the process of converting the raw data into the dataset. These files include working datasets p1merge, p2clean, p3derive, p4label1, p5label2, p6double (all .sav files), saved at the end of the 6 scripts. The last of these, p6double.sav, is identical (except for the name) to the final dataset Pdb9456.sav.
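
One way to confirm that p6double.sav and Pdb9456.sav really do match is to compare file hashes. The Python sketch below is illustrative only; it assumes "identical" means byte-for-byte identical (for example, one file being a renamed copy of the other), which this page does not state explicitly. The function names are hypothetical.

```python
import hashlib


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def same_contents(path_a, path_b):
    """True if the two files have byte-identical contents."""
    return file_digest(path_a) == file_digest(path_b)
```

If the two .sav files were saved separately rather than copied, their bytes may differ even when the data match, in which case a comparison within SPSS would be more appropriate.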

Syntax files (scripts)

These are SPSS syntax files, currently stored in the \System\Scripts\16yr\ folder. The names of the scripts are P1_merge, P2_clean, P3_derive, P4_label_part1, P5_label_part2, P6_double (all .sps files). The processing carried out by these scripts is described on another page.
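
For reference, the script names can be lined up against the working dataset names listed under "Dataset files". The Python sketch below pairs them in order. The one-to-one pairing is an assumption based on the matching names and on the statement that the working datasets are saved at the end of the 6 scripts; pipeline_steps is a hypothetical helper, not part of the TEDS scripts.

```python
# Script and working dataset names as given on this page, in run order.
SCRIPTS = ["P1_merge", "P2_clean", "P3_derive",
           "P4_label_part1", "P5_label_part2", "P6_double"]
WORKING_DATASETS = ["p1merge", "p2clean", "p3derive",
                    "p4label1", "p5label2", "p6double"]


def pipeline_steps():
    """Pair each .sps script with the .sav file it is assumed to save."""
    return [(f"{script}.sps", f"{dataset}.sav")
            for script, dataset in zip(SCRIPTS, WORKING_DATASETS)]


for script, dataset in pipeline_steps():
    print(f"{script} -> {dataset}")
```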