TEDS Data Dictionary

10 Year Study Data Files

Contents of this page:

Introduction

This page relates specifically to data collected in the 10 Year study. More general issues relating to the storage and organisation of TEDS data files are discussed on another page.

Raw Data files

These files and sub-directories are currently stored in the \System\Rawdata\10yr\ directory, and the list below refers to files and sub-directories within this directory.

The Access database has been periodically updated to keep it in the most recent Access format, to avoid software compatibility issues. Nevertheless, the essential data from the tables are also stored in exported "copies" in the form of the csv files in the Export folder.

  • 10yr.accdb.
    This is the Access database file (2007 format) containing aggregated and cleaned 10 Year raw data from the teacher questionnaires and from administrative sources. (Note that it does not contain the web test data; the files of web test data are generally very large, and contain far too many columns to be accommodated in Access tables.) This Access database is now treated as the master copy of the teacher data, and is the source of such data for the analysis dataset. The important tables in the database are:
    • Teacher - data from the teacher questionnaires
    • yr10Progress - admin data relating to the 10 Year study, such as return dates. This table was originally in the TEDS admin database.
  • \Export\ subdirectory, containing exported 10 Year raw data files. These files are used to construct the analysis dataset. These files are exported from the Access database described above. They are csv files. The files are called:
    • teacher.csv (teacher questionnaire data)
    • 10yrAdmin.csv (admin data from table yr10Progress)
  • \web data files\ subdirectory, containing aggregated web test data files. There is one file for each of the twin web activities, plus a family file containing parent data and twin test status data. Each such file contains data aggregated from the two cohort data collections, with identifying fields (like names) removed. These files were aggregated, with some cleaning, from the raw analysis files that were originally downloaded from the web server. The web data files are as follows.
    • 10yr_piat.csv - PIAT test data
    • 10yr_maths.csv - Mathematics test data
    • 10yr_mrq_ar.csv - Maths and Reading Questionnaire plus Author Recognition data
    • 10yr_raven.csv - Ravens test data
    • 10yr_vocab.csv - Vocabulary test data
    • 10yr_piccomp.csv - Picture Completion test data
    • 10yr_genkn.csv - General Knowledge test data
    • 10yr_famstatus_and_parentqnr.csv - Parent web questionnaire data plus status variables summarising each twin's progress in the various tests.

Dataset files

These files are currently stored in the \System\Datasets\10yr\files\ directory. The following list refers to items within this directory:

  • Jdoub945.sav - the full SPSS version of the 10 Year dataset
  • \working files\ - this subdirectory contains various intermediate files, saved during the process of converting the raw data into the dataset. These files include working datasets j1merge, j2recode, j3derive, j4label, j5double (all .sav files), saved at the end of the 5 scripts. The latter file is identical (except for the name) to the final dataset Jdoub945.sav.

Syntax files (scripts)

These files are currently stored in the \System\Scripts\10yr\ directory.
Note that these are SPSS syntax files. The names of the scripts are J1_merge, J2_recode, J3_derive, J4_label, J5_double (all .sps files). The processing carried out by these scripts is described on another page.