TEDS Data Dictionary

14 Year Study Data Files

Contents of this page:

Introduction

This page relates specifically to data collected in the 14 Year study. More general issues relating to the storage and organisation of TEDS data files are discussed on another page.

Raw Data files

These are currently stored in the \System\Rawdata\14yr\ folder, and the list below refers to files and sub-directories within this folder.

The Access database has been periodically updated to keep it in the most recent Access format, to avoid software compatibility issues. Nevertheless, the essential data from the tables are also stored in exported "copies" in the form of the csv files in the Export folder.

  • 14yr.accdb.
    This is the Access database file (2007 format) containing aggregated and cleaned 14 Year raw data from the booklets/questionnaires and from administrative sources. (Note that it does not contain the web test data; the files of web test data are generally very large, and contain far too many columns to be accommodated in Access tables.)
    This Access database is now treated as the master copy of the data (except for the web tests), and is the source of such data for the analysis dataset. The important tables in the database are:
    • Teacher - data from the teacher questionnaires
    • Parent - data from the parent booklet
    • Child - data from the child booklet
    • SLQ - data from the parent SLQ questionnaire
    • yr14Progress - admin data relating to the 14 Year study, such as return dates. This table was originally in the TEDS admin database.
  • \Export\ subdirectory, containing 14 Year raw data files that have been exported from the Access database. These files are directly used to construct the analysis dataset. They are csv text files. The files are called:
    • Parent.csv (parent booklet data)
    • SLQ.csv (parent SLQ questionnaire data)
    • TwinQuestionnaire.csv (twin booklet data)
    • Teacher.csv (teacher questionnaire data)
    • Admin.csv (general admin data from table yr14Progress, including booklet return dates)
  • \web data files\ subdirectory, containing aggregated web test data files. There is one file for each of the twin web activities (Science, Vocabulary, Ravens Matrices) plus a file containing web family status data (such as parent consent, and start and end dates and test status for each twin). Each such file contains data aggregated from the two waves of data collection, with identifying fields (like names) removed. These files were aggregated, with some cleaning, from the raw analysis files that were originally downloaded from the web server. The web data files are as follows.
    • 14yr_science.csv - Science test data
    • 14yr_raven.csv - Ravens test data
    • 14yr_vocab.csv - Vocabulary test data
    • 14yr_families_status.csv - Family status data

Dataset files

These files are currently stored in the \System\Datasets\14yr\ folder. The following list refers to items within this folder.

  • Ndb9456.sav - the PSS version of the 14 Year dataset
  • \working files\ - this subdirectory contains various intermediate files, saved during the process of converting the raw data into the dataset. These files include working datasets n1merge, n2recode, n3derive, n4label, n5double (all .sav files), saved at the end of the 5 scripts. The latter file is identical (except for the name) to the final dataset file named above.

Syntax files (scripts)

These files are currently stored in the \System\Scripts\14yr\ folder.
Note that these are SPSS syntax files. The names of the scripts are N1_merge, N2_recode, N3_derive, N4_label, N5_double (all .sps files). The processing carried out by these scripts is described on another page.