The National Archives - link to home page    

Thursday 20 November

 

Main website navigation:

   
 
 NDAD: The National Digital Archive of Datasets
Welcome (home page) About NDAD Users Contributors  
Search Browse News Help (new window)  
 
 

Sub-series details: CRDA/24/DS/1994

1994

 
 
Quick reference Full details
 
  View in hierarchy
 

Jump to :

  Context   |   Identity statement   |   Administrative context   |   Source of acquisition   |   Nature and content   |   Conditions of access and use   |   Allied materials   |   Original system attributes   |   Structure   |   Validation   |   Links to dataset catalogues  

Context

Public Health Common Dataset
Top of pagetop of page

Identity statement

Title 1994
NDAD referenceCRDA/24/DS/1994
Dates of creation of datasets1994
Dates of contents of datasets1993
Date of last input to datasets [1994]
Date of last access to datasets[1994]
Extent of datasets6 datasets
ISAD(G) level of description Subseries
Top of pagetop of page

Administrative context

Aim and purpose
Statement of responsibility
Custodial history
Top of pagetop of page

Source of acquisition

Source of acquisition

This dataset was transferred from the Department of Health on 2 CD-ROMs which were received by NDAD on 1 October 1999.

Top of pagetop of page

Nature and content

Scope and content

This dataset includes public health data covering the year 1993. Further information about the Public Health Common data set is provided in the Series Catalogue and Dataset Documentation Catalogues. The sub-series consists of 6 datasets comprising a total of 360 tables, which contain various indicators for health regions in England and Wales. Data is arranged by administrative areas, namely District Health Authorities (DHAs), Family Health Service Authorities (FHSAs) and National Health Service (NHS) regions.

The 1994 Data Set contains several new indicators, detailed descriptions of the indicators can be seen in the data definitions and user guide in the Dataset Documentation Catalogue reference CRDA/24/DD/1/5/1. However just one user guide has been supplied for CRDA/24/DS/1994 and this does not cover the Population Health Outcome Indicators, dataset reference: CRDA/24/DS/1994/6. Researchers are advised to consult the 1995 User Guides for assistance, see CRDA/24/DD/1/6/1-2.

Digital processing and conversion

The tables in CRDA/24/DS/1994 were transferred to NDAD in the form of Symphony (WR1) spreadsheets, with the file names having meaningful prefixes to reflect the type of data. See CRDA/24/DD/1/5/1 and CRDA/24/DD/1/6/1-2 for a more detailed explanation of the prefixes. These files were opened in Microsoft Excel 97 (under Windows 95/98). Visual Basic Macros were written to process them so that the headings, subheadings, other metadata, and blank columns or rows between the data were removed and the format of the cells set to Microsoft Excel General format before saving the data in CSV (Comma separated variable) format. In a number of the original files (within CRDA/24/DS/1994/2 and CRDA/24/DS/1994/4), the data for each area is split over two lines. A Visual Basic module was written to move the figures so that there is one line per area.

Some additional processing was carried out on cda5 (CRDA/24/DS/1994/3/5) which lists the English DHAs classified hierarchically into 'families' and 'groups'; in order to retain the classification across the records, the entries in the first two fields were copied into the appropriate, blank cells.

Some cells in the original spreadsheets contain real numbers but they are formatted to display as integers and others which have many figures after the decimal point are displayed with just 2 decimal places. In order to preserve the actual data, this formatting has been removed. However, it must be assumed that original users of the spreadsheets would have seen the data as it had been originally formatted. Although some files contain fields that have many decimal places, users are advised that NDAD recommends that all data in the PHCDS is not quoted at more than two decimal places. This is because fields are not formatted with more than two decimal places in the original spreadsheet files. NDAD assumes that this is because the data creators considered the data to be accurate to, at most, two decimal places. The field formats set by NDAD (DOUBLE or INTEGER) have been specified according to the data the cells contain, rather than how they are displayed within the original spreadsheet.

The PHCDS in its original format does not use specific field names as such: generally, spreadsheet packages do not require data to be held within named fields. (The indicators within PHCDS do have original names which have been preserved but these equate to the table name within NDAD). To identify fields within a table, the first field is named either D_CODE (if the data relates to DHAs) or F_CODE (if it relates to FHSAs) and the second AREA (for the name of the area). The rest of the fields have been named sequentially, starting with the third field as F3, the fourth as F4 etc. These are not from the original data files but have been allocated by NDAD during the data conversion process. The column headings in the spreadsheet, supplemented at times by information from the 'Data definition and user guide', form the basis of the field descriptions. Where a spreadsheet contains one or more footnotes, the text was automatically extracted for inclusion in the relevant Table catalogue and is provided at the end of that catalogue under the heading 'Other information'.

Accruals
Top of pagetop of page

Conditions of access and use

Legal status
Access conditions

No access conditions apply

Copyright requirements
Data Protection Act requirements
Language

The language of the materials is .

Top of pagetop of page

Allied materials

Related units of description
Associated material
Publications produced by the originating department
Publications produced by researchers working on the datasets
Top of pagetop of page

Original system attributes

Hardware
Operating system
Application software
User interface
Top of pagetop of page

Structure

Logical structure and schema

The data has been divided into six datasets by topic. For access to these datasets, see Links to dataset catalogues.

The data is available for District Health Authorities (DHAs), Family Health Service Authorities (FHSAs), NHS Regions and nationally for England and Wales. However this does not apply to all tables; for instance the data in tables CRDA/24/DS/1994/1/6 (hna6) - CRDA/24/DS/1994/1/9 (hna9) and CRDA/24/DS/1994/6/25 (phob4) is only provided at the regional level. Also CRDA/24/DS/1994/3/5 (cda5) does not provide any numerical data, but, as mentioned above in the section relating to Digital processing and conversion., lists the English DHAs grouped into similar area types according to a range of socio-economic and demographic census variables. These groupings of DHAs are used in a number of tables in the dataset covering the Population Health Outcome Indicators (CRDA/24/DS/1994/6), for instance table phoa3-1 (CRDA/24/DS/1994/6/9); the 1995 documentation covering the PHOI states "In this release the indicators have, where possible, been calculated for the eleven 'groups' of DHAs (boundaries as at April 1994) to give mean values for each area type. Thus, where possible, for every indicator the data are presented for RHAs, the OPCS area classification groups, and DHAs".

In CRDA/24/DS/1994/1, the data in the tables covering indicators A8 and A9 data are only available to regional level and the regional data relates to the Standard Regions used by OPCS for statistical purposes. These Standard Regions (based on County boundaries which are not in general co-terminous with Health Authority areas; see the Documentation Catalogue, reference CRDA/24/DD/1/5/1) have not been encoded at all in the tables and therefore tables CRDA/24/DS/1994/1/8 and CRDA/24/DS/1994/1/9 have not been included in the relationships set up by NDAD between the tables (to allow users to link data in similar tables within a dataset via the code for the area). In a number of the tables, (eg cdc1 CRDA/24/DS/1994/3/18) there some rows which provide data for two RHAs, for instance, Northern and Yorkshire RHAs are combined under code 'A00+B00'; clearly this affects linking of data with other tables which have separate rows of data for these two RHAs (ie separately under codes 'A00' and 'B00).

Dynamic or closed
How data was originally captured and validated

Details of the sources which were used to produce the Public Health Common data files and how the data was checked by OPCS are given in the Series Catalogue . This year saw major changes in OPCS systems of collecting, processing and tabulating births and deaths data. Changes affecting the mortality data also occurred and included the introduction of an automatic cause of death coding system, which entailed a number of changes to the rules and procedures for classifying deaths as from 1993. There was also a change in the cut-off date (from 31 January to 11 February) for births registered in the year after their occurrence. For further information about this and the presentation of mortality data please see the Dataset Documentation Catalogue reference CRDA/24/DD/1/5/1.

Constraints on the reliability of the data
Top of pagetop of page

Validation

Content validation

No discrepancies were noted in the original spreadsheet files as compared to the expected contents as described in the User Guide for the 1994 PHCDS, see the Dataset Documentation Catalogue, references CRDA/24/DD/1/5/1 . However, during processing/checking of the data, the following anomalies were noticed:

In table CRDA/24/DS/1994/6/7 (phoa2-3), there are a number of 'invalid' formulae in the seventh column (for instance cell G26=E26/F26*1000 but there is no data in columns E and F and therefore this formula is displayed as #DIV/0!).

In CRDA/24/DS/1994/1, the data in the tables covering indicators A8 and A9 data are only available to regional level and the regional data relates to the Standard Regions used by OPCS for statistical purposes. These Standard Regions (based on County boundaries which are not in general co-terminous with Health Authority areas; see the Documentation Catalogue, reference CRDA/24/DD/1/5/1) have not been encoded at all in the tables and therefore tables CRDA/24/DS/1994/1/8 and CRDA/24/DS/1994/1/9 have not been included in the relationships set up by NDAD between the tables (to allow users to link data in similar tables within a dataset via the code for the area). In a number of the tables, (eg cdc1 CRDA/24/DS/1994/3/18) there some rows which provide data for two RHAs, for instance, Northern and Yorkshire RHAs are combined under code 'A00+B00'; clearly this affects linking of data with other tables which have separate rows of data for these two RHAs (ie separately under codes 'A00' and 'B00).

Transformation validation

Spot checks were carried out to compare the transformed data against the data in the original Symphony files. These included comparing the values of specific fields and checking that the totals of numeric fields were the same. In addition, each table was checked to ensure that the overall number of records and fields remained the same. No discrepancies were detected between the original and transformed data. The only differences found resulted from rounding and/or floating point representation, particularly for example where the original numbers had 12 figures after the decimal point. The transformed data is restricted to the level of accuracy provided by the general format in Excel (generally 8 figures after the decimal point).

Top of pagetop of page

Links to dataset catalogues

Links to dataset catalogues

Dataset catalogues provide more detailed information about individual datasets, and are currently available for the following dataset(s):

NDAD referenceTitle (link leads to Dataset Catalogue)
CRDA/24/DS/1994/11994 - Health of the Nation Indicators: Monitoring Data
CRDA/24/DS/1994/21994 - Trends in Health of the Nation Mortality Indicators
CRDA/24/DS/1994/31994 - PHCDS Indicators
CRDA/24/DS/1994/41994 - Trends in PHCDS Mortality Indicators
CRDA/24/DS/1994/51994 - 1991 Census Supplement Indicators
CRDA/24/DS/1994/61994 - Population Health Outcome Indicators
Top of pagetop of page

Last updated 2003-06-09 11:16:15

 
 

NDAD v3.0