Validation of THIN data for non-melanoma skin cancer
- Meal, Andy BMedSci (Hons) BM BS MPhil FHEA, Lecturer, School of Nursing
- Leonardi-Bee, Jo BSC (Hons) MSc PhD, Lecturer in Medical Statistics, Division of Epidemiology and Public Health
- Smith, Chris BA PGCE, Senior Research Fellow, Division of Epidemiology and Public Health
- Hubbard, Richard MSc DM FRCP, British Lung Foundation Professor of Respiratory Epidemiology, Division of Epidemiology and Public Health
- Bath-Hextall, Fiona BSc (Hons) PhD, Associate Professor, Centre for Evidence-Based Dermatology
Background
The Health Improvement Network (THIN) database began in 2003. It consists of anonymised records from over 300 general practice computer systems and is likely to be valuable for research, planning and strategic issues in health care, but it is important to establish completeness and accuracy of the data.
Aim
To investigate the validity of THIN data for non-melanoma skin cancer (NMSC). We defined NMSC as basal cell carcinoma (BCC) and squamous cell carcinoma (SCC).
Methods
Using Read codes we extracted THIN database records of first-recorded diagnoses of NMSC from 1 January 1996 to 31 December 2003. Searches for SCC were unable to distinguish between skin tumours of this type, and SCC at any other site. From our dataset for BCC, 40 patient records were selected at random, and a questionnaire sent to their corresponding practice, asking if they had been referred to hospital/dermatology clinic, and how the diagnosis of BCC had been confirmed.
Results
All the patients in the sample were referred to a hospital or dermatology clinic: 37/40 (93%) had the diagnosis of BCC confirmed, either by a letter from the hospital or a pathology report, a finding that we have reported previously. One patient’s diagnosis was confirmed as SCC, and the other two either died or moved away before diagnosis could be confirmed. The 38 patients with diagnoses confirmed were all treated in hospital or dermatology clinic.
Conclusions
Data for BCC are sufficiently accurate for research. It is also likely that these data will prove valuable for quality management. It is not possible currently to obtain accurate data for SCC of the skin from the THIN database. This seems not to be a problem with the THIN database itself, but attributable to the Read coding scheme being, in practice, unable to allow differentiation between SCCs of different organs.