Administrative data linkage to Census 2021 in Wales, UK: A cross-sectional study examining completeness and representativeness for population analytics

Introduction Measuring population representativeness is an important methodological step in public health and epidemiological studies. Objectives To explore the representativeness of Census 2021 data linkage when compared with the Welsh Demographic Service Dataset (WDSD) within the Secure Anonymised...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of population data science Jg. 10; H. 1
Hauptverfasser: Jane Lyons, Rhodri Johnson, Michael Edwards, Samantha Turner, Richard Fry, Lucy J Griffiths, Ronan Lyons
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Swansea University 01.11.2025
Schlagworte:
ISSN:2399-4908
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Introduction Measuring population representativeness is an important methodological step in public health and epidemiological studies. Objectives To explore the representativeness of Census 2021 data linkage when compared with the Welsh Demographic Service Dataset (WDSD) within the Secure Anonymised Information Linkage (SAIL) Databank for research on the population of Wales, UK. To understand the characteristics of individuals linked and not linked and which subgroups of the population are disproportionately represented in data linkage population-wide studies. Methods An observational, population-wide cross-sectional comparison study, utilising administrative demographic data and decennial survey data held in SAIL. Two data sources, the WDSD and Census 2021, were used to create and compare two cohorts of the resident population of Wales, UK, on 21st March 2021. The two cohorts were linked to understand how many individuals from Census 2021 can be successfully linked within SAIL, in WDSD and not in Census 2021, and found across both sources. Logistic regression models analysed the variation in the linkability of the survey data within SAIL by various demographic and household characteristics. Results The central analytical cohort contained 2,440,191 individuals present in both data sources. WDSD contained 3,090,976 individuals with 2,965,196 individuals in Census data. With a positively classed outcome indicating non-linkage from WDS to Census the characteristics associated with the highest odds of individuals being registered in WDS but not linked to Census (in SAIL) are male (aOR = 1.28 [95%CI 1.28,1.32]), 75+ years of age (aOR = 1.27 [95%CI 1.25,1.29]), of Asian ethnicity (aOR = 1.27 [95%CI 1.24,1.30]), a more recent migrant (arriving to UK after 2000) (aOR = 1.30 [95%CI 1.28,1.32]), a member of the LGBTQ+ community (aOR = 1.29 [95%CI 1.25,1.29]) or not disclosing LGBTQ+ status (aOR = 1.41 [95%CI 1.39,1.43]), being separated, divorced or widowed (aOR = 1.28 [95%CI 1.27,1.29]), or living in rental accommodation (aOR = 1.47 [95%CI 1.45,1.48]). Conclusions Results show that certain personal characteristics and sub-groups of the population of Wales are disproportionately represented when combining population estimates and utilising Census data in data linkage population-wide studies in SAIL.
AbstractList Introduction Measuring population representativeness is an important methodological step in public health and epidemiological studies. Objectives To explore the representativeness of Census 2021 data linkage when compared with the Welsh Demographic Service Dataset (WDSD) within the Secure Anonymised Information Linkage (SAIL) Databank for research on the population of Wales, UK. To understand the characteristics of individuals linked and not linked and which subgroups of the population are disproportionately represented in data linkage population-wide studies. Methods An observational, population-wide cross-sectional comparison study, utilising administrative demographic data and decennial survey data held in SAIL. Two data sources, the WDSD and Census 2021, were used to create and compare two cohorts of the resident population of Wales, UK, on 21st March 2021. The two cohorts were linked to understand how many individuals from Census 2021 can be successfully linked within SAIL, in WDSD and not in Census 2021, and found across both sources. Logistic regression models analysed the variation in the linkability of the survey data within SAIL by various demographic and household characteristics. Results The central analytical cohort contained 2,440,191 individuals present in both data sources. WDSD contained 3,090,976 individuals with 2,965,196 individuals in Census data. With a positively classed outcome indicating non-linkage from WDS to Census the characteristics associated with the highest odds of individuals being registered in WDS but not linked to Census (in SAIL) are male (aOR = 1.28 [95%CI 1.28,1.32]), 75+ years of age (aOR = 1.27 [95%CI 1.25,1.29]), of Asian ethnicity (aOR = 1.27 [95%CI 1.24,1.30]), a more recent migrant (arriving to UK after 2000) (aOR = 1.30 [95%CI 1.28,1.32]), a member of the LGBTQ+ community (aOR = 1.29 [95%CI 1.25,1.29]) or not disclosing LGBTQ+ status (aOR = 1.41 [95%CI 1.39,1.43]), being separated, divorced or widowed (aOR = 1.28 [95%CI 1.27,1.29]), or living in rental accommodation (aOR = 1.47 [95%CI 1.45,1.48]). Conclusions Results show that certain personal characteristics and sub-groups of the population of Wales are disproportionately represented when combining population estimates and utilising Census data in data linkage population-wide studies in SAIL.
Author Richard Fry
Ronan Lyons
Jane Lyons
Rhodri Johnson
Lucy J Griffiths
Michael Edwards
Samantha Turner
Author_xml – sequence: 1
  fullname: Jane Lyons
  organization: Population Data Science, Swansea University Medical School, Swansea, SA2 8PP
– sequence: 2
  orcidid: 0000-0001-9636-0753
  fullname: Rhodri Johnson
  organization: Population Data Science, Swansea University Medical School, Swansea, SA2 8PP
– sequence: 3
  fullname: Michael Edwards
  organization: Population Data Science, Swansea University Medical School, Swansea, SA2 8PP
– sequence: 4
  fullname: Samantha Turner
  organization: Population Data Science, Swansea University Medical School, Swansea, SA2 8PP
– sequence: 5
  fullname: Richard Fry
  organization: Population Data Science, Swansea University Medical School, Swansea, SA2 8PP
– sequence: 6
  fullname: Lucy J Griffiths
  organization: Population Data Science, Swansea University Medical School, Swansea, SA2 8PP
– sequence: 7
  fullname: Ronan Lyons
  organization: Population Data Science, Swansea University Medical School, Swansea, SA2 8PP
BookMark eNotkNtqGzEQhkVpoW7qB8idHqDr6LAn9c6YHkwCuYnp5TKrmTVy19IiyaZ-kr5u1k6uBj7m_2b4v7CPPnhi7F6KldJtax7cYcK0Okvh5EoZU35gC6WNKUoj2s9smdJBCKFkqZpaLtj_NR6ddylHyO5MHCEDH53_C3viOfAN-XRKXM0J7jz_AyOlb3z3-J2vuY0hpSKRzS54GHnKJ7xw-gdXo99zG47TSJk8pcTBI480RUrk8-3WDQ8h8ilMpxGuknkLxkt2Nn1lnwYYEy3f5x3b_fzxsvldPD3_2m7WTwVKYcrCDgNatKAa3WNvqtZqq_pWQG_tYIZegpEoalGrcqCqhkqRUthoAKyk0Vrfse2bFwMcuim6I8RLF8B1NxDivoM4PzRSZ0pEkFXZKI1lpdu5z7q3aMzsoqbq9SsSwnuT
ContentType Journal Article
DBID DOA
DOI 10.23889/ijpds.v10i1.2994
DatabaseName DOAJ Directory of Open Access Journals
DatabaseTitleList
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Economics
EISSN 2399-4908
ExternalDocumentID oai_doaj_org_article_94dda154723d45389086bcd99ad5e75b
GroupedDBID AAFWJ
ADBBV
AFPKN
ALMA_UNASSIGNED_HOLDINGS
BCNDV
GROUPED_DOAJ
M~E
OK1
RPM
ID FETCH-LOGICAL-d1094-cffdcdca273bdb958c3c2b80abccf9fb1a91d060624fe56a52e22d73aad519333
IEDL.DBID DOA
IngestDate Mon Dec 01 19:26:11 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-d1094-cffdcdca273bdb958c3c2b80abccf9fb1a91d060624fe56a52e22d73aad519333
ORCID 0000-0001-9636-0753
OpenAccessLink https://doaj.org/article/94dda154723d45389086bcd99ad5e75b
ParticipantIDs doaj_primary_oai_doaj_org_article_94dda154723d45389086bcd99ad5e75b
PublicationCentury 2000
PublicationDate 2025-11-01
PublicationDateYYYYMMDD 2025-11-01
PublicationDate_xml – month: 11
  year: 2025
  text: 2025-11-01
  day: 01
PublicationDecade 2020
PublicationTitle International journal of population data science
PublicationYear 2025
Publisher Swansea University
Publisher_xml – name: Swansea University
SSID ssj0002142761
Score 2.307752
Snippet Introduction Measuring population representativeness is an important methodological step in public health and epidemiological studies. Objectives To explore...
SourceID doaj
SourceType Open Website
SubjectTerms administrative data
census representativeness
data linkage
Title Administrative data linkage to Census 2021 in Wales, UK: A cross-sectional study examining completeness and representativeness for population analytics
URI https://doaj.org/article/94dda154723d45389086bcd99ad5e75b
Volume 10
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  databaseCode: DOA
  dateStart: 20170101
  customDbUrl:
  isFulltext: true
  eissn: 2399-4908
  dateEnd: 99991231
  titleUrlDefault: https://www.doaj.org/
  omitProxy: false
  ssIdentifier: ssj0002142761
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  databaseCode: M~E
  dateStart: 20170101
  customDbUrl:
  isFulltext: true
  eissn: 2399-4908
  dateEnd: 99991231
  titleUrlDefault: https://road.issn.org
  omitProxy: false
  ssIdentifier: ssj0002142761
  providerName: ISSN International Centre
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8NAEF5EBL2IT3wzB4_GNrt57HqrpUUQiweLvZXZR6CiaWlq0Yt_w7_r7KZKPXnxkkMI2TCTnfmGnfk-xs6Fk1nq4iQirOGipMiSSEmv9YJGodYCha3FJvJeTw4G6n5J6sv3hNX0wLXhGiqxFinP51zYhHanIgyujVUKberyVPvoS6hnqZjyMdgTiVGBXh9jUlaSqjF6mtjqch43R1QWKpX8IukP2aS7xTYXMBBa9fLbbMWVO2z9e0q42mWfy6y2cwe-kxP8aSvtf5iNoe0lKirg9A0wKuGR4nx1Af3bK2hByHxRFZqs_CqBQhbcG74ENQgIbeQeLFOUAywtBGbLxRRSHfyAsCxMfsS96Cl8fveMznus3-08tG-ihYhCZGMq3SJTFNZYgwRTtNUqlUYYrmUTtTGFKnSMKrZNKmN4Urg0w5Q7zm0ukOxL4E6IfbZajkt3wEAQPHJCc4MyT4omyswiSiqJXGp5rvQhu_YWHU5qnoyhZ64ON8ifw4U_h3_58-g_XnLMNrjX6Q0zgydsdTZ9dadszcxno2p6Fn4Vut59dL4A2DPJug
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Administrative+data+linkage+to+Census+2021+in+Wales%2C+UK%3A+A+cross-sectional+study+examining+completeness+and+representativeness+for+population+analytics&rft.jtitle=International+journal+of+population+data+science&rft.au=Jane+Lyons&rft.au=Rhodri+Johnson&rft.au=Michael+Edwards&rft.au=Samantha+Turner&rft.date=2025-11-01&rft.pub=Swansea+University&rft.eissn=2399-4908&rft.volume=10&rft.issue=1&rft_id=info:doi/10.23889%2Fijpds.v10i1.2994&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_94dda154723d45389086bcd99ad5e75b