Administrative data linkage to Census 2021 in Wales, UK: A cross-sectional study examining completeness and representativeness for population analytics
Introduction Measuring population representativeness is an important methodological step in public health and epidemiological studies. Objectives To explore the representativeness of Census 2021 data linkage when compared with the Welsh Demographic Service Dataset (WDSD) within the Secure Anonymised...
| Published in: | International journal of population data science Vol. 10, No. 1 |
|---|---|
| Main authors: | Jane Lyons, Rhodri Johnson, Michael Edwards, Samantha Turner, Richard Fry, Lucy J Griffiths, Ronan Lyons |
| Format: | Journal Article |
| Language: | English |
| Published: | Swansea University, 1 November 2025 |
| Keywords: | administrative data; census; representativeness; data linkage |
| ISSN: | 2399-4908 |
| Online access: | Full text |
| Abstract | Introduction: Measuring population representativeness is an important methodological step in public health and epidemiological studies. Objectives: To explore the representativeness of Census 2021 data linkage when compared with the Welsh Demographic Service Dataset (WDSD) within the Secure Anonymised Information Linkage (SAIL) Databank for research on the population of Wales, UK, and to understand the characteristics of individuals who were and were not linked and which subgroups of the population are disproportionately represented in population-wide data linkage studies. Methods: An observational, population-wide, cross-sectional comparison study using administrative demographic data and decennial survey data held in SAIL. Two data sources, the WDSD and Census 2021, were used to create and compare two cohorts of the resident population of Wales, UK, on 21 March 2021. The two cohorts were linked to determine how many individuals from Census 2021 could be successfully linked within SAIL, how many were in WDSD but not in Census 2021, and how many were found in both sources. Logistic regression models analysed how the linkability of the survey data within SAIL varied by demographic and household characteristics. Results: The central analytical cohort contained 2,440,191 individuals present in both data sources. WDSD contained 3,090,976 individuals and the Census data contained 2,965,196. With non-linkage from WDSD to Census as the positive outcome class, the characteristics associated with the highest odds of an individual being registered in WDSD but not linked to Census (in SAIL) were being male (aOR = 1.28 [95% CI 1.28, 1.32]), being 75+ years of age (aOR = 1.27 [95% CI 1.25, 1.29]), being of Asian ethnicity (aOR = 1.27 [95% CI 1.24, 1.30]), being a more recent migrant (arriving in the UK after 2000) (aOR = 1.30 [95% CI 1.28, 1.32]), being a member of the LGBTQ+ community (aOR = 1.29 [95% CI 1.25, 1.29]) or not disclosing LGBTQ+ status (aOR = 1.41 [95% CI 1.39, 1.43]), being separated, divorced or widowed (aOR = 1.28 [95% CI 1.27, 1.29]), and living in rental accommodation (aOR = 1.47 [95% CI 1.45, 1.48]). Conclusions: Certain personal characteristics and subgroups of the population of Wales are disproportionately represented when combining population estimates and using Census data in population-wide data linkage studies in SAIL. |
|---|---|
| Author | Richard Fry, Ronan Lyons, Jane Lyons, Rhodri Johnson, Lucy J Griffiths, Michael Edwards, Samantha Turner |
| Author_xml | 1. Jane Lyons; 2. Rhodri Johnson (ORCID 0000-0001-9636-0753); 3. Michael Edwards; 4. Samantha Turner; 5. Richard Fry; 6. Lucy J Griffiths; 7. Ronan Lyons. Organization for all authors: Population Data Science, Swansea University Medical School, Swansea, SA2 8PP |
| ContentType | Journal Article |
| DBID | DOA |
| DOI | 10.23889/ijpds.v10i1.2994 |
| DatabaseName | DOAJ Directory of Open Access Journals |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Economics |
| EISSN | 2399-4908 |
| ExternalDocumentID | oai_doaj_org_article_94dda154723d45389086bcd99ad5e75b |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 1 |
| Language | English |
| LinkModel | DirectLink |
| ORCID | 0000-0001-9636-0753 |
| OpenAccessLink | https://doaj.org/article/94dda154723d45389086bcd99ad5e75b |
| PublicationCentury | 2000 |
| PublicationDate | 2025-11-01 |
| PublicationDateYYYYMMDD | 2025-11-01 |
| PublicationDate_xml | – month: 11 year: 2025 text: 2025-11-01 day: 01 |
| PublicationDecade | 2020 |
| PublicationTitle | International journal of population data science |
| PublicationYear | 2025 |
| Publisher | Swansea University |
| Publisher_xml | – name: Swansea University |
| SourceID | doaj |
| SourceType | Open Website |
| SubjectTerms | administrative data; census; representativeness; data linkage |
| Title | Administrative data linkage to Census 2021 in Wales, UK: A cross-sectional study examining completeness and representativeness for population analytics |
| URI | https://doaj.org/article/94dda154723d45389086bcd99ad5e75b |
| Volume | 10 |
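
The cohort sizes quoted in the abstract can be turned into single-source counts and linkage shares with simple arithmetic. The sketch below is illustrative only and is not the authors' code. It assumes the three totals are directly comparable (same inclusion criteria), which the abstract does not confirm. The function names `overlap_summary` and `wald_odds_ratio` are hypothetical. The second helper shows the standard conversion of a logistic-regression coefficient and its standard error into an odds ratio with an approximate 95% Wald interval. The abstract does not say which interval method was used, so treat that part as an assumption as well.

```python
import math

# Counts quoted in the abstract.
WDSD_TOTAL = 3_090_976    # individuals in WDSD
CENSUS_TOTAL = 2_965_196  # individuals in Census 2021 data
IN_BOTH = 2_440_191       # central analytical cohort (present in both)


def overlap_summary(wdsd_total, census_total, in_both):
    """Derive single-source counts and linkage shares from three totals.

    Assumption: the totals use the same inclusion criteria, so simple
    subtraction gives the number of people found in one source only.
    """
    return {
        "wdsd_only": wdsd_total - in_both,
        "census_only": census_total - in_both,
        "share_of_wdsd_linked": in_both / wdsd_total,
        "share_of_census_linked": in_both / census_total,
    }


def wald_odds_ratio(beta, se, z=1.96):
    """Return (odds ratio, lower bound, upper bound) for a log-odds
    coefficient `beta` with standard error `se`, using exp(beta +/- z*se).

    Assumption: a Wald interval; z=1.96 approximates a 95% interval.
    """
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


summary = overlap_summary(WDSD_TOTAL, CENSUS_TOTAL, IN_BOTH)
for name, value in summary.items():
    # Shares are floats; counts are integers.
    if isinstance(value, float):
        print(f"{name}: {value:.3f}")
    else:
        print(f"{name}: {value:,}")

# Hypothetical coefficient and standard error, chosen only to illustrate
# the conversion; they are not taken from the study.
odds_ratio, lower, upper = wald_odds_ratio(beta=math.log(1.47), se=0.005)
print(f"aOR = {odds_ratio:.2f} [95% CI {lower:.2f}, {upper:.2f}]")
```

Under these assumptions, the count in WDSD but not in both sources is the group that the abstract's logistic regression treats as the positive (non-linked) outcome.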