Estimating households and populations from primary care electronic health records: comparison with Office for National Statistics Census 2021 aggregated estimates

IntroductionUp-to-date, high-quality estimates of population and households are essential for planning the provision of local and central infrastructure. ObjectivesWe aimed to derive estimates of population size, and household numbers and size on Census date (21/03/2021) using north-east London prim...

Full description

Saved in:
Bibliographic Details
Published in:International journal of population data science Vol. 10; no. 1
Main Authors: Wilk, Marta, Harper, Gill, Firman, Nicola, Dibben, Chris, Fry, Rich, Dezateux, Carol
Format: Journal Article
Language:English
Published: Swansea University 24.11.2025
Subjects:
ISSN:2399-4908, 2399-4908
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract IntroductionUp-to-date, high-quality estimates of population and households are essential for planning the provision of local and central infrastructure. ObjectivesWe aimed to derive estimates of population size, and household numbers and size on Census date (21/03/2021) using north-east London primary care Electronic Health Records (EHR) and calculate levels of their agreement with the publicly available official Census 2021 estimates to assess if health data have the potential to be used to create reliable statistics. MethodsWe compared EHR and Census population estimates by sex, age, local authority, and IMD quintile, and EHR and Census household estimates by number, size, and local authority. We estimated 95% Limits of Agreement between EHR and Census household and population estimates using the Bland and Altman method. In sensitivity analyses, we excluded people with no General Practice encounter within 12 months and compared the adjusted population's size to Census estimate. We compared EHR and administrative Statistical Population Dataset (SPD) to Census population estimates by sex and age, and EHR and Admin-based Occupied Address Dataset (ABOAD) to Census household estimates by local authority and household size. ResultsEHR population estimate was 2,130,965, i.e. 7.1% higher than Census of 1,990,087. EHR household estimate was 658,264, i.e. 9.1% lower than Census of 724,045. The estimate of population with recent GP encounter was 11.6% lower than the Census estimate. Compared to Census, both SPD and EHR overcounted population of males (10.7%, 7.9% respectively) and females (3.6%, 2.7% respectively). Both ABOAD and EHR had undercounted households compared to Census (-7.3%; -9.1% respectively). ConclusionsReliable, up-to-date populations and households estimates can be derived from health records. High residential mobility increases the complexity of deriving these estimates. Excluding people without GP encounters does not improve agreement with Census. Future work will focus on comparing Census and EHR estimates using individual-level data.
AbstractList IntroductionUp-to-date, high-quality estimates of population and households are essential for planning the provision of local and central infrastructure. ObjectivesWe aimed to derive estimates of population size, and household numbers and size on Census date (21/03/2021) using north-east London primary care Electronic Health Records (EHR) and calculate levels of their agreement with the publicly available official Census 2021 estimates to assess if health data have the potential to be used to create reliable statistics. MethodsWe compared EHR and Census population estimates by sex, age, local authority, and IMD quintile, and EHR and Census household estimates by number, size, and local authority. We estimated 95% Limits of Agreement between EHR and Census household and population estimates using the Bland and Altman method. In sensitivity analyses, we excluded people with no General Practice encounter within 12 months and compared the adjusted population's size to Census estimate. We compared EHR and administrative Statistical Population Dataset (SPD) to Census population estimates by sex and age, and EHR and Admin-based Occupied Address Dataset (ABOAD) to Census household estimates by local authority and household size. ResultsEHR population estimate was 2,130,965, i.e. 7.1% higher than Census of 1,990,087. EHR household estimate was 658,264, i.e. 9.1% lower than Census of 724,045. The estimate of population with recent GP encounter was 11.6% lower than the Census estimate. Compared to Census, both SPD and EHR overcounted population of males (10.7%, 7.9% respectively) and females (3.6%, 2.7% respectively). Both ABOAD and EHR had undercounted households compared to Census (-7.3%; -9.1% respectively). ConclusionsReliable, up-to-date populations and households estimates can be derived from health records. High residential mobility increases the complexity of deriving these estimates. Excluding people without GP encounters does not improve agreement with Census. Future work will focus on comparing Census and EHR estimates using individual-level data.
Introduction Up-to-date, high-quality estimates of population and households are essential for planning the provision of local and central infrastructure. Objectives We aimed to derive estimates of population size, and household numbers and size on Census date (21/03/2021) using north-east London primary care Electronic Health Records (EHR) and calculate levels of their agreement with the publicly available official Census 2021 estimates to assess if health data have the potential to be used to create reliable statistics. Methods We compared EHR and Census population estimates by sex, age, local authority, and IMD quintile, and EHR and Census household estimates by number, size, and local authority. We estimated 95% Limits of Agreement between EHR and Census household and population estimates using the Bland and Altman method. In sensitivity analyses, we excluded people with no General Practice encounter within 12 months and compared the adjusted population's size to Census estimate. We compared EHR and administrative Statistical Population Dataset (SPD) to Census population estimates by sex and age, and EHR and Admin-based Occupied Address Dataset (ABOAD) to Census household estimates by local authority and household size. Results EHR population estimate was 2,130,965, i.e. 7.1% higher than Census of 1,990,087. EHR household estimate was 658,264, i.e. 9.1% lower than Census of 724,045. The estimate of population with recent GP encounter was 11.6% lower than the Census estimate. Compared to Census, both SPD and EHR overcounted population of males (10.7%, 7.9% respectively) and females (3.6%, 2.7% respectively). Both ABOAD and EHR had undercounted households compared to Census (-7.3%; -9.1% respectively). Conclusions Reliable, up-to-date populations and households estimates can be derived from health records. High residential mobility increases the complexity of deriving these estimates. Excluding people without GP encounters does not improve agreement with Census. Future work will focus on comparing Census and EHR estimates using individual-level data.
Author Harper, Gill
Fry, Rich
Dezateux, Carol
Wilk, Marta
Dibben, Chris
Firman, Nicola
Author_xml – sequence: 1
  givenname: Marta
  surname: Wilk
  fullname: Wilk, Marta
– sequence: 2
  givenname: Gill
  orcidid: 0000-0002-3492-2076
  surname: Harper
  fullname: Harper, Gill
– sequence: 3
  givenname: Nicola
  orcidid: 0000-0001-5213-5044
  surname: Firman
  fullname: Firman, Nicola
– sequence: 4
  givenname: Chris
  surname: Dibben
  fullname: Dibben, Chris
– sequence: 5
  givenname: Rich
  surname: Fry
  fullname: Fry, Rich
– sequence: 6
  givenname: Carol
  surname: Dezateux
  fullname: Dezateux, Carol
BookMark eNpNkd1qGzEQhUVIIa7jB-jdvIBdSfsn9a4YtzGY-CLttdBKo7XMerVI65a-Tp80yjqUXs1hZvjmDOcjuR_CgIR8YnTDCyHkZ38ebdr8YtSzDZeVuCMLXki5LiUV9__pB7JK6Uwp5azkTc0W5O8uTf6iJz90cArXhKfQ2wR6sDCG8drnSRgSuBguMMa8Gf-A0REBezRTDIM3cELdTyeIaEK06QuYcBl19CkM8NvnwdE5bxBciPA883QPL1NW-bRJsMUhXRPwbAp010Xs9IQW8GYM0yP54HSfcPVel-Tnt92P7dP6cPy-3349rA2rmFhLhrWmtXTMtUw0jUCBzpS0bq1sBUNblaXOjdrQpkEnWuqoK6mmwgpGmSyWZH_j2qDP6v1ZFbRXcyPETumYHfeoBNdGtIZKJ1lp60q0teG8KCtheJZtZrEby8SQUkT3j8eomjNTc2Zqzky9ZVa8AjGxkdo
ContentType Journal Article
DBID AAYXX
CITATION
DOA
DOI 10.23889/ijpds.v10i1.2958
DatabaseName CrossRef
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
DatabaseTitleList CrossRef

Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals (ODIN)
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Economics
EISSN 2399-4908
ExternalDocumentID oai_doaj_org_article_82ac8bc09f914d658b6c223458c2b6cb
10_23889_ijpds_v10i1_2958
GroupedDBID AAFWJ
AAYXX
ADBBV
AFPKN
ALMA_UNASSIGNED_HOLDINGS
BCNDV
CITATION
GROUPED_DOAJ
M~E
OK1
RPM
ID FETCH-LOGICAL-c1518-91e6a069f1fb18778e8efc406bd9b81ed544afc46c077ef8b0f0f40a08d810193
IEDL.DBID DOA
ISSN 2399-4908
IngestDate Mon Dec 01 19:26:11 EST 2025
Thu Nov 27 01:07:28 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
License http://creativecommons.org/licenses/by/4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c1518-91e6a069f1fb18778e8efc406bd9b81ed544afc46c077ef8b0f0f40a08d810193
ORCID 0000-0002-3492-2076
0000-0001-5213-5044
OpenAccessLink https://doaj.org/article/82ac8bc09f914d658b6c223458c2b6cb
ParticipantIDs doaj_primary_oai_doaj_org_article_82ac8bc09f914d658b6c223458c2b6cb
crossref_primary_10_23889_ijpds_v10i1_2958
PublicationCentury 2000
PublicationDate 2025-11-24
PublicationDateYYYYMMDD 2025-11-24
PublicationDate_xml – month: 11
  year: 2025
  text: 2025-11-24
  day: 24
PublicationDecade 2020
PublicationTitle International journal of population data science
PublicationYear 2025
Publisher Swansea University
Publisher_xml – name: Swansea University
SSID ssj0002142761
Score 2.3105073
Snippet IntroductionUp-to-date, high-quality estimates of population and households are essential for planning the provision of local and central infrastructure....
Introduction Up-to-date, high-quality estimates of population and households are essential for planning the provision of local and central infrastructure....
SourceID doaj
crossref
SourceType Open Website
Index Database
SubjectTerms census comparison study
data representativeness
electronic health records
household estimates
population estimates
Title Estimating households and populations from primary care electronic health records: comparison with Office for National Statistics Census 2021 aggregated estimates
URI https://doaj.org/article/82ac8bc09f914d658b6c223458c2b6cb
Volume 10
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals (ODIN)
  customDbUrl:
  eissn: 2399-4908
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0002142761
  issn: 2399-4908
  databaseCode: DOA
  dateStart: 20170101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2399-4908
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0002142761
  issn: 2399-4908
  databaseCode: M~E
  dateStart: 20170101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1LS8QwEA4igl7EJ64v5uBJqCY12yTedFnxoKsHFW8lSVNdkbpY9eiP8Zc609S1Ny9eSgmlpDPTzCOZ72NsT9DmXGZ9oo6cTaRSWWIb1tSA8QQmRF42Xa53F2o00vf35rpD9UVnwiI8cBTcoU6t185zUxohC_SXLvPo0mRf-xRvHa2-GPV0kilagwlIDBP0uI2JXkmbw_HTpKgPPgQfY1poiOK944g6eP2NYzlbYottRAgncSbLbCZUK2z-p2G4XmVfQ_wNKbCsHuARE_VAO0Y12KqAyZR-qwZqFIFJBI8AOtAFvxQ3ENsdIZZk6mPwU_pBoEosXBGSRACMYKFFyn4GikMjjDMMiBCjhhS_GOwDpuhUfCsgxImFeo3dng1vBudJS62QeHTxtMSFzPLMlKJ0Qiulgw6lR-fuCuO0CEVfSosDmedKhVI7XvJScst1QYhg5midzVYvVdhgYEKQ0mjlCwytSqms8UQTmKLehHHW99j-j5zzVgg5Zh6NUvJGKXmjlJyU0mOnpInpgwR-3QygSeStSeR_mcTmf7xkiy2kZLRCJKncZrNvr-9hh835D5T8625jbXi9_Bx-AyV_3zU
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Estimating+households+and+populations+from+primary+care+electronic+health+records%3A+comparison+with+Office+for+National+Statistics+Census+2021+aggregated+estimates&rft.jtitle=International+journal+of+population+data+science&rft.au=Marta+Wilk&rft.au=Gill+Harper&rft.au=Nicola+Firman&rft.au=Chris+Dibben&rft.date=2025-11-24&rft.pub=Swansea+University&rft.eissn=2399-4908&rft.volume=10&rft.issue=1&rft_id=info:doi/10.23889%2Fijpds.v10i1.2958&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_82ac8bc09f914d658b6c223458c2b6cb
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2399-4908&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2399-4908&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2399-4908&client=summon