Health Informatics

My experience with the Health Informatics data warehouse - pilot project

September 2018 - September 2020

Supervisors

Matthias Görges, Elodie Portales-Casamar

Part of the E2i program and “Work and Learn” at UBC
Special thanks to Dawn Mount and Marissa Gibbard

Background Integrated Data Repositories (IDRs), also referred to as (clinical) data warehouses, integrate several health informatics data sources with specialized analytical tools that facilitate data processing and analysis. IDRs offer several advantages for clinical data reuse, and the number of institutions implementing an IDR has grown steadily over the past decade.
Objectives The architectural choices of major IDRs are highly diverse, and determining their differences can be overwhelming. In this review, we explore the underlying models and common features of IDRs. We provide a high-level overview for those entering the field and propose a set of guiding principles for small- to medium-sized health institutions embarking on IDR implementation.
Methods We reviewed manuscripts published in the peer-reviewed scientific literature between 2008 and 2018 and selected those that specifically describe IDR architectures. Out of 80 shortlisted articles, we found 19 describing 23 different architectures. These IDRs were analyzed for common features and classified according to their data processing and integration solutions.
Results Despite common trends in the selection of standard terminologies and data models, the IDRs examined showed heterogeneity in their underlying architecture design. We identified four common architecture models that use different approaches to data processing and integration. These approaches were driven by the data sources, whether the IDR served a single institution or a collaborative project, the intended primary data user, and the purpose (research-only or including clinical/operational decision-making).
Conclusions IDR implementations are diverse and complex undertakings that benefit from an evaluation of requirements and a definition of scope in the early planning stage. Factors such as data source diversity and the intended users of the IDR influence data flow and synchronization, both of which are crucial in IDR architecture planning.