The project aims to demonstrate the innovation potential of unstructured data in electronic health record (EHR) systems.
Unstructured textual information adds contextual information not found elsewhere in the EHR, however it is the most difficult to exploit or share.
Sharing unstructured data such as clinical text presents significant technical challenges since it is difficult to provide privacy guarantees. In addition to privacy challenges, individual countries have EHR data in their native languages. This presents special challenges for processing clinical text using data-driven AI algorithms.
In this project, a federated health data network will be developed, geared towards secondary use of health data. The project utilizes distributed machine learning, specifically federated learning, to ensure data privacy and ownership. The federated health data network will be built upon privacy-preserving and secure distributed training of multilingual clinical language models in Norwegian, Swedish, Danish, Finnish, and Estonian. Additionally, the solution incorporates a distributed ledger with smart contracts and blockchain technology, enhancing transparency and security.
Two use cases will be used to demonstrate the innovation potential of unstructured clinical text data: the detection of medical implants and the detection of adverse drug reactions (ADRs).
Before an MRI-examination, it is important to have detailed model knowledge about the medical implants that an individual patient as such can cause serious harm to the patient, or even death. Automatic analysis of clinical text can more rapidly identify new and serious ADRs.
Today, the process requires reading through the entire patient record to establish if the patient has a medical implant and which model.
The model to be developed in the project will be able to reduce the time for finding existing implants and increase the number of devices that can be detected.
The solution includes a pipeline for handling clinical text in multiple Nordic languages, privacy-preserving distributed training using federated learning, and the integration of a distributed ledger for enhanced transparency and security.
The end-product will be a privacy-preserving virtual data warehouse with different user levels for data analysis and research purposes. The implementation involves leveraging large EHR datasets, existing natural language processing tools, and a national primary healthcare infrastructure.
The project aims to advance the field of healthcare data sharing and analysis, unlock the potential of unstructured data in EHR systems and enable innovative advancements in healthcare while maintaining privacy and security.
The project will be led by the Norwegian Centre for E-health Research in collaboration with the partners:
- University of Turku (FIN)
- Stockholm University (SWE)
- County Council of Östergötland/Linköping University Hospital (SWE)
- DNV (NO)
- University of Copenhagen (DK)
- University of Tartu (EST)
- Omilon (DK)
- Cambio (SWE)
This project is one out of two projects funded under the Life Science and Health Tech program and the AI and Data program and supported by the Nordic Ministerial Council for Digitalization, which encompasses cooperation between the Nordic countries and Estonia, Latvia and Lithuania.