Generative AI to promote open science in register based research
Research project
The open science movement aims to foster a replicable and robust scientific environment through transparency and collaboration, advocating for the sharing of research data and code. However, studies involving sensitive data, such as Swedish linked registers, face challenges in adhering to these principles due to privacy concerns. This dilemma is addressed in this project by proposing the creation of sharable synthetic data through the development of generative artificial intelligence methods.
The project goals include creating sharable synthetic data that accurately represents original register data, ensuresreplicability of register studies, and provides privacy guarantees. Additionally, it seeks to establish protocols for creating and using synthetic data in register-based research, facilitating data sharing within the scientific community.
The research team combines expertise in register dataset design, register based observational studies, and advanced machine learning techniques, including deep neural network architectures. By bridging the gap between openness and confidentiality, the project aims to enhance the value of register based research and promote scientific advancement.