Section 01
PRIME-CVD: Open-Source Privacy-Protected Dataset for Medical Informatics Education
PRIME-CVD is an open-source educational dataset developed by UNSW Health Big Data Research Center (CBDRH). It generates 50,000 simulated patient records via causal Directed Acyclic Graph (DAG), offering two versions: clean analysis-ready queue and real EMR-style "dirty" data. It supports teaching of causal inference, survival analysis, data cleaning, etc., while ensuring full privacy protection.