TBGA Open Dataset

A large-scale Gene-Disease Associaton dataset for biomedical relation extraction.

It allows us to train and test Biomedical Relation Extraction (BioRE) models to be used to extract relevant medical facts from the literature and connect them with the facts extracted from the ExaMode medical reports.

Previous releases were named GDAa and GDAb.

Available in its Zenodo webpage.