University of Hamburg / Brigham and Women's Hospital
A transformer foundation model trained on a biomedical knowledge graph for zero-shot drug-discovery inference across indications, targets, ADRs, contraindications, and off-label uses.
Drug discovery depends on reasoning over a dense web of relationships — which drugs hit which targets, which conditions they treat, which adverse effects they cause, and which patients they must not be given to. Network medicine encodes this web as a biomedical knowledge graph, but most predictive tools built on such graphs are trained one task at a time. NetMedGPT instead treats the knowledge graph itself as the substrate for a single foundation model that can answer many drug-discovery questions at once.
NetMedGPT is a transformer-based foundation model pretrained on a large-scale biomedical knowledge graph via masked-token prediction — the same self-supervised objective used by language models, applied here to the entities and relations of network medicine rather than to natural-language text. It was developed by the Baumbach group at the University of Hamburg, with network medicine pioneer Joseph Loscalzo (Harvard / Brigham and Women's Hospital) among the authors, and released as a bioRxiv preprint in January 2026.
By learning the structure of the biomedical graph, NetMedGPT performs zero-shot inference across multiple downstream tasks without task-specific retraining, distinguishing it from sequence-based biological foundation models. It is accompanied by a hosted, interactive chat-style interface that lets users query the model directly.
NetMedGPT is a transformer pretrained with a masked-token-prediction objective over a biomedical knowledge graph, so that the model learns to fill in missing entities and relations — the network-medicine analogue of masked language modeling. This self-supervised pretraining yields representations that transfer zero-shot to five distinct drug-discovery prediction tasks: indications, targets, ADRs, contraindications, and off-label uses. Across these tasks the authors report area-under-the-precision-recall-curve (AUPR) gains of approximately 2.2–26% over specialized baseline methods, and validate predictions against external evidence including ClinicalTrials.gov. Unlike sequence-based foundation models for proteins or genomes, NetMedGPT operates entirely on graph structure. The preprint is released under a permissive CC BY license and is paired with a hosted interactive interface; at the time of writing no downloadable weights or code repository is linked.
NetMedGPT is aimed at translational researchers, pharmacologists, and clinicians working on drug repurposing, safety assessment, and target identification. Because a single pretrained model answers questions about indications, targets, adverse reactions, contraindications, and off-label use, it can serve as a general-purpose reasoning layer over biomedical knowledge — for example proposing repurposing candidates, flagging potential adverse reactions or contraindications, or surfacing plausible off-label uses for further investigation. The generative subnetwork feature and the hosted chat interface make these predictions explorable in a network-medicine context.
NetMedGPT demonstrates that the foundation-model recipe — large-scale self-supervised pretraining followed by zero-shot transfer — can be applied to biomedical knowledge graphs, not just to molecular sequences, and that doing so can outperform task-specific models across several drug-discovery problems. The involvement of a founder of network medicine and the validation against ClinicalTrials.gov lend credibility, and the permissive license and public interface support uptake. As a 2026 preprint its results await peer review, and the absence of a downloadable model or code is a current limitation for reproduction, though the hosted interface lets others probe the model's behavior.