Milestone in AI for Drug Discovery

625566.jpg

Researcher Jasmin Aschenbrenner loading samples on the crystallography beamline at Diamond Light Source. Credit: Stuart March-DNDi

The UK‑led OpenBind initiative has reached a major milestone with the release of its first publicly available dataset and predictive AI model, a step toward accelerating the discovery of new medicines using artificial intelligence.

The release showcases how engineering the production of AI-ready data is not only feasible but essential to evolving AI tools for scientific fields. 

While AI has introduced a step‑change in predictive accuracy for protein structures, its impact on drug discovery has remained muted, limited above all by the global shortage of reliable experimental data measuring in atomic detail how molecules of drug discovery bind to disease‑related proteins. OpenBind aims to fill this critical gap.

This first release demonstrates that OpenBind’s pipeline is now operational, having generated 800 high-quality measurements in only 7 months. In the past, such large datasets took years to be produced and released.

This integrated operation combines automated chemistry, robust binding measurements and high throughput crystallography with an engineered data release process and AI model. It lays the groundwork for progress in drug discovery, with future data tranches planned to address global‑health challenges such as COVID‑19, malaria, dengue, Zika, and cancer, where rapid development of new treatments remains vital.

The initial dataset reflects invaluable learning from the initiative’s early experimental cycles. Standardized workflows, strong metadata practices and high levels of automation have proven crucial in ensuring the consistency and reproducibility required for AI, while highlighting opportunities to further streamline data handling and release frequency.

Building on this foundation, OpenBind will expand to include more targets, larger chemical series and deeper datasets, alongside community blind‑challenges that will validate AI models for newly generated experimental data.

Ultimately, OpenBind aims to create a global, open data engine capable of supporting the development of faster, more accurate and more equitable therapeutics.

Data from Diamond Light Source

Subscribe to our e-Newsletters!
Stay up to date with the latest news, articles, and events. Plus, get special offers from Labcompare – all delivered right to your inbox! Sign up now!

More News