Healthcare and Life Sciences
Bristol Myers Squibb (BMS) faced a fragmented drug development landscape, with the emergence of AI in the biopharma industry impacting its computational needs—factors like GPU scarcity and rising cloud costs were hindering scientists’ ability to focus on critical research. They needed a powerful AI solution to accelerate its drug discovery processes, keep costs down, and empower their scientists with cutting-edge technologies, keeping them at the forefront of innovation and ahead of a complex and changing regulatory environment.
BMS and NVIDIA worked together to establish a centralized, modern infrastructure to accelerate drug discovery using NVIDIA DGX SuperPOD™, managed by Equinix and supported by Mark III’s AI expertise. The impact was significant—the platform achieved 55% overall cost savings compared to BMS’s prior model, ensuring cost-effective data movement and the ability to flexibly scale resources as needed. BMS is using DGX SuperPOD for various use cases, including developing foundational oncology models, predicting patient outcomes using large language models, and drug target discovery.
Bristol Myers Squibb
Equinix
Mark III Systems
Generative AI / LLMs
NVIDIA DGX
NVIDIA AI Enterprise
Bristol-Myers Squibb (BMS), one of the world’s leading pharmaceutical companies, has been pioneering healthcare solutions since the early 1800s. Renowned for breakthrough cancer therapies which empower the immune system to fight tumors, BMS has also made significant strides in cell-based treatments and protein degradation technologies. Their innovations across oncology, hematology, immunology, neuroscience, and cardiovascular disease have resulted in life-changing treatments serving countless patients and cementing their role as a leader in transforming patients’ lives.
However, the path to bringing these groundbreaking drugs to market is a monumental challenge, and the stakes have never been higher. A complex and changing regulatory and competitive environment, and rapidly advancing technology are all converging to raise the bar. To stay ahead, BMS sought to accelerate drug discovery and reduce costs by empowering their scientists with a powerful AI computing platform.
Bristol Myers Squibb
Innovation is often grounded in two more fundamental traits: curiosity and courage. To bring innovation to any process you must be curious as to “why” things are the way they are today, and you need the courage to try something new. This was the insight behind BMS’s push to create a deep and powerful AI capability within the research community.
“Mindful of BMS’s unique research and development commitments, we brought together teams of researchers and technologists to build a transformational, technology-powered capability. This cross-functional collaboration was underpinned by what we internally refer to as taking a ‘First Principles’ approach—a commitment to explore every challenge with a fresh perspective. That mentality drove every aspect of this initiative, from team structure and operations to resource allocation, skill matching, technology procurement and integration, strategic partnerships, and even communications. By building this solution from the ground up, we created a modern, fit-for-purpose research technology capability designed to meet BMS’s rapidly evolving research and development needs,” said Bill Mayo, Senior Vice President, Research IT at BMS.
While BMS has been investing in computational science and AI capabilities for years, the sudden, explosive growth left them with a fragmented landscape that no longer met the needs of today’s scientists. With strict resource constraints, BMS needed a more efficient solution to manage its systems. BMS has also leveraged cloud-based technology for research computing for several years, taking advantage of speed to deploy, flexibility to change, and the ability to scale as needed.
In 2023 however, it was clear that the GPU market had different dynamics. GPU scarcity drove cloud costs up and limited availability. Uncertainty in how they would be consumed and the pace of change made forecasting almost impossible. The net effect was that researchers were spending time trying to predict their needs and worrying about costs instead of focusing on what matters—their research. These hurdles prevented scientists from efficiently conceiving of and running new experiments, creating high opportunity costs and slowing innovation for patients.
BMS knew it needed to change how it provided compute for research, as neither traditional cloud nor traditional on-prem approaches were fully serving scientific needs. “To guarantee GPU availability and maximize computational resources, we decided to establish a centralized and modernized infrastructure for drug discovery. To pull this off, we needed a proven platform that could scale with us as we progressed along this journey. That is why we chose the NVIDIA DGX SuperPOD architecture, which included the capability to expand to cloud as needed, a colocation strategy for hosting, and an acceleration partner to clear all the early hurdles,” said Mayo.
NVIDIA DGX™ partner Equinix expertly managed the DGX SuperPOD infrastructure, colocation data center, and interconnectivity, while Mark III, an NVIDIA solutions partner, delivered crucial AI and operational expertise.
“Mark III’s expertise was the catalyst behind our rapid evolution from isolated, single-node computing to seamless, high-performance multi-node training and ensured we not only optimized our systems but continuously advanced through routine audits and precise workload sizing,” said Brian Wong, Director of Research Computing at BMS. “The Equinix Private AI with DGX solution provided a ready-to-run AI platform and seamless hybrid AI development environment, without the operational headaches commonly associated with on-premises equipment,” added Wong.
As a result, scientists gained computational resources to scale up and out without additional headcount or expenses for system administration. Wong added, “Leveraging DGX SuperPOD with Equinix’s seamless integrations with public cloud providers ensured cost-effective data movement while achieving 55% overall cost savings compared to the prior model. The cluster has demonstrated peak utilization rates of up to 90% during certain periods. Our scientists can now easily adjust resources to meet workload demands, increasing nodes for large language model (LLM) training when needed and reallocating them to deep learning tasks as required.”
“Leveraging DGX SuperPOD with Equinix’s seamless integrations with public cloud providers ensured cost-effective data movement while achieving 55% overall cost savings compared to the prior model.”
Brian Wong
Director of Research Computing at BMS
After becoming operational in March 2024, the DGX SuperPOD provided a centralized AI platform for BMS research teams, offering a wealth of information and services. One of the projects was the development of a large-scale foundational model for oncology utilizing millions of CT scans from clinical trials. Manual analysis of these scans would have been impractical and the consolidated analysis virtually impossible.
To address this, the team used NVIDIA MONAI to process and transform the data, training it on the DGX SuperPOD to automatically segment lesions, tumors, and organs in standard CT scans. They enhanced the model’s robustness by training it on both internal and public data, using self-supervised learning and mask image modeling. This involved masking 80%–90% of thousands of CT scans and having the model reconstruct the missing image data.
The results were transformative. The model not only matched human accuracy in lesion detection, but surpassed it by quantifying complex features like texture and heterogeneity, aspects previously reliant on subjective interpretation, in near real time. The impact on the pace of research was profound, with improvements in both segmentation accuracy and processing speed. In addition, the model can now be fine-tuned for downstream tasks like organ segmentation and brain metastases detection, paving the way for further advances in drug discovery.
“Our scientists can now easily adjust resources to meet workload demands, increasing nodes for large language model (LLM) training when needed and reallocating them to deep learning tasks as required.”
Brian Wong
Director of Research Computing at BMS
In immunotherapy oncology, predicting who will respond to a treatment is a major challenge. BMS researchers tackled this using large language models and transformers to analyze complex clinical trial data. The innovation was to treat diverse data—including genomics, lifestyle, and treatment details—as grammatical sentences, similar to natural language processing. By embedding this data alongside outcomes like survival and adverse events, BMS built a model that predicts patient outcomes with unprecedented accuracy.
Success ultimately hinged on training models with extensive data, combining internal clinical data, public datasets, and scientific literature. Leveraging the DGX SuperPOD’s computational power, BMS models outperformed standard baseline transformers, transforming their data into actionable insights that could revolutionize oncology treatment strategies.
With dedicated compute resources, extensive data, deep technical expertise, and urgent scientific questions, researchers can now easily launch high-potential experiments. “By combining curiosity and courage with technology skills and capacity, we have assisted our research community in making innovative discoveries, all in the service of our patients,” adds Mayo. Looking forward, the successful implementation of the DGX SuperPOD has positioned BMS to leverage AI for continued innovation and advancements in drug discovery across all therapeutic areas and scientific functions.
NVIDIA DGX SuperPOD offers leadership-class accelerated infrastructure and scalable performance for the most challenging AI workloads—with industry-proven results.