IBM at NVIDIA GTC 2025

Browse the content from this year’s event experience
Colorful lenses in overlays
Put data to work for AI with IBM and NVIDIA

Thank you to those who joined us at NVIDIA GTC 2025! Our discussions  highlighted innovative strategies that elevate productivity and efficiency across industries. Together, IBM and NVIDIA are offering the solutions, services and technology to unlock, accelerate and protect data.

We discussed the following topics:

  • AI-ready data: IBM delivers data platforms and services integrated with NVIDIA technology to deliver enterprise-ready AI, no matter where your data resides.
  • Agentic AI: IBM watsonx.ai, Granite models and partnership with NVIDIA help organizations build governance frameworks and performant, reliable and trustworthy agents that integrate into specific enterprise systems to expand operations with agentic AI.
  • The future of AI is open: IBM’s hybrid and open approach to architecture will continue to reduce inference costs and fuel future research projects and new AI applications grounded in business data, further transforming industries.  
Event highlights
AI promise to profits: Maximize business ROI with accelerated computing

AI looks like a goldmine waiting to be tapped, but many organizations discover that success hinges on their ability to wrangle their data. In this session, learn how to unlock, accelerate and protect your data with the NVIDIA accelerated computing platform to minimize costs, address ethical dilemmas, and overcome technical hurdles that make sustainable monetization of AI tricky. We discuss how to solve for common AI delivery model challenges and accelerate time-to-value with customer-facing AI workloads that leverage NVIDIA technologies.

Watch the replay
From project to profits: Unleash AI’s power with a hybrid approach

To unlock AI’s full potential and speed the pace of innovation for specific business needs, organizations must evolve their IT landscape. As AI becomes woven into the fabric of business applications—from core enterprise to consumer-facing services to the edge—there's an increasing focus on reducing cost and maintaining flexibility of deployment. Through demos and conversations with experts, explore how adopting a hybrid platform approach, paired with the power of NVIDIA technologies, removes barriers to AI by delivering better reliability and performance, operational agility and managed costs, no matter your operating environment.

Watch the replay
Faster Triton kernels on NVIDIA Ampere and Hopper

We analyzed the performance of Triton kernels on NVIDIA A100 and H100 GPUs using Nsight Systems and Nsight Compute, which helped gain insights to performance bottlenecks. Modifications of the Triton kernels, using techniques such as SplitK parallelization and others, result in competitive performance results in the same ballpark as cuBLAS.

Enable intelligent storage to process data for AI applications

The common implementation of AI pipelines today is to bring data to AI. This works well when your dataset is relatively small and co-located. When we look at the next step of AI journey, we know one thing for sure: there will be a lot more data in a lot more locations. The effective way to address this challenge is to push AI processing closer to where the data is. This concept is “AI Content-Aware Storage (AI CASt).” The vision of content-aware storage is to enable intelligent storage to process data for AI applications. In this session, we demonstrate the architecture of AI CAST by leveraging NVIDIA Blueprints and NIMs to accelerate the retrieval-augmented generation (RAG) pipeline by incorporating storage and storage metadata in the Continuous Data Ingest and vector DB management.

Watch the replay
How to automate AI transparency: Enhancing model card transparency with minimal effort

Producing model card documentation is a state-of-the-art best practice for advancing AI transparently, and educating enterprise developers and nontechnical influencers like customers, investors, users and policymakers about AI in a clear and uniform manner. Learn how to streamline the creation and management of model cards through the model card generator. Understand how the model card automation engine captures essential metadata during training and from model source code, ensuring comprehensive, consistent and up-to-date documentation for AI models.

Watch the replay
Explore our products

Learn more about the IBM products featured at NVIDIA GTC 2025.

IBM Storage Scale

Leverage a scale-out file and object, software-defined storage platform designed for AI, machine learning and high-performance computing workloads.

Learn more
IBM Fusion

Experience the easiest way to deploy OpenShift applications and harness IBM watsonx™ AI capabilities, while seamlessly integrating virtualization and containerization.

Learn more
IBM Granite

Our third generation of AI language models are here. Fit for purpose and open sourced, these enterprise-ready models deliver exceptional performance against safety benchmarks and across a wide range of enterprise tasks from cybersecurity to RAG.

Learn more
IBM Cloud

Scale efficiently with on-demand access to NVIDIA GPUs that are purpose-built for AI and accelerated data processing, HPC, Visualization use cases across VPC and OpenShift instances.

Learn more
IBM watsonx.ai

IBM® watsonx.ai™ is an enterprise-grade studio for developing AI services and deploying them into your applications of choice―with a collection of the APIs, tools, models and runtimes you need to turn your ideas and requirements into reality.

Learn more
IT technician working in a server room
Accelerating workloads with IBM Storage Scale and Storage Scale System Read the summary
Subscribe to the Think newsletter

Stay informed about cutting-edge breakthroughs in AI, new solutions, industry trends and upcoming events.

Subscribe today