Support Services for Slurm and Slinky

Overview

Support and Services for Open Source Slurm and Slinky

Open source software gives data center operators, developers, and researchers the most flexibility for workload management, but it can take expert help to use it to its full capacity. From implementation to customization, get direct access to the experts from SchedMD (now part of NVIDIA) who actively develop Slurm and Slinky every day and have deep expertise in managing high-performance computing (HPC) and AI workloads.

Benefits

Experience the Benefits of Support Services for Open Source Software

Direct Access to the Experts

Get expert guidance, personalized instruction, bug fixes, and proactive recommendations for your workload management with direct-to-engineering support services. Standard support includes consulting hours to ensure you get the most from your deployment.

Increased Utilization and Uptime

Keep your AI and HPC applications operating at peak performance with multiple levels of enterprise-grade support and a direct connection to Slurm and Slinky developers. This reduces the time and money lost to disruptions and boosts utilization.

Faster Time to Results

Get faster results with a range of services, including standard support that complements your open source software, expert assistance with new deployments, on-site training, and a dedicated technical account manager (TAM).

Access the Support Portal

Looking to submit a support request? Support tickets can be submitted through the support portal. An email address with your organization's domain is required to validate your support entitlement.

For more information on initial support and service purchases, contact your authorized NVIDIA enterprise partner or your NVIDIA sales team.

What Customers Are Saying About Slurm Support

"We have been a SchedMD (now part of NVIDIA) support customer for nine years. They've always given timely, high-quality responses."

– Technical University of Denmark

Offers

Get Direct-to-Engineering Access

Standard Support

Standard support gives users and administrators direct access to experts, with eight hours of consulting services included. It increases utilization and uptime by reducing the time and money lost to disruptions. Standard support is licensed per node and, if applicable, per GPU, and requires administrators to run the latest version of Slurm or one of the previous two versions. Coverage includes bug fixes, configuration setup and optimization, and Slurm upgrade assistance.

Standard support for Slinky is considered an add-on to Slurm support and is also licensed per node and per GPU.

Deployment Services

NVIDIA offers a proof-of-concept (PoC) service to launch new Slurm and Slinky deployments with expert assistance from the developers of Slurm and Slinky. This service validates configurations, tunes performance, and transfers best practices to your team from day one.

The NVIDIA PoC service for Slurm is considered an add-on to Slurm standard support and requires an active Slurm support entitlement.

On-Site Training

Comprehensive face-to-face training maximizes the potential of HPC environments. This add-on service provides three days of on-site training tailored to site-specific use cases, with hands-on lab workshops led by the experts at SchedMD (now part of NVIDIA) who actively work on Slurm and Slinky.

NVIDIA on-site training for Slurm is considered an add-on to Slurm standard support and requires an active Slurm support entitlement.

Technical Account Manager

This service lets you partner with a designated TAM who knows your Slurm environment and can coordinate support cases end to end, driving timely and effective resolutions. The NVIDIA TAM service for Slurm is considered an add-on to Slurm standard support and requires an active Slurm support entitlement.

NVIDIA Enterprise Support offers TAM services for other products in the NVIDIA portfolio. The TAM for Slurm is a separate service with access to an expert from SchedMD (now part of NVIDIA).

Documentation

Explore Slurm and Slinky Documentation

Slurm Repository

Slurm provides a cluster resource management and job scheduling system for Linux that strives to be simple, scalable, portable, fault-tolerant, and interconnect agnostic. Visit the repository to get started.

Slurm Documentation

Stay up to date on releases, get access to documentation for Slurm users and administrators—including the latest release notes, quick-start guides, and more—and learn how you can contribute to Slurm development.

Slinky Repository

Slinky provides a powerful set of tools for bringing Slurm's capabilities into Kubernetes. It offers users flexibility and ease of use for managing HPC and cloud-native AI workloads. Visit the repository to get started.

Slinky Documentation

Learn more about how the Slinky project allows you to run Slurm on Kubernetes, and get access to the latest updates for slurm-operator and slurm-bridge, quick-start guides, and more.

Next Steps

Get Support for Slurm

SchedMD (now part of NVIDIA) experts are here for you at every step in this fast-paced journey. The support portal lets you create and manage all your Slurm and Slinky support cases.

Renewals

Renewing your support services for Slurm and Slinky helps ensure continuous uptime and optimized utilization. To renew your Slurm and Slinky support, please contact NVIDIA.

Purchase Support

For more information on initial support and service purchases, contact your authorized NVIDIA enterprise partner or your NVIDIA sales team.