NVIDIA-Accelerated Data Science

The only hardware-to-software stack optimized for data science.

解決方案
特色
透過 GPU 加速進行中的業務
RAPIDS
合作夥伴

解決方案
特色
透過 GPU 加速進行中的業務
RAPIDS
合作夥伴

GPU 加速你的資料科學工作流程

資料科學工作流程一直以來都十分緩慢且缺乏效率，仰賴 CPU 來載入、篩選和操控資料，以及訓練和部署模型。GPU 大幅降低基礎架構成本，並為使用 NVIDIA RAPIDS ™ 函式庫的端對端資料科學工作流程提供卓越效能。隨處都能使用 GPU 加速的資料科學，無論是桌上型電腦、資料中心、邊緣端及雲端都沒問題。

功能與優點

提供最佳生產力

減少等待最寶貴分析洞見的時間，並加速投資報酬率。

成效更高

加快機器學習速度最高達 215 倍，並執行更多迭代、增加實驗，以及進行更深度的探索。

經濟實惠

降低資料科學基礎架構成本，同時提升資料中心效率。

RAPIDS 為 Apache Spark 3.0 的 GPU 加速

Apache Spark 3.0 是 Spark 系列的第一個版本，可為分析和人工智慧工作負載提供完全整合且順暢的 GPU 加速效能。無論在本機或雲端，都可以利用 Spark 3.0 (含 GPU ) 的強大功能，且無需變更程式碼。GPU 的突破性效能夠讓企業和研究人員更頻繁訓練更大的模型，最終借助人工智慧功能發揮巨量資料的價值。

深入瞭解

150X

Faster Pandas with cuDF

* Benchmark on Groupy advanced operation (5GB) DuckDB Data Benchmark
HW: Intel Xeon Platinum 8480CL CPU and NVIDIA Grace Hopper™ GPU
SW: pandas v1.5 and cudf.pandas v23.10

5X

Faster Spark with the RAPIDS Accelerator for Spark

* NDS 2.0 benchmarks were run with parquet decimal data @ SF3K with UCX off

CPU-only: 8x n1-standard-32

GPU: 8x g2-standard-16, 8x L4 24GB
SW: Spark RAPIDS 24.02

48X

Faster NetworkX with cuGraph

* Benchmark on PageRank with synthetic dataset having ~16,384 vertices and ~524,288 edges
HW: Intel Xeon Platinum 8480CL CPU and NVIDIA H100 80GB (1x GPU)
SW: NetworkX v3.2 and cuGraph v23.10

使用 NVIDIA GPU 訓練 XGBOOST

GPU 加速 XGBoost 將為全球頂尖機器學習演算法的單一節點與分散式部署帶來顛覆的效能﹐資料科學團隊藉由比 CPU 大幅提升的訓練速度，能夠處理更龐大的資料集、迭代更快速並調整模型，以將預測精準度與商業價值極大化。

Data Prep

XGBoost

End-to-end

CPU: Core i9 | End-to-end time = Data Prep + Conversion + Training + Validation

現在開始瞭解如何以 GPU 加速 XGBoost

立即開始

適用於資料科學的 NVIDIA GPU 解決方案

透過多種不同的 NVIDIA GPU 解決方案，探索無與倫比的加速能力。

PC

著手使用機器學習。

深入瞭解

工作站

全新的資料科學工作站。

深入瞭解

資料中心

適用企業生產的人工智慧系統。

深入瞭解

雲端

多功能加速機器學習。

深入瞭解

透過 GPU 加速進行中的業務

讓機器學習工作流程展現最佳效能、生產力和投資報酬率。

RAPIDS: 資料科學函式庫套件

利用 NVIDIA CUDA-X AI 打造的 RAPIDS 運用 NVIDIA® CUDA® 逾 15 年的開發與機器學習專業知識。這款功能強大的應用軟體能在 GPU 中完整執行端對端資料科學訓練流程，將原本需要好幾天的訓練時間縮短為幾分鐘。

下載原始碼

下載 NGC 容器

RAPIDS, a GPU-accelerated data science platform, is a next-generation computational ecosystem powered by Apache Arrow. The NVIDIA collaboration with Ursa Labs will accelerate the pace of innovation in the core Arrow libraries and help bring about major performance boosts in analytics and feature engineering workloads.

- Wes McKinney, Head of Ursa Labs and Creator of Apache Arrow and Pandas

I got 24x speedup using RAPIDS XGBOOST and can now replace hundreds of CPU nodes, running my biggest ML workload on a single node with 8 GPUs. You made XGBOOST too fast!?

- Streaming Media Company

My previous bottleneck was I/O. …10 minutes to pull in data for 10 stores (about 1 million rows). With RAPIDS, we can pull in data for about 6000 stores (millions of rows) in less than 3 minutes. That scale could have easily taken us 4 days on legacy infrastructure … just plain awesome.

- A mid-market specialty retailer with 6000 stores