Home

ACCESS Operations API

v3.45.0

CiDeR Resource Detail

All Resources

As of: 2024-11-21 08:32 UTC

Descriptive Name SDSC Voyager Habana Training and Inference Processor based AI System
Info ResourceID voyager.sdsc.access-ci.org
CiDeR Type Compute
Latest Status production from 2022-05-01 till None
Current Statuses production
CiDeR ID 2088
SiteID sdsc.access-ci.org
Description Voyager is a heterogeneous system designed to support complex deep learning AI workflows. The system features 42 Intel Habana Gaudi training nodes, each with 8 training processors (336 in total). Each training node has 512GB of memory and 6.4TB of node local NVMe storage. The Gaudi training processors feature specialized hardware units for AI, HBM2, and on-chip high-speed Ethernet. The on-chip ethernet ports are used in a non-blocking all-to-all network between processors on a node and the remaining ports are aggregated into 6 400G connections on each node that are plugged into a 400G Arista switch to provide scale out of network. Voyager also has two first-generation inference nodes, each with 8 inference processors (16 in total). In addition to the custom AI hardware, the system also has 36 Intel x86 processors compute nodes for general purpose computing and data processing. Voyager features 3PB of storage currently deployed as a Ceph filesystem.
Recommended Use
Access Description
Affiliation ACCESS
Updated At 2024-11-08T16:37:29.761000Z