Project “NVIDIA HPC Infiniband Homelab GPU Cluster”: Part 3: RDMA Performance Testing

The project review outlines the successful construction and configuration of a 3-node GPU cluster with InfiniBand networking. It details performance testing procedures and confirms healthy RDMA performance with minimal tuning.

Project “NVIDIA HPC Infiniband Homelab GPU Cluster”: Part 2: Infiniband Setup

The document details the installation and configuration of three Mellanox ConnectX-4 Adapters across multiple servers. It covers verifying detection, driver loading, InfiniBand setup, subnet management, and IP over InfiniBand configuration for effective connectivity and testing in a lab environment.

Project “NVIDIA HPC Infiniband Homelab GPU Cluster”: Part 1: Project Overview

Introduction InfiniBand is a mature interconnect technology known for high bandwidth and low latency. It has long been used in supercomputing and HPC environments, and has also been deployed in certain storage and clustered infrastructure designs as an alternative to Fibre Channel. More recently, InfiniBand has seen strong continued adoption in large-scale AI and GPU … Continue reading Project “NVIDIA HPC Infiniband Homelab GPU Cluster”: Part 1: Project Overview