RHEL 10 – Enable Health Monitoring for NVIDIA GPUs Using DCGM Exporter

Nvidia Datacenter GPU Manager (DCGM) facilitates GPU health monitoring, performance telemetry, and diagnostics for NVIDIA GPUs on servers. This post outlines installation and setup for RHEL 10, including driver validation and metric visualization.

Dell iDRAC Service Module on RHEL 10.1

The Dell iDRAC Service Module (ISM) is a tool that can be used for better integration between the Dell iDRAC and a running OS. It can provide additional monitoring and metrics to Idrac by brigding the gap between the OS and the underlying Dell hardware. iSM collects data from both the operating system and hardware … Continue reading Dell iDRAC Service Module on RHEL 10.1

Homelab: Shushing a loud Dell Server.

Rack mount servers are notoriously loud. They are designed to run in data centers which and not next to your head. In a data center no one really notices or cares if a machine's fans are spinning faster than they need to. 2RU Servers are bad, but 1U are even worse as the smaller the … Continue reading Homelab: Shushing a loud Dell Server.

Essential Commands to Monitor Nvidia GPUs in Linux

Identify Your GPU Via the Linux CLI Identify that your card is recognized by the OS via the CLI command below, hwinfo # hwinfo --gfxcard --short graphics card: nVidia TU104GL [Tesla T4] nVidia TU104GL [Tesla T4] Matrox G200eR2 Primary display adapter: #58 Or you can see similar output with lshw # lshw -C display *-display … Continue reading Essential Commands to Monitor Nvidia GPUs in Linux

Finding and Mapping Jetson OS and JetPack Versions on the Nvidia Jetson

Updated - 2/24/2026 Below are all the methods that I have found to either find your Jetson OS version, or your Jetpack version (which includes Jetson OS version, Ubuntu version, CUDA Version, NVIDIA drivers, and firmware). First let's review the matrix and see how JetsonOS Maps to JetPack version (along with Ubuntu version, CUDA version, … Continue reading Finding and Mapping Jetson OS and JetPack Versions on the Nvidia Jetson

Installing the GPU Power Supply Expansion Board into the Dell T620

Introduction I recently picked up a couple of used Dell T602s for my homelab for AI/ML project work. Dell Tower form factor servers are very attractive to homelabbers due to their availability, their low costs, the fact that they are rather low noise, and due to the fact that they are easily expandable. For example, … Continue reading Installing the GPU Power Supply Expansion Board into the Dell T620