The shift from srvadmin (OMSA) to iSM (iDRAC Service Module) marks the end of bloated, "in-band" server management. This occurred between Dell 12th gen and Dell 13th gen server. If you have a 12th Gen Dell Server, you can still leverage Dell srvadmin (Idrac 7). I wrote a post on it here. While OMSA ran … Continue reading Moving Beyond OMSA: A Guide to Dell iSM Installation on RHEL 10 and PowerEdge R730
hardware
Dell OpenManage Server Administrator: Comprehensive Guide for Hardware Monitoring (RHEL)(Dell 12 Gen)
Dell OpenManage Server Administrator (OMSA) is Dell’s on-host hardware management and monitoring framework for PowerEdge servers. It runs inside the operating system and provides direct visibility into system hardware such as RAID controllers, physical and virtual disks, power supplies, fans, temperatures, memory, processors, and chassis health. OMSA communicates with the server’s iDRAC and hardware controllers … Continue reading Dell OpenManage Server Administrator: Comprehensive Guide for Hardware Monitoring (RHEL)(Dell 12 Gen)
Step-by-Step Nvidia CUDA Toolkit Installation Guide: Ubuntu 24.04
The NVIDIA CUDA Toolkit is essential for developing GPU-accelerated applications and includes various tools. Users must uninstall older versions from Ubuntu before installing the latest toolkit from NVIDIA’s website for better performance and compatibility. The summary highlights differences between installation methods regarding version availability, ease, and update management.
Step-by-Step Nvidia CUDA Toolkit Installation Guide: Ubuntu 22.04
"The NVIDIA CUDA Toolkit is a software development kit that helps users create GPU-accelerated applications. It includes libraries, compilers, debuggers, and optimization tools". Since we have an Nvidia GPU we will install it as shown below. Note: If you installed the nvidia-cuda-toolkit from default ubuntu noble repo, uninstall it first as that version is probably too old. … Continue reading Step-by-Step Nvidia CUDA Toolkit Installation Guide: Ubuntu 22.04
Essential Commands to Monitor Nvidia GPUs in Linux
Identify Your GPU Via the Linux CLI Identify that your card is recognized by the OS via the CLI command below, hwinfo # hwinfo --gfxcard --short graphics card: nVidia TU104GL [Tesla T4] nVidia TU104GL [Tesla T4] Matrox G200eR2 Primary display adapter: #58 Or you can see similar output with lshw # lshw -C display *-display … Continue reading Essential Commands to Monitor Nvidia GPUs in Linux
Selecting a GPU for a Dell T620
This post discusses the top GPU options for the Dell T620 server, focusing on compatibility, power, and thermal considerations. It highlights GPUs like the NVIDIA Tesla T4 and Quadro series while offering tips on optimizing performance through cooling upgrades and PCIe configuration.
Running RHEL 8 on Dell R710/610 via Raid Controller Retrofit
A while back, I wrote a blog post that outlined a process of injecting deprecated storage controller drivers into RHEL 8 via a Driver Update Disk. In a nutshell, this process allows you to install RHEL 8 on the R710/R610 (11th Gen)This process worked fine, unless you wanted to yum update your server. Rather than … Continue reading Running RHEL 8 on Dell R710/610 via Raid Controller Retrofit