Nvidia Datacenter GPU Manager (DCGM) facilitates GPU health monitoring, performance telemetry, and diagnostics for NVIDIA GPUs on servers. This post outlines installation and setup for RHEL 10, including driver validation and metric visualization.
Fix GPG Check Failed Error on RHEL 10.1
Overview On some RHEL 10.1 installs users are running into this error, post-install, when attempting to install packages via dnf. Unsure if the issue is isolated to users attempting to install RHEL via the full DVD ISO, from the minimal boot ISO, and users deploying RHEL 10.1 via kickstart. GPG Keys are configured as: file:///etc/pki/rpm-gpg/RPM-GPG-KEY-redhat-release … Continue reading Fix GPG Check Failed Error on RHEL 10.1
Configuring LACP on TP-Link SX3008F for RHEL 9/10
This guide details setting up three LACP port-channels on a TP-Link SX3008F switch for RHEL 9/10 hosts, enabling 20Gbe connectivity for efficient NFS backups.
Moving Beyond OMSA: A Guide to Dell iSM Installation on RHEL 10 and PowerEdge R730
The shift from srvadmin (OMSA) to iSM (iDRAC Service Module) marks the end of bloated, "in-band" server management. This occurred between Dell 12th gen and Dell 13th gen server. If you have a 12th Gen Dell Server, you can still leverage Dell srvadmin (Idrac 7). I wrote a post on it here. While OMSA ran … Continue reading Moving Beyond OMSA: A Guide to Dell iSM Installation on RHEL 10 and PowerEdge R730
Dell OpenManage Server Administrator: Comprehensive Guide for Hardware Monitoring (RHEL)(Dell 12 Gen)
Dell OpenManage Server Administrator (OMSA) is Dell’s on-host hardware management and monitoring framework for PowerEdge servers. It runs inside the operating system and provides direct visibility into system hardware such as RAID controllers, physical and virtual disks, power supplies, fans, temperatures, memory, processors, and chassis health. OMSA communicates with the server’s iDRAC and hardware controllers … Continue reading Dell OpenManage Server Administrator: Comprehensive Guide for Hardware Monitoring (RHEL)(Dell 12 Gen)
Install APC Netbotz Advanced View on Ubuntu 24.04
I recently picked up a Netbotz 450 for almost nothing on Ebay. I have always wanted one of these things since I first deployed one in a SunGard facility in Philly about 12-15 years ago. These devices allow you to connect environmental sensors to monitor temperatures, rack intrusion, humidity, noise. You can also connect a … Continue reading Install APC Netbotz Advanced View on Ubuntu 24.04
Fixing Weak SSH Ciphers On a Cisco ASAv
By default the Cisco ASAv has weak ssh config that many modern OS's do not support. ssh admin@10.1.10.56 Unable to negotiate with 10.1.10.56 port 22: no matching host key type found. Their offer: ssh-rsa This error is from the client side ssh client refusing to use any of the host key algorithms that are offered … Continue reading Fixing Weak SSH Ciphers On a Cisco ASAv