Simple RAG with Ollama, OpenWebUI, and VectorDB on Ubuntu 22.04

Prerequisites I have already installed Nvidia proprietary drivers and the Nvidia Cuda Toolkit. I documented the install of the Cuda toolkit in an older post which can be found here. Since I have Nvidia GPUs in my host system, and I intend to run some services in containers, I want to make sure that I … Continue reading Simple RAG with Ollama, OpenWebUI, and VectorDB on Ubuntu 22.04

Getting Started with RamaLama with Nvidia Cuda Support On Ubuntu 24.04

Introduction to RamaLama Streamlining AI Deployment with OCI Containers RamaLama is an open-source project developed to simplify AI model deployment and management using OCI (Open Container Initiative) containers. Ramalama enables seamless execution of AI workloads across different hardware configurations, supporting both GPU-accelerated and CPU-based environments. By leveraging container engines like Podman and Docker, RamaLama includes … Continue reading Getting Started with RamaLama with Nvidia Cuda Support On Ubuntu 24.04

Essential Commands to Monitor Nvidia GPUs in Linux

Identify Your GPU Via the Linux CLI Identify that your card is recognized by the OS via the CLI command below, hwinfo # hwinfo --gfxcard --short graphics card: nVidia TU104GL [Tesla T4] nVidia TU104GL [Tesla T4] Matrox G200eR2 Primary display adapter: #58 Or you can see similar output with lshw # lshw -C display *-display … Continue reading Essential Commands to Monitor Nvidia GPUs in Linux

Installing the GPU Power Supply Expansion Board into the Dell T620

Introduction I recently picked up a couple of used Dell T602s for my homelab for AI/ML project work. Dell Tower form factor servers are very attractive to homelabbers due to their availability, their low costs, the fact that they are rather low noise, and due to the fact that they are easily expandable. For example, … Continue reading Installing the GPU Power Supply Expansion Board into the Dell T620