GPU/Neocloud Billing using Rafay’s Usage Metering APIs
Cloud providers offering GPU or Neo Cloud services need accurate and automated mechanisms to track resource consumption. Usage data becomes the foundation for billing,... Read more.
Deep Dive into nvidia-smi: Monitoring Your NVIDIA GPU with Real Examples
Whether you’re training deep learning models, running simulations, or just curious about your GPU’s performance, nvidia-smi is your go-to command-line... Read more.
Introduction to Dynamic Resource Allocation (DRA) in Kubernetes
In the previous blog, we reviewed the limitations of Kubernetes GPU scheduling. These often result in: Resource fragmentation – large portions of GPU memory... Read more.
Rethinking GPU Allocation in Kubernetes
Kubernetes has cemented its position as the de-facto standard for orchestrating containerized workloads in the enterprise. In recent years, its role has expanded... Read more.
Understanding ArgoCD Reconciliation: How It Works, Why It Matters, and Best Practices
ArgoCD is a powerful GitOps controller for Kubernetes, enabling declarative configuration and automated synchronization of workloads. One of its core functions... Read more.
Choosing the Right Fractional GPU Strategy for Cloud Providers
As demand for GPU-accelerated workloads soars across industries, cloud providers are under increasing pressure to offer flexible, cost-efficient, and isolated access... Read more.
Demystifying Fractional GPUs in Kubernetes: MIG, Time Slicing, and Custom Schedulers
As GPU acceleration becomes central to modern AI/ML workloads, Kubernetes has emerged as the orchestration platform of choice. However, allocating full GPUs for... Read more.
Custom GPU Resource Classes in Kubernetes
In the modern era of containerized machine learning and AI infrastructure, GPUs are a critical and expensive asset. Kubernetes makes scheduling and isolation easier—but... Read more.
The Rise of AI Agents: From Zero to Production
Artificial Intelligence (AI) has moved far beyond simple chat bots and rigid automation. At the frontier of this evolution lies a powerful new paradigm : AI Agents.... Read more.
Configure and Manage GPU Resource Quotas in Multi-Tenant Clouds
In multi-tenant GPU cloud environments, effective resource management is critical to ensure fair usage and prevent contention. GPU resource quotas allow organizations... Read more.