Mohan Atreya Archives

Uncategorized

GPU/Neocloud Billing using Rafay’s Usage Metering APIs

by Mohan Atreya September 13, 2025 0

Cloud providers offering GPU or Neo Cloud services need accurate and automated mechanisms to track resource consumption. Usage data becomes the foundation for billing,... Read more.

Uncategorized

Deep Dive into nvidia-smi: Monitoring Your NVIDIA GPU with Real Examples

by Mohan Atreya August 24, 2025 0

Whether you’re training deep learning models, running simulations, or just curious about your GPU’s performance, nvidia-smi is your go-to command-line... Read more.

Uncategorized

Introduction to Dynamic Resource Allocation (DRA) in Kubernetes

by Mohan Atreya August 23, 2025 0

In the previous blog, we reviewed the limitations of Kubernetes GPU scheduling. These often result in: Resource fragmentation – large portions of GPU memory... Read more.

Uncategorized

Rethinking GPU Allocation in Kubernetes

by Mohan Atreya August 20, 2025 0

Kubernetes has cemented its position as the de-facto standard for orchestrating containerized workloads in the enterprise. In recent years, its role has expanded... Read more.

Uncategorized

Understanding ArgoCD Reconciliation: How It Works, Why It Matters, and Best Practices

by Mohan Atreya August 4, 2025 0

ArgoCD is a powerful GitOps controller for Kubernetes, enabling declarative configuration and automated synchronization of workloads. One of its core functions... Read more.

Uncategorized

Choosing the Right Fractional GPU Strategy for Cloud Providers

by Mohan Atreya July 14, 2025 0

As demand for GPU-accelerated workloads soars across industries, cloud providers are under increasing pressure to offer flexible, cost-efficient, and isolated access... Read more.

Uncategorized

Demystifying Fractional GPUs in Kubernetes: MIG, Time Slicing, and Custom Schedulers

by Mohan Atreya July 11, 2025 0

As GPU acceleration becomes central to modern AI/ML workloads, Kubernetes has emerged as the orchestration platform of choice. However, allocating full GPUs for... Read more.

Uncategorized

Custom GPU Resource Classes in Kubernetes

by Mohan Atreya July 10, 2025 0

In the modern era of containerized machine learning and AI infrastructure, GPUs are a critical and expensive asset. Kubernetes makes scheduling and isolation easier—but... Read more.

Uncategorized

The Rise of AI Agents: From Zero to Production

by Mohan Atreya July 4, 2025 0

Artificial Intelligence (AI) has moved far beyond simple chat bots and rigid automation. At the frontier of this evolution lies a powerful new paradigm : AI Agents.... Read more.

Uncategorized

Configure and Manage GPU Resource Quotas in Multi-Tenant Clouds

by Mohan Atreya June 30, 2025 0

In multi-tenant GPU cloud environments, effective resource management is critical to ensure fair usage and prevent contention. GPU resource quotas allow organizations... Read more.

LIVE WEBINAR | OCT. 21 : From AI PODs to GPU Cloud: How Cisco and Rafay Deliver Production-Ready, Multi-Tenant AI Infrastructure

Author: Mohan Atreya

Author

GPU/Neocloud Billing using Rafay’s Usage Metering APIs

Deep Dive into nvidia-smi: Monitoring Your NVIDIA GPU with Real Examples

Introduction to Dynamic Resource Allocation (DRA) in Kubernetes

Rethinking GPU Allocation in Kubernetes

Understanding ArgoCD Reconciliation: How It Works, Why It Matters, and Best Practices

Custom GPU Resource Classes in Kubernetes

The Rise of AI Agents: From Zero to Production

Configure and Manage GPU Resource Quotas in Multi-Tenant Clouds

Open Source