Prashant Dudami

Blog

Deep dives into AI architecture, LLM optimization, and enterprise implementation strategies. Practical insights from real-world projects.

January 20, 2026 · 5 min read
Optimizing Data for LLMs: An Introduction to TOON
How Token Optimized Object Notation reduces LLM token consumption by 40-60% for enterprise workloads. A practical guide to data representation optimization.
LLM, Token Optimization, Enterprise AI, TOON, Cost Optimization
January 15, 2026 · 6 min read
API vs. Self-Hosted LLMs: A Cost Analysis Framework
When does it make sense to run your own models? A decision framework for enterprise architects with real cost modeling and trade-off analysis.
LLM, Enterprise AI, Cost Analysis, Self-Hosted, Cloud Architecture
January 13, 2026 · 5 min read
Why Your RAG System Fails: The Data Architecture Problem
Most RAG implementations fail not because of the LLM, but because of inadequate data architecture. Here's how to fix it.
RAG, Data Architecture, LLM, Enterprise AI, Data Lake
January 10, 2026 · 4 min read
Claude Code with Gemini: A Setup Guide
How to configure Claude Code to use Google's Gemini models through Claude Code Router, avoiding the need for an Anthropic API subscription.
Claude Code, Gemini, LLM, Developer Tools, Tutorial

© 2026 Prashant Dudami. All rights reserved.