Skip to content

 

  • Facebook
  • X
  • Instagram
  • YouTube
  • LinkedIn
Architect's Brief

Architect's Brief

Tag: Large Language Models

Insights into LLM capabilities, limitations, and their role in production AI systems.

  • AI Cost Optimization

    Token costs: what actually moves the needle in production

    The real problem If your LLM bill surprised you last month, it probably was not the flashy features. It was the quiet stuff you never show the user: bloated system…

    by

    sudaangi
    March 19, 2025

Category Name

  • Generative AI in Production

    Why Most RAG Architectures Break Under Real User Load

    by

    sudaangi
    December 18, 2025
  • AI Architecture & System Design

    Why Your RAG System Retrieves the Wrong Data (and How to Fix It)

    by

    sudaangi
    December 3, 2025
  • AI Architecture & System Design

    The real cost breakdown of running LLM apps on AWS

    by

    sudaangi
    November 21, 2025

Recent Posts

  • Generative AI in Production

    Why Most RAG Architectures Break Under Real User Load

  • AI Architecture & System Design

    Why Your RAG System Retrieves the Wrong Data (and How to Fix It)

  • AI Architecture & System Design

    The real cost breakdown of running LLM apps on AWS

  • MLOps & LLMOps

    AI Observability: Stop Guessing, Start Instrumenting

Categories

  • AI Architecture & System Design 13
  • AI Cost Optimization 5
  • AI Pitfalls & Lessons Learned 4
  • AI Strategy & Leadership 3
  • Generative AI in Production 8
  • MLOps & LLMOps 7

Sudarshan

Angirash

Follow Us

  • Facebook
  • X
  • Instagram
  • YouTube
  • LinkedIn

About Me

I help engineering teams design, build, and scale production-grade GenAI and multi-agent AI systems. From architecture decisions to working code — I have done it at AWS scale and delivered 4 products solo.

Email Us: sudaangi@techtonicis.com

Editor Picks

  • Why Most RAG Architectures Break Under Real User Load

  • Why Your RAG System Retrieves the Wrong Data (and How to Fix It)

  • The real cost breakdown of running LLM apps on AWS

Popular Posts

  • Why Most RAG Architectures Break Under Real User Load

  • Why Your RAG System Retrieves the Wrong Data (and How to Fix It)

  • The real cost breakdown of running LLM apps on AWS

Navigation

  • Home
  • Career
  • Projects
  • Consulting
  • Publications
  • Products
  • Contact