Learning Track

AI Engineering

RAG, multi-agent systems, prompt engineering, token economics, evaluations and guardrails — the production discipline of building with LLMs.

7+

Topics

7

Published

100%

Free

Browse articles ↓

7 articlesin AI Engineering

How to Build a Production-Ready AI System (Azure OpenAI + AI Search — Real Architecture)

How to Build a Production-Ready AI System (Azure OpenAI + AI Search — Real Architecture)

Azure OpenAI + AI Search + embeddings — real-world architecture for production AI systems, including legacy data, orchestration, hybrid retrieval, cost control, and failure modes.

18 Feb 202615 min

Token Economics: Understanding and Optimizing LLM Costs

Token Economics: Understanding and Optimizing LLM Costs

A practical guide to understanding token pricing, measuring real costs, and implementing optimization strategies — caching, prompt compression, model routing.

12 Feb 202610 min

Building Reliable AI Agents with Semantic Kernel

Building Reliable AI Agents with Semantic Kernel

Plugin architecture, memory, planners, and error handling patterns for building production AI agents in .NET with Semantic Kernel.

31 Jan 202611 min

Building a Personal AI Knowledge Base

Building a Personal AI Knowledge Base

How to build a personal RAG system over your notes, bookmarks, and documents — using embeddings, vector search, and a conversational interface.

25 Jan 202611 min

When to Fine-Tune vs Few-Shot vs RAG

When to Fine-Tune vs Few-Shot vs RAG

A decision framework for choosing between fine-tuning, few-shot prompting, and RAG for production LLM applications.

22 Jan 202610 min

The AI Gateway Pattern: Why Every Production LLM Needs One

The AI Gateway Pattern: Why Every Production LLM Needs One

API Management, rate limiting, semantic caching, and cost control with Azure APIM as an AI Gateway.

21 Jan 202610 min

Designing RAG Systems That Actually Scale

Designing RAG Systems That Actually Scale

Chunking strategies, embedding pipelines, retrieval patterns, and when RAG breaks down in production systems.

14 Jan 202612 min