Models
LLM evaluation, fine-tuning strategies, model selection, benchmarking, and cost-performance analysis.
5 articles in Models
Token Economics: Understanding and Optimizing LLM Costs
A practical guide to understanding token pricing, measuring real costs, and implementing optimization strategies — caching, prompt compression, model routing.
LLM Evaluation Beyond Vibes
Systematic approaches to evaluating LLM outputs — automated metrics, human evaluation frameworks, regression testing, and building evaluation pipelines.
Small Language Models in Production
When and how to use small language models like Phi, Gemma, and Mistral in production — quantization, deployment patterns, and latency-cost trade-offs.
When to Fine-Tune vs Few-Shot vs RAG
A decision framework for choosing between fine-tuning, few-shot prompting, and RAG for production LLM applications.
Claude vs GPT for Engineering Workflows
A practical comparison of Claude and GPT models for real engineering tasks — code generation, debugging, architecture reviews, and documentation.
