Tag: evaluation
2 articles tagged with “evaluation”
models
LLM Evaluation Beyond Vibes
Systematic approaches to evaluating LLM outputs: automated metrics, human evaluation frameworks, regression testing, and evaluation pipelines.
9 min
engineering
Testing LLM-Powered Features Without Going Broke
Mock strategies, evaluation harnesses, snapshot testing, and cost-aware CI for LLM-integrated applications.
9 min