Tag
benchmarks
2 articles tagged with “benchmarks”
experiments
Testing Autonomous Coding Agents: GitHub Copilot, Cursor, and Windsurf in Real Projects
A hands-on experiment comparing autonomous coding agents on real engineering tasks — multi-file refactoring, bug fixing, and feature implementation.
14 min
models
Claude vs GPT for Engineering Workflows
A practical comparison of Claude and GPT models for real engineering tasks — code generation, debugging, architecture reviews, and documentation.
12 min