<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Yoke Agent — Blog</title>
    <link>https://yoke-agent.digital/blog/</link>
    <description>Essays on RAG and agent evaluation, grid-search methodology, and self-hosted AI quality workflows.</description>
    <language>en-US</language>
    <atom:link href="https://yoke-agent.digital/blog/feed.xml" rel="self" type="application/rss+xml" />
    <lastBuildDate>Tue, 28 Apr 2026 12:00:00 +0000</lastBuildDate>

    <item>
      <title>Grid-search for RAG: an old technique, retrofitted for a new problem</title>
      <link>https://yoke-agent.digital/blog/grid-search-for-rag/</link>
      <guid>https://yoke-agent.digital/blog/grid-search-for-rag/</guid>
      <pubDate>Tue, 28 Apr 2026 12:00:00 +0000</pubDate>
      <description>Grid search is a 60-year-old hyperparameter-tuning technique. Applying it to RAG required rethinking what a hyperparameter even is — here is how Yoke Agent rebuilt it for chunking, embeddings, retrievers and advanced strategies.</description>
    </item>

    <item>
      <title>How to evaluate a RAG pipeline end-to-end in 2026</title>
      <link>https://yoke-agent.digital/blog/evaluate-rag-pipeline-2026/</link>
      <guid>https://yoke-agent.digital/blog/evaluate-rag-pipeline-2026/</guid>
      <pubDate>Tue, 21 Apr 2026 12:00:00 +0000</pubDate>
      <description>A pillar guide to evaluating retrieval-augmented generation pipelines: datasets, RAGAS metrics, grid-search axes, improvement reports and production monitoring.</description>
    </item>
    <item>
      <title>DeepEval vs Yoke Agent: honest comparison</title>
      <link>https://yoke-agent.digital/blog/deepeval-vs-yoke-agent/</link>
      <guid>https://yoke-agent.digital/blog/deepeval-vs-yoke-agent/</guid>
      <pubDate>Tue, 14 Apr 2026 12:00:00 +0000</pubDate>
      <description>Where DeepEval wins, where Yoke Agent wins, and why most serious teams end up using both.</description>
    </item>
    <item>
      <title>The 14 agent evaluation metrics Yoke ships (and why)</title>
      <link>https://yoke-agent.digital/blog/14-agent-evaluation-metrics/</link>
      <guid>https://yoke-agent.digital/blog/14-agent-evaluation-metrics/</guid>
      <pubDate>Tue, 07 Apr 2026 12:00:00 +0000</pubDate>
      <description>Every G-Eval rubric metric Yoke Agent implements, with definitions, formulas and when to use each one.</description>
    </item>
    <item>
      <title>Benchmarking chunking strategies on a real corpus</title>
      <link>https://yoke-agent.digital/blog/benchmarking-chunking-strategies/</link>
      <guid>https://yoke-agent.digital/blog/benchmarking-chunking-strategies/</guid>
      <pubDate>Tue, 31 Mar 2026 12:00:00 +0000</pubDate>
      <description>Grid-searching four chunking strategies against a 500-document technical corpus — the numbers you actually need to pick one.</description>
    </item>
    <item>
      <title>Self-hosted LLM evaluation: a 2026 guide</title>
      <link>https://yoke-agent.digital/blog/self-hosted-llm-evaluation-2026/</link>
      <guid>https://yoke-agent.digital/blog/self-hosted-llm-evaluation-2026/</guid>
      <pubDate>Tue, 24 Mar 2026 12:00:00 +0000</pubDate>
      <description>Why self-hosted evaluation matters in 2026, what to demand from the tool, and how to migrate off a SaaS platform.</description>
    </item>
    <item>
      <title>Why notebooks fail for RAG evaluation (and what to do instead)</title>
      <link>https://yoke-agent.digital/blog/why-notebooks-fail-rag-evaluation/</link>
      <guid>https://yoke-agent.digital/blog/why-notebooks-fail-rag-evaluation/</guid>
      <pubDate>Tue, 17 Mar 2026 12:00:00 +0000</pubDate>
      <description>Five failure modes of notebook-driven RAG evaluation, and a practical migration path to reproducible grid-search.</description>
    </item>
  </channel>
</rss>
