<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Learning Loop Blog</title>
    <link>https://gradientdisco.com/blog/</link>
    <description>Thinking on AI strategy, model optimisation, and building sovereign AI systems that compound in value over time.</description>
    <language>en</language>
    <atom:link href="https://gradientdisco.com/blog/feed.xml" rel="self" type="application/rss+xml"/>
    <item>
      <title>Teaching an Agent From Outcomes: Reinforcement Learning for Multi-Step AI Processes</title>
      <link>https://gradientdisco.com/blog/teaching-an-agent-from-outcomes.html</link>
      <description>Prompt tuning and fine-tuning both require labelled examples of correct behaviour. But for complex multi-step agent workflows, you often can only say whether the overall outcome was good. Reinforcement learning is exactly the right tool for that setting.</description>
      <pubDate>Thu, 18 Jun 2026 00:00:00 +0000</pubDate>
      <guid>https://gradientdisco.com/blog/teaching-an-agent-from-outcomes.html</guid>
    </item>
    <item>
      <title>Fine-Tuning a Small Model on Your Data: What It Takes and What You Get</title>
      <link>https://gradientdisco.com/blog/fine-tuning-a-small-model-on-your-data.html</link>
      <description>A fine-tuned 7B model trained on your domain data will outperform a frontier model on generic prompts for well-defined tasks — consistently, cheaply, and without sending your data to a third-party endpoint. Here is what the process actually involves.</description>
      <pubDate>Wed, 17 Jun 2026 00:00:00 +0000</pubDate>
      <guid>https://gradientdisco.com/blog/fine-tuning-a-small-model-on-your-data.html</guid>
    </item>
    <item>
      <title>Prompt Tuning Without the Guesswork: How Genetic Optimisation Replaces Manual Iteration</title>
      <link>https://gradientdisco.com/blog/prompt-tuning-without-the-guesswork.html</link>
      <description>Manual prompt engineering has no real feedback loop — you iterate by feel, test on a handful of examples, and hope it generalises. Genetic optimisation replaces that process with a systematic search over production traces. Here is how it works.</description>
      <pubDate>Tue, 16 Jun 2026 00:00:00 +0000</pubDate>
      <guid>https://gradientdisco.com/blog/prompt-tuning-without-the-guesswork.html</guid>
    </item>
    <item>
      <title>Three Ways to Make AI Better at Your Job: Prompt Tuning, Fine-Tuning, and Reinforcement Learning</title>
      <link>https://gradientdisco.com/blog/three-ways-to-make-ai-better-at-your-job.html</link>
      <description>There are three distinct strategies for making an AI model better at a specific job. Each works differently, costs differently, and produces a different kind of asset. Here is how to choose.</description>
      <pubDate>Mon, 15 Jun 2026 00:00:00 +0000</pubDate>
      <guid>https://gradientdisco.com/blog/three-ways-to-make-ai-better-at-your-job.html</guid>
    </item>
    <item>
      <title>Why &#x27;Human in the Loop&#x27; Is Broken — and What to Do Instead</title>
      <link>https://gradientdisco.com/blog/why-human-in-the-loop-is-broken.html</link>
      <description>Human-in-the-loop sounds safe. But it contains a structural flaw that guarantees the one genuinely dangerous decision gets the same shallow glance as the thousandth routine one.</description>
      <pubDate>Sun, 14 Jun 2026 00:00:00 +0000</pubDate>
      <guid>https://gradientdisco.com/blog/why-human-in-the-loop-is-broken.html</guid>
    </item>
    <item>
      <title>Two Ways to Measure What Your AI Doesn&#x27;t Know</title>
      <link>https://gradientdisco.com/blog/two-ways-to-measure-what-your-ai-doesnt-know.html</link>
      <description>LLMs cannot reliably report their own uncertainty — so you have to measure it from the outside. Here are the two methods that work, and when to use each.</description>
      <pubDate>Sat, 13 Jun 2026 00:00:00 +0000</pubDate>
      <guid>https://gradientdisco.com/blog/two-ways-to-measure-what-your-ai-doesnt-know.html</guid>
    </item>
    <item>
      <title>The Only AI Metric That Actually Matters: The Cost of Being Wrong</title>
      <link>https://gradientdisco.com/blog/the-cost-of-being-wrong.html</link>
      <description>Most AI deployments chase benchmark accuracy. But in production, value isn&#x27;t destroyed by average errors — it&#x27;s destroyed by the single ruinous tail event you didn&#x27;t cap.</description>
      <pubDate>Fri, 12 Jun 2026 00:00:00 +0000</pubDate>
      <guid>https://gradientdisco.com/blog/the-cost-of-being-wrong.html</guid>
    </item>
    <item>
      <title>Why Fine-Tuned Small Models Beat Prompt Engineering at Scale</title>
      <link>https://gradientdisco.com/blog/why-fine-tuned-small-models-beat-prompt-engineering.html</link>
      <description>Prompt engineering is a great starting point — but at production scale, a fine-tuned 7B model running on your own infrastructure will outperform a frontier model on generic prompts every time. Here is why, and when to make the switch.</description>
      <pubDate>Tue, 10 Jun 2025 00:00:00 +0000</pubDate>
      <guid>https://gradientdisco.com/blog/why-fine-tuned-small-models-beat-prompt-engineering.html</guid>
    </item>
  </channel>
</rss>
