<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Local-Ai on saurabh</title><link>https://unfoundbox.com/tags/local-ai/</link><description>Recent content in Local-Ai on saurabh</description><generator>Hugo</generator><language>en-gb</language><lastBuildDate>Tue, 09 Jun 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://unfoundbox.com/tags/local-ai/index.xml" rel="self" type="application/rss+xml"/><item><title>DIY Smart Robot With a Local Gemma Brain</title><link>https://unfoundbox.com/posts/diy-smart-robot-gemma/</link><pubDate>Tue, 09 Jun 2026 00:00:00 +0000</pubDate><guid>https://unfoundbox.com/posts/diy-smart-robot-gemma/</guid><description>&lt;p>&lt;img src="https://unfoundbox.com/posts/diy-smart-robot-gemma/cover.png" alt="A finished reference build: small rover, local AI workstation, camera, distance sensor, speaker, and safe wiring.">&lt;/p>
&lt;p>The best way to build a small AI robot is to keep the architecture boring.&lt;/p>
&lt;p>Use the local model for high-level reasoning, explanations, and plans. Use deterministic firmware for timing, motor control, watchdogs, limits, and emergency stop. The model should never drive motors directly.&lt;/p>
&lt;p>That separation is the whole project.&lt;/p>
&lt;h2 id="goal">Goal&lt;/h2>
&lt;p>Build a small rover that can:&lt;/p></description></item><item><title>Local Inference on WebGPU: Where Small Models Actually Win</title><link>https://unfoundbox.com/posts/webgpu-local-inference/</link><pubDate>Sun, 07 Jun 2026 00:00:00 +0000</pubDate><guid>https://unfoundbox.com/posts/webgpu-local-inference/</guid><description>&lt;p>The exciting version of browser AI is not &amp;ldquo;run a giant chatbot in a tab.&amp;rdquo; The useful version is narrower and more practical:&lt;/p>
&lt;blockquote>
&lt;p>Train or fine-tune a small model in Python, export it to ONNX or a browser-friendly runtime, and run the loop locally through WebGPU.&lt;/p>&lt;/blockquote>
&lt;p>As of this research snapshot, that loop is real enough to build with. The advantage is not universal, but in a few cases it is decisive: private data stays on device, latency drops below the threshold where interaction feels live, inference server cost goes to zero, and offline use becomes possible.&lt;/p></description></item></channel></rss>