<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Claude Code Archives - Vyftec</title>
	<atom:link href="https://vyftec.com/tag/claude-code/feed/" rel="self" type="application/rss+xml" />
	<link>https://vyftec.com/tag/claude-code/</link>
	<description></description>
	<lastBuildDate>Thu, 02 Jul 2026 10:52:30 +0000</lastBuildDate>
	<language>de-CH</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://vyftec.com/wp-content/uploads/2024/10/cropped-penta-only-300px-1-32x32.png</url>
	<title>Claude Code Archives - Vyftec</title>
	<link>https://vyftec.com/tag/claude-code/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>BrowserOS: The Open-Source AI Browser That Puts You in Control (Not a Black Box)</title>
		<link>https://vyftec.com/browseros-the-open-source-ai-browser-that-puts-you-in-control-not-a-black-box/</link>
		
		<dc:creator><![CDATA[damianhunziker]]></dc:creator>
		<pubDate>Thu, 02 Jul 2026 05:18:51 +0000</pubDate>
				<category><![CDATA[AI & Automation]]></category>
		<category><![CDATA[AI Agents]]></category>
		<category><![CDATA[browser automation]]></category>
		<category><![CDATA[BrowserOS]]></category>
		<category><![CDATA[Claude Code]]></category>
		<category><![CDATA[MCP]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[vision models]]></category>
		<category><![CDATA[web automation]]></category>
		<guid isPermaLink="false">https://vyftec.com/browseros-the-open-source-ai-browser-that-puts-you-in-control-not-a-black-box/</guid>

					<description><![CDATA[<p>Last month I wrote about getting browser-use working with local AI models. It works, but let&#8217;s be real — it&#8217;s a black box. You pipe commands in, hope the agent clicks the right thing, and when it fails, you&#8217;re debugging through abstraction layers. Then I found BrowserOS. And honestly? It changed how I think about [&#8230;]</p>
<p>The post <a href="https://vyftec.com/browseros-the-open-source-ai-browser-that-puts-you-in-control-not-a-black-box/">BrowserOS: The Open-Source AI Browser That Puts You in Control (Not a Black Box)</a> appeared first on <a href="https://vyftec.com">Vyftec</a>.</p>
]]></description>
										<content:encoded><![CDATA[		<div data-elementor-type="wp-post" data-elementor-id="12485" class="elementor elementor-12485">
				<div class="elementor-element elementor-element-1459609e e-flex e-con-boxed e-con e-parent" data-id="1459609e" data-element_type="container" data-e-type="container">
					<div class="e-con-inner">
				<div class="elementor-element elementor-element-41ea0cb5 elementor-widget elementor-widget-text-editor" data-id="41ea0cb5" data-element_type="widget" data-e-type="widget" data-widget_type="text-editor.default">
				<div class="elementor-widget-container">
									<p>Last month I wrote about getting browser-use working with local AI models. It works, but let&#8217;s be real — it&#8217;s a black box. You pipe commands in, hope the agent clicks the right thing, and when it fails, you&#8217;re debugging through abstraction layers.</p><p>Then I found <a href="https://browseros.com"><strong>BrowserOS</strong></a>. And honestly? It changed how I think about browser automation entirely.</p><h2>What Is BrowserOS?</h2><p><a href="https://github.com/browseros-ai/BrowserOS">BrowserOS</a> is an open-source Chromium fork — AGPL-3.0 licensed, 10,000+ GitHub stars — built from the ground up as an AI-native browser. Not a plugin. Not a proxy server. The AI lives <em>in</em> the browser.</p><p>It comes with a <strong>built-in MCP (Model Context Protocol) server</strong> that exposes 53+ browser automation tools and 40+ app integrations through a single HTTP endpoint. No remote-debugging ports. No Node.js proxy processes. Just a browser that speaks MCP natively.</p><h2>The Setup: Laughably Simple</h2><p>Here&#8217;s the entire setup for Claude Code:</p><pre class="wp-block-code vyftec-code"><code># Step 1: Download BrowserOS from browseros.com — done
# Step 2: Open chrome://browseros/mcp — copy the URL
# Step 3: One command
claude mcp add --transport http browseros http://127.0.0.1:9239/mcp --scope user

# Done. Start using it:
claude --dangerously-skip-permissions</code></pre><p>That&#8217;s it. No <code>--remote-debugging-port</code>. No separate Node.js server. No WebDriver flags that get your sessions blocked. BrowserOS runs your <em>real</em> browser with your cookies, extensions, and logins. Sites can&#8217;t tell you&#8217;re automating because — well — you&#8217;re using a real browser.</p><p>Compare that to the browser-use setup where you need a Python environment, Playwright, a separate MCP server package, API keys, and pray that your vision model&#8217;s mmproj file works. If you&#8217;ve read my <a href="https://vyftec.com/browser-use-with-local-ai-why-llama-cpp-beats-ollama-and-how-to-set-it-up-in-minutes/">previous post on browser-use</a>, you know the pain.</p><h2>Vision Models: Works Out of the Box</h2><p><img decoding="async" class="wp-image-responsive" style="float: left; max-width: 350px; margin: 0 20px 20px 0;" src="https://images.unsplash.com/photo-1534723328310-e82dad3ee43f?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3w5ODk0NTl8MHwxfGFsbHx8fHx8fHx8fDE3ODI5NjUxOTd8&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=400" alt="Pepper robot symbolizing vision AI integration" /></p><p>BrowserOS supports <strong>Claude Opus 4.5, Sonnet 4.5, Haiku 4.5, Gemini Flash, GPT-4, and local models via Ollama/LM Studio</strong> — all through its built-in agent loop. The browser&#8217;s <code>take_snapshot</code> tool captures the full accessibility tree with interactive element IDs, and <code>take_enhanced_snapshot</code> adds structural context. Vision models can analyze <code>take_screenshot</code> output to understand page layouts, icons, and visual state.</p><p>The recommendation from the BrowserOS docs: <strong>Claude Opus 4.5 for agent mode</strong> (best quality), Sonnet 4.5 for speed. And unlike browser-use where vision model integration can be fragile, BrowserOS abstracts the complexity away. You configure your provider once, and the agent loop handles the rest.</p><h2>53+ Tools — And They&#8217;re All Visible</h2><p>This is where BrowserOS destroys the black-box argument. You get a catalog of <strong>54 browser automation tools</strong> that you can inspect and call directly:</p><ul><li><strong>Navigation &amp; Tabs</strong>: navigate, new_page, new_hidden_page (stealth tabs!), show_page, move_page, close_page, list_pages, get_active_page</li><li><strong>Content &amp; Observation</strong>: take_snapshot, take_enhanced_snapshot, get_page_content (Markdown!), get_page_links, get_dom, search_dom, take_screenshot, evaluate_script</li><li><strong>Interaction</strong>: click, click_at, hover, focus, fill, clear, check, uncheck, select_option, press_key, drag, scroll, upload_file, handle_dialog</li><li><strong>Window Management</strong>: list_windows, create_window, create_hidden_window, close_window, activate_window</li><li><strong>Tab Groups, Bookmarks, History</strong>: full CRUD for all three</li><li><strong>File &amp; Export</strong>: save_pdf, save_screenshot, download_file</li></ul><p>Every tool is documented, every parameter is known. When the agent does something unexpected, <strong>you can see exactly which tool was called and with what parameters</strong>. No more wondering why it clicked the wrong button.</p><p><img decoding="async" class="wp-image-responsive" style="float: right; max-width: 350px; margin: 0 0 20px 20px;" src="https://images.unsplash.com/photo-1655393001768-d946c97d6fd1?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3w5ODk0NTl8MHwxfGFsbHx8fHx8fHx8fDE3ODI5NjUxOTd8&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=800" alt="Robotic arm symbolizing automation capability" /></p><h2>Console Errors? Yes, It Reads Them</h2><p>Double-checked this one. BrowserOS&#8217;s MCP server includes a <code>get_console_messages</code> tool that reads console output from the page. The official docs literally say: <strong>&#8220;Claude tests your web app, reads console errors, and fixes the code — all in one loop.&#8221;</strong></p><p>Plus you have </p><p><code>evaluate_script</code> to run arbitrary JavaScript and capture return values, exceptions, and side effects. For agentic coding workflows, this is a game changer — your AI assistant can navigate to localhost:3000, click around, read console output, and fix frontend bugs without you lifting a finger.</p><h2>Registration Flows: Chain BrowserOS + Gmail MCP</h2><p>This is where it gets wild. BrowserOS ships with <strong>40+ built-in app integrations</strong> — including Gmail, Outlook, Google Calendar, Slack, GitHub, Linear, Jira, Notion, and more. They all work through the same MCP connection. Zero additional setup.</p><p>So picture this agent prompt:</p><blockquote class="wp-blockquote"><p>Go to freelancermap.de, click &#8220;Register Now&#8221;, fill in the form with these details: [name, email], submit. Then check my Gmail for the confirmation email. Click the verification link. Complete the profile setup wizard.</p></blockquote><p>One prompt. BrowserOS navigates, the Gmail MCP reads the inbox, clicks the link, completes the flow. <strong>The entire registration pipeline — from form fill to email verification to profile completion — done automatically.</strong></p><p>This is what makes BrowserOS qualitatively different from browser-use. It&#8217;s not just a browser tool — it&#8217;s an <strong>automation platform</strong> that includes the communication channels your workflows depend on.</p><h2>Responsive Testing &amp; Window Management</h2><p><img decoding="async" class="wp-image-responsive" style="float: right; max-width: 400px; margin: 0 0 20px 20px;" src="https://images.unsplash.com/photo-1485827404703-89b55fcc595e?crop=entropy&amp;cs=tinysrgb&amp;fit=max&amp;fm=jpg&amp;ixid=M3w5ODk0NTl8MHwxfGFsbHx8fHx8fHx8fDE3ODI5NjUxOTd8&amp;ixlib=rb-4.1.0&amp;q=80&amp;w=800" alt="AI robot representing BrowserOS intelligence" /></p><p>For frontend testing, you get full window management: create windows, hide them, activate them, list them. While <code>resize_page</code> for precise viewport sizing is marked as &#8220;coming soon&#8221; in the core, you can achieve the same through <code>evaluate_script</code> by calling <code>window.resizeTo()</code>. Plus there are third-party MCP servers like <a href="https://github.com/9pros/MCP-Browser-Inspector">MCP-Browser-Inspector</a> that add 50+ device presets for responsive testing.</p><h2>Where BrowserOS Still Struggles</h2><p>Let&#8217;s be honest about the rough edges:</p><ul><li><strong>Complex JavaScript forms and dropdowns</strong> — Custom select widgets, date pickers, and shadow DOM components can confuse the accessibility tree snapshot. The agent sometimes clicks the wrong element or misses options.</li><li><strong>Single-page apps with heavy JS routing</strong> — Vue.js and React SPAs that manage their own state can be tricky. The DOM may not be ready when the snapshot is taken.</li><li><strong>Debugging features</strong> — Compared to Chrome DevTools MCP, BrowserOS is still catching up on console/network inspection (though <code>get_console_messages</code> already exists).</li></ul><p>That said, these are <strong>solvable through better skill prompts and timing</strong>. For building a job application agent that registers on any platform — with BrowserOS + Gmail MCP + well-structured skills — you can absolutely pull it off. We&#8217;ve been doing it internally.</p><h2>Why It Beats browser-use</h2><table class="wp-table"><thead><tr><th>Dimension</th><th>BrowserOS MCP</th><th>browser-use MCP</th></tr></thead><tbody><tr><td>Setup</td><td>Copy URL, one command</td><td>Python env + Playwright + config</td></tr><tr><td>Open source</td><td>Full Chromium fork, AGPL-3.0</td><td>Python package, MIT</td></tr><tr><td>Tools</td><td>54 browser + 40+ app integrations</td><td>~15 browser tools</td></tr><tr><td>Architecture</td><td>Non-CDP (stealth), built-in</td><td>CDP-based, external process</td></tr><tr><td>Session</td><td>Real browser, cookies, extensions</td><td>Playwright browser, may be detected</td></tr><tr><td>App integrations</td><td>40+ (Gmail, Slack, GitHub, &#8230;)</td><td>None built-in</td></tr><tr><td>Vision models</td><td>Claude, GPT-4, Gemini, local</td><td>Depends on external provider config</td></tr><tr><td>Console reading</td><td>get_console_messages + evaluate_script</td><td>Not built-in</td></tr></tbody></table><h2>The Bottom Line</h2><p>BrowserOS is not just &#8220;another browser automation tool.&#8221; It&#8217;s a <strong>paradigm shift</strong> in how AI agents interact with the web. The combination of native MCP, real browser sessions, 40+ integrations, and transparent tooling makes it the most capable open-source option in 2026.</p><p>If you&#8217;ve been fighting with browser-use, Playwright MCP, or Chrome DevTools MCP — <a href="https://browseros.com">give BrowserOS a try</a>. Download the binary, run one command, and see what a properly designed AI browser can do.</p><p><em>We use BrowserOS at Vyftec for automated testing, registration workflows, and data extraction. <a href="https://vyftec.com/contact">Get in touch</a> if you want to build something similar.</em></p><hr /><p><strong>Sources:</strong></p><ol><li><a href="https://github.com/browseros-ai/BrowserOS">BrowserOS GitHub Repository</a></li><li><a href="https://docs.browseros.com">BrowserOS Documentation</a></li><li><a href="https://docs.browseros.com/comparisons/chrome-devtools-mcp">BrowserOS vs Chrome DevTools MCP</a></li><li><a href="https://docs.browseros.com/features/use-with-claude-code">Using BrowserOS with Claude Code</a></li><li><a href="https://deepwiki.com/browseros-ai/BrowserOS/5-mcp-server">BrowserOS MCP Server Architecture (DeepWiki)</a></li></ol><hr /><h2>Vyftec &#8211; BrowserOS: The Open-Source AI Browser</h2><p>Unlock the power of automation with our expertise in AI and web technologies. Experience Swiss-quality solutions tailored to your needs—let’s transform your digital landscape together!</p><p><img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4e7.png" alt="📧" class="wp-smiley" style="height: 1em; max-height: 1em;" /> damian@vyftec.com | <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f4ac.png" alt="💬" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <a href="https://chat.whatsapp.com/Kv4QWmlLJXJGyJnSsp6fzc">WhatsApp</a></p>								</div>
				</div>
					</div>
				</div>
				</div>
		<p>The post <a href="https://vyftec.com/browseros-the-open-source-ai-browser-that-puts-you-in-control-not-a-black-box/">BrowserOS: The Open-Source AI Browser That Puts You in Control (Not a Black Box)</a> appeared first on <a href="https://vyftec.com">Vyftec</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
