Hacker news

Show HN: Webctl – Browser automation for agents based on CLI instead of MCP (https://github.com)

134 points by cosinusalpha 4 days ago | 39 comments

binalpatel 4 days ago |

Cool to see lots of people independently come to "CLIs are all you need". I'm still not sure if it's a short-term band-aid because agents are so good at terminal use or if it's part of a longer-term trend, but it has definitely felt much more seamless to me than MCPs.

(My contribution, one of many: https://github.com/caesarnine/binsmith)

the_mitsuhiko 4 days ago |

At this point I'm fully down the path of the agent just maintaining his own tools. I have a browser skill that continues to evolve as I use it. Beats every alternative I have tried so far.

gregpr07 4 days ago |

Creator of Browser Use here. This is cool, a really innovative approach with ARIA roles. One idea we have been playing around with a lot is just giving the LLM raw HTML and a really good way to traverse it: no heuristics, just BS4. It seems to work well, but it is much more expensive than the current prod-ready [index]<div ... notation.
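
To make the trade-off in this comment concrete, here is a minimal sketch of the indexed-element idea using only Python's standard library. It is an illustration, not Browser Use's code or notation: the `IndexedOutline` class, the `outline` helper, the set of tags treated as interactive, and the exact output format are all assumptions.

```python
from html.parser import HTMLParser

# Assumption for illustration: which tags count as "interactive".
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}
# Assumption for illustration: which attributes are worth showing to a model.
KEPT_ATTRS = ("role", "aria-label", "href", "type", "name")


class IndexedOutline(HTMLParser):
    """Collect interactive elements as numbered one-line summaries."""

    def __init__(self):
        super().__init__()
        self.lines = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        # Keep elements that are interactive by tag or that carry an ARIA role.
        if tag in INTERACTIVE_TAGS or "role" in attr_map:
            kept = {k: v for k, v in attr_map.items() if k in KEPT_ATTRS and v}
            attr_text = "".join(f' {k}="{v}"' for k, v in kept.items())
            self.lines.append(f"[{len(self.lines)}]<{tag}{attr_text}>")


def outline(html: str) -> list[str]:
    """Return a compact, indexed listing of the interactive elements in html."""
    parser = IndexedOutline()
    parser.feed(html)
    parser.close()
    return parser.lines


page = (
    '<div><a href="/login">Log in</a>'
    '<div role="button" aria-label="Menu"></div><p>Hi</p></div>'
)
print("\n".join(outline(page)))
```

The compact listing drops everything except a few attributes of a few elements, which is why it is cheaper in tokens than handing over the raw HTML, at the cost of relying on a heuristic about what matters.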

TheTaytay 3 days ago |

I really like this idea!

I’d like to see this other browser plugin’s API be exposed via your same CLI, so I don’t have to only control a separate browser instance. https://github.com/remorses/playwriter (I haven’t investigated enough to know how feasible it is, but as I was reading about your tool, I immediately wanted to control existing tabs from my main browser, rather than “just” a debug-driven separate browser instance.)

Agent_Builder 3 days ago |

Interesting approach. In our experience, most failures weren’t about which interface agents used, but about how much implicit authority they accumulated across steps. Control boundaries mattered more than the abstraction layer.

randito 4 days ago |

If you look at the Elixir keynote for Phoenix.new -- a cool agentic coding tool -- you'll see some hints about browser control using an API tool call. It's called "web" in the video.

Video: https://youtu.be/ojL_VHc4gLk?t=2132

More discussion: https://simonwillison.net/2025/Jun/23/phoenix-new/

renegat0x0 4 days ago |

A little bit different, but it also allows efficient scraping. It uses JSON over HTTP rather than a CLI.

https://github.com/rumca-js/crawler-buddy

It's more like a framework for other mechanisms.

philipbjorge 4 days ago |

This looks remarkably similar to https://github.com/vercel-labs/agent-browser

How is it different?

desireco42 4 days ago |

How are you holding a session if every command is issued through the CLI? I assume this is essential for automation.

AI-love about 19 hours ago |

Good work, keep it up!

grigio 4 days ago |

Is there a benchmark? There are a lot of scraping agents nowadays.
