I use Playwright to intercept all requests and responses and have Claude Code navigate to a website like YouTube and click and interact with all the elements and inputs while recording all the requests and responses associated with each interaction. Then it creates a detailed strongly typed API to interact with any website using the underlying API.
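
A minimal sketch of that capture loop, assuming Playwright for Node; names like `captureTraffic` are illustrative, and the real skill records far more per interaction (headers, bodies, timing):

```typescript
// Shape of one captured request/response pair.
interface CapturedCall {
  method: string;
  url: string;
  status: number;
  responseBody: string;
}

// Pure helper: collapse captured traffic into unique "METHOD /path" endpoints,
// roughly what the generated API client gets built from.
function summarizeEndpoints(calls: CapturedCall[]): string[] {
  const seen = new Set<string>();
  for (const call of calls) {
    seen.add(`${call.method} ${new URL(call.url).pathname}`);
  }
  return [...seen].sort();
}

// Playwright wiring (illustrative): pass in `chromium` from
// `import { chromium } from "playwright"`; typed loosely so this sketch
// compiles without Playwright installed.
async function captureTraffic(chromium: any, targetUrl: string): Promise<CapturedCall[]> {
  const browser = await chromium.launch();
  const page = await browser.newPage();
  const calls: CapturedCall[] = [];
  page.on("response", async (res: any) => {
    calls.push({
      method: res.request().method(),
      url: res.url(),
      status: res.status(),
      responseBody: await res.text().catch(() => ""),
    });
  });
  await page.goto(targetUrl);
  // ...the agent clicks around here; every request/response lands in `calls`...
  await browser.close();
  return calls;
}
```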

Yes, I know it likely breaks everybody's terms of service, but at the same time I'm not loading gigabytes of ads, images, and markup to accomplish things.

If anyone is interested I can take some time and publish it this week.

I also do this. My primary use case is reproducing page layout and styling at any given subtree of the DOM. So, capturing various states of a component, etc.

I also use it to automatically capture page responsiveness behavior in complex web apps. It uses Playwright to adjust the viewport width and monitor entire trees for exact changes, then writes structured data that includes the complete cascade of relevant styles, with screenshots to support the snapshots.
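
A sketch of what that width sweep could look like with Playwright's `setViewportSize`, plus a pure diff over computed-style snapshots; the function names and the sampled property list are illustrative:

```typescript
// Computed-style values for one element at a given viewport width.
type StyleSnapshot = Record<string, string>;

// Pure helper: which properties changed between two widths, and how?
function diffStyles(
  before: StyleSnapshot,
  after: StyleSnapshot
): Record<string, [string, string]> {
  const changes: Record<string, [string, string]> = {};
  for (const prop of new Set([...Object.keys(before), ...Object.keys(after)])) {
    if (before[prop] !== after[prop]) {
      changes[prop] = [before[prop] ?? "", after[prop] ?? ""];
    }
  }
  return changes;
}

// Playwright wiring (illustrative): pass in `chromium` from
// `import { chromium } from "playwright"`; typed loosely so the sketch
// compiles without Playwright installed.
async function snapshotAtWidths(chromium: any, url: string, selector: string, widths: number[]) {
  const browser = await chromium.launch();
  const page = await browser.newPage();
  await page.goto(url);
  const snapshots: { width: number; styles: StyleSnapshot }[] = [];
  for (const width of widths) {
    await page.setViewportSize({ width, height: 800 });
    // The callback runs in the page, where getComputedStyle is a real global.
    const styles: StyleSnapshot = await page.locator(selector).evaluate((el: any) => {
      const cs = (globalThis as any).getComputedStyle(el);
      const out: Record<string, string> = {};
      for (const prop of ["display", "flex-direction", "width", "font-size"]) {
        out[prop] = cs.getPropertyValue(prop);
      }
      return out;
    });
    await page.screenshot({ path: `responsive-${width}.png` }); // screenshot backs the snapshot
    snapshots.push({ width, styles });
  }
  await browser.close();
  return snapshots;
}
```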

There are tools you can buy that let you do this kind of inspection manually, but they are designed for humans. So, lots of clickety-clackety and human speed results.

---

My first reaction to seeing this on the FP was: why are people still releasing MCPs? So far I've managed to completely avoid that hype loop and went straight to building custom CLIs, even before skills were a thing.

I think people still don't realize the power and efficiency of direct access to the things you want, with skills to guide the AI in using that access effectively.

Maybe I'm missing something in this particular use case?

> My first reaction to seeing this FP was why are people still releasing MCPs?

MCPs are more difficult to use. You need an agent to use the tools; you can't easily do it manually. I wonder if some people see that friction as a feature.

It's mostly because MCPs handle auth in a standardised way and give you a framework you can layer things on top of.

Without it you're stuck with the basic HTTP firewall, etc., which is extremely dangerous, and this is maybe the one opportunity we have to do this.

And people forget: Claude Code isn't the only Claude surface, and CLIs don't help in surfaces other than Cowork.
I love how HN is loving this idea when it's the exact same thing Anthropic and OpenAI (and every other LLM maker) did.

It's God's gift to them when it lets them bypass ads and download copyrighted material. But it's Satan's curse on humanity when the Zuck does it to train his LLM and download copyrighted material.

Both scale and purpose make them completely different things. You're acting as if they're the same when they're not.

I won't comment on downloading, but ads are trackers and spyware to me. I don't spy on website owners; I have the right to stop those trackers.

Zuck serves ads/spyware to other users; he deserves to taste his own medicine, not me.

Yes, it's God's gift when the average user can do it, and Satan's curse when a hated fucking mega-corp is doing it.

Where's the contradiction?

You can see this pattern in many different topics: updoots are highly correlated with a positive answer to "do I personally get to profit"?
I would love to pay for content. I'm _paying_ for YouTube Premium.

But heck, do I hate the YouTube interface; it has degraded far past usability.

So you’re that Hal Jordan then? Why would a Green Lantern feel the need to defend either? I feel like the Guardians would not accept your arguments as soon as you got to Oa, poozer. I guess what I am saying is don’t have a famous name. Seems obvious.
You conflate web crawling for inference with web crawling for training.

Web crawling for training is when you ingest content on a mass scale, usually indiscriminately, usually with a dumb crawler for scale's sake, for the purposes of training an LLM. You don't really care whether one particular website is in the dataset (unless it's the size of Reddit), you just want a large, diverse, high-quality data mix.

Web crawling for inference is when a user asks a targeted question, you do a web search, and fetch exactly those resources that are likely to be relevant to that search. Nothing ends up in the training data, it's just context enrichment.

People have a much larger issue with crawling for training than for inference (though I personally think both are equally ok).

Why even use Playwright for this? I feel like Claude just needs agent-browser and it can generate deterministic code from it.
You mean this one? https://github.com/vercel-labs/agent-browser

It is 2 months old!

My excuse for not keeping up is that I'm in so deep that Claude Code can predict the stock market.

I'll still publish mine and see if it has any value, but agent-browser looks very complete.

Thank you for sharing!

You can just start claude with the --chrome flag too and it will connect to the Chrome extension.
Please do.

Did you compare Playwright with MCP? Why one over the other?

I usually use MCP because I heard it's less detectable than Playwright and more robust against design changes, but I didn't compare or test it myself.

Very interested. Would even pay for an API for this. I am doing something similar with vibium and need something more token efficient.
Have you tried vibium's CLI + agent skill?
Would this hypothetically be able to download arbitrary videos from youtube without the constant yt-dlp arms race?
Don't know how this could be more stable than yt-dlp. When issues come up, they're fixed really quickly.
> yt-dlp arms race

I don't know anything about yt-dlp.

It would probably help people who want to go to a concert have a chance to beat the scalpers who corner the market on an event within 30 seconds by hitting the marketplace services with 20,000 requests.

I can try to see if it can bypass the yt-dlp arms race. But that is always a cat-and-mouse game.

If it can save all the video/audio fragments and call ffmpeg to join them together? Maybe.
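
The join step the comment suggests could be sketched with ffmpeg's concat demuxer; `buildConcatCommand` is an illustrative name, and this only prepares the list file and argument vector, it doesn't run ffmpeg:

```typescript
// Build the ffmpeg invocation that losslessly joins downloaded fragments
// using the concat demuxer: ffmpeg -f concat -safe 0 -i list.txt -c copy out.
function buildConcatCommand(fragments: string[], output: string) {
  // The concat demuxer reads a list file with one `file '...'` line per
  // fragment (assumes fragment paths contain no single quotes).
  const listFile = fragments.map((f) => `file '${f}'`).join("\n");
  const args = ["-f", "concat", "-safe", "0", "-i", "fragments.txt", "-c", "copy", output];
  return { listFile, args };
}

// Usage sketch: write `listFile` to fragments.txt, then spawn ffmpeg with
// `args` (e.g. via child_process.spawn). Separate audio and video streams
// would need a second ffmpeg call that muxes them together instead.
```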
Yes, please do and ping me when it's done lol. Did you make it into an agent skill?
Exactly. It is an agent skill that interacts with a webpage, pressing buttons and so on, while capturing and documenting all the API requests the page makes using Playwright's request/response interception methods. It creates a strongly typed, well-documented API at the end.
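
The "strongly typed" step can be sketched as naive type inference over one sample response body; `inferTsType` is an illustrative name, and a real implementation would merge multiple samples and handle optional fields:

```typescript
// Naive structural type inference from one sample JSON value, roughly the
// shape of the codegen step that emits the typed client.
function inferTsType(value: unknown): string {
  if (value === null) return "null";
  if (Array.isArray(value)) {
    return value.length > 0 ? `${inferTsType(value[0])}[]` : "unknown[]";
  }
  if (typeof value === "object") {
    const fields = Object.entries(value as Record<string, unknown>)
      .map(([key, v]) => `${key}: ${inferTsType(v)}`)
      .join("; ");
    return `{ ${fields} }`;
  }
  return typeof value; // "string" | "number" | "boolean"
}

// e.g. a captured response sample
//   [{ id: "abc", views: 123 }]
// yields the type expression
//   { id: string; views: number }[]
```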
I just ask Claude to reverse engineer the site with Chrome MCP. It goes to work by itself, uses your Chrome logged in session cookies, etc.
I would love it if you had time to publish it!
I was doing something similar by capturing XHR requests while clicking through manually, then asking Codex to reverse engineer the API from the export.

Never tried that level of autonomy though. How long is your iteration cycle?

If I had to guess, mine was maybe 10-20 minutes over a few prompts.

I assume you're not logged into those sites, in order to avoid bans and the risk of hitting the wrong button like, say, "Delete Account".
It turns any authenticated browser session into a fully typed REST API proxy, exposing discovered endpoints as local Hono routes that relay requests through the browser, so cookies and auth are automatic.

The point is that it creates an API proxy in code that a TypeScript server calls directly. The AI runs for about 10 minutes doing codegen. The rest of the time it is just API calls to a service. Remove the endpoint for "Delete Account" and that API endpoint never gets called.
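
The relay core can be sketched as a pure translation from a local proxy request into the `fetch` the page executes; the Hono routing layer is omitted here, and the names are illustrative:

```typescript
// One proxied call: the local route receives this shape, and the browser
// session replays it against the site's own origin.
interface RelayRequest {
  method: string;
  path: string;   // e.g. "/api/v1/feed"
  body?: unknown;
}

// Pure helper: turn a relay request into the arguments for an in-page fetch.
function toFetchArgs(req: RelayRequest): { url: string; init: Record<string, unknown> } {
  const init: Record<string, unknown> = { method: req.method };
  if (req.body !== undefined) {
    init.headers = { "content-type": "application/json" };
    init.body = JSON.stringify(req.body);
  }
  return { url: req.path, init };
}

// In the proxy route handler, this would feed Playwright's page.evaluate, so
// the site's cookies and session headers are attached by the browser itself:
//   const { url, init } = toFetchArgs(req);
//   const result = await page.evaluate(
//     ([u, i]) => fetch(u, i).then((r) => r.text()),
//     [url, init]
//   );
```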

This 100% breaks everyone's terms of service. I would not recommend nor encourage using it.

I always used Playwright as an alternative to Selenium; I'm somewhat surprised by its ability to interface with LLMs.
+1, publish, but how will we know when you have published...
Yes, please do!
100%. I'll respond to this by Friday with a link to GitHub.

I use Patchright + Ghostery, and I have a clever tool that uses WebSockets to pass screenshots at one-second intervals to a dashboard, and pointer/keyboard events back to the server, which allows interacting with websites so that a user can set up authentication that is stored in the Chrome user profile, with all the cookies, history, local storage, etc., in the cloud on a server.

Can you list some websites that don't require a subscription that you would like me to test against? I used this for Robinhood, and I think LinkedIn would be a good example for people to use.

I'd like to see this published as well, thanks!
Commenting to follow up.
Isn't this what everyone who needs web validation does?