Show HN: ScrapeCopilot – Notebook Code Interface + Puppeteer + AI Copilot

3•erichi•1y ago
Hi HN, I’m Eric, and I’m building ScrapeCopilot, an AI assistant designed to eliminate friction in browser automation development.

Here is the link to the VS Code extension: https://marketplace.visualstudio.com/items?itemName=scrapeco...

I've built browser automations for more than 5 years, and the constant frustration has been the friction involved in getting to working code, especially when debugging in headless mode or connecting to remote browsers.

When I started using LLMs to generate automation code, I found myself stuck in a repetitive loop: navigate to the desired page state, copy-paste HTML into the AI chat, and ask it to generate code. The worst part is that there was no easy way to run that generated code without losing the page state, forcing me to restart the browser session constantly. This wasted large amounts of time and mental energy. I built ScrapeCopilot to make this workflow seamless.

How it works:

ScrapeCopilot combines the power of a Jupyter-style notebook with a live Puppeteer browser session and integrated AI.

- Live Interactive Development: When you create an automation notebook, it initiates a fresh Puppeteer browser session. The page object is exposed directly to your notebook cells, allowing you to run any Puppeteer code against the live browser state and see the results instantly.

- AI-Powered Assistance: It integrates with GitHub Copilot (via the @scrapecopilot chat participant). The AI automatically sees the current page HTML, allowing it to generate highly relevant Puppeteer code based on your instructions directly within the chat.

- LLM Code Export: Once you've developed your automation logic, you can easily export the final, complete Puppeteer script based on your instructions.
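
To make the first two points concrete, here is a minimal sketch of the idea, not ScrapeCopilot's actual implementation. It assumes a notebook keeps one page object alive and hands it to each cell, and that the chat prompt is just the instruction plus the current HTML. `PageLike`, `NotebookSession`, `buildPrompt`, `FakePage` and the 20000-character limit are all hypothetical names and values chosen for illustration; only `goto` and `content` mirror real Puppeteer `Page` methods.

```typescript
// Hypothetical sketch: none of these names come from ScrapeCopilot itself.

// The two Puppeteer Page methods this sketch relies on (goto and content).
interface PageLike {
  goto(url: string): Promise<unknown>;
  content(): Promise<string>;
}

// A notebook cell is an async function that receives the shared page.
type Cell<T> = (page: PageLike) => Promise<T>;

// Holds one page for the lifetime of the notebook, so state survives between cells.
class NotebookSession {
  private readonly page: PageLike;

  constructor(page: PageLike) {
    this.page = page;
  }

  run<T>(cell: Cell<T>): Promise<T> {
    return cell(this.page);
  }
}

// Builds the text sent to the model: the instruction plus (possibly truncated) page HTML.
// The default limit is an arbitrary assumption, not a documented value.
function buildPrompt(instruction: string, html: string, maxHtmlChars: number = 20000): string {
  const truncated = html.length > maxHtmlChars;
  const snippet = truncated ? html.slice(0, maxHtmlChars) : html;
  return [
    `Instruction: ${instruction}`,
    `Current page HTML${truncated ? " (truncated)" : ""}:`,
    snippet,
  ].join("\n");
}

// In-memory stand-in for a browser page so the sketch runs without launching Chrome.
class FakePage implements PageLike {
  private url = "about:blank";

  async goto(url: string): Promise<void> {
    this.url = url;
  }

  async content(): Promise<string> {
    return `<html><body data-url="${this.url}"></body></html>`;
  }
}

async function demo(): Promise<string> {
  const session = new NotebookSession(new FakePage());
  // Cell 1: navigate. The session keeps the page, so the next cell sees this state.
  await session.run((page) => page.goto("https://example.com"));
  // Cell 2: read the live HTML and turn it into a prompt.
  const html = await session.run((page) => page.content());
  return buildPrompt("Extract the page title", html);
}
```

With real Puppeteer, the `FakePage` would be replaced by the page returned from `browser.newPage()` after `puppeteer.launch()`, since that object also provides `goto` and `content`. The point of the sketch is only that the page outlives any single cell, so generated code can be run without restarting the browser session.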

This tool saves me hours daily, but even more importantly, it improves the developer experience in browser automation, which is a frustrating area.

I believe ScrapeCopilot can complement existing browser automation tools and frameworks by providing an interactive AI-assisted development experience.

Current Status & Future Plans:

- The extension currently works within VS Code. It will work in Cursor, but without chat support initially. I'm actively working on integrating a backend server to enable full chat functionality with Cursor.

- Currently, the key workflow assumes that you create a new browser automation step by step, using code cells. But in my work I spend half of my time fixing existing automations, so my focus now is on adapting the extension for debugging and fixing existing code.

- Playwright support is also on the list.

Check out short videos:

- Demo: Headless False - https://scrapecopilot.ai/assets/demo-headless-false-Dhc_jeNR...

- Demo: Headless True - https://scrapecopilot.ai/assets/demo-headless-true-PRQndDxP....

I'd love to hear your thoughts, feedback, and any suggestions!
