frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: ScrapeCopilot – Notebook Code Interface + Puppeteer + AI Copilot

3•erichi•1y ago
Hi HN, I’m Eric, and I’m building ScrapeCopilot, an AI assistant designed to eliminate friction in browser automation development.

Here is the link to VS Code extension - https://marketplace.visualstudio.com/items?itemName=scrapeco...

I've built browser automations for more than 5 years, and the constant frustration was always the sheer friction involved in getting working code – especially when debugging in headless mode or connecting to remote browsers.

When I started using LLMs to generate automation code, I found myself stuck in a repetitive loop: navigate to the desired page state, copy-paste HTML into the AI chat, and ask it to generate code. The worst part is that there was no easy way to run that generated code without losing the page state, forcing me to restart the browser session constantly. This wasted large amounts of time and mental energy. I built ScrapeCopilot to make this workflow seamless.

How it works:

ScrapeCopilot combines the power of a Jupyter-style notebook with a live Puppeteer browser session and integrated AI.

- Live Interactive Development: When you create an automation notebook, it initiates a fresh Puppeteer browser session. The page object is exposed directly to your notebook cells, allowing you to run any Puppeteer code against the live browser state and see the results instantly.

- AI-Powered Assistance: It integrates with GitHub Copilot (via the @scrapecopilot chat participant). The AI automatically sees the current page HTML, allowing it to generate highly relevant Puppeteer code based on your instructions directly within the chat.

- LLM Code Export: Once you've developed your automation logic, you can easily export the final, complete Puppeteer script based on your instructions.

This tool saves me hours daily, but even more importantly, it improves the developer experience in browser automation which is frustrating area.

I believe ScrapeCopilot can complement existing browser automation tools and frameworks by providing an interactive AI-assisted development experience.

Current Status & Future Plans:

- The extension currently works within VS Code. It will work in Cursor, but without chat support initially. I'm actively working on integrating a backend server to enable full chat functionality with Cursor.

- Currently the key workflow assumes that you create a new browser automation step by step, using code cells. But in my work I spend half of the time fixing existing automations, so my focus now is trying to adapt extension for debugging and fixing existing code.

- Playwright support is also on the list.

Check out short videos: - Demo: Headless False - https://scrapecopilot.ai/assets/demo-headless-false-Dhc_jeNR... - Demo: Headless True - https://scrapecopilot.ai/assets/demo-headless-true-PRQndDxP....

I'd love to hear your thoughts, feedback, and any suggestions!

Cerberus, an Open-Source USB protection device

https://github.com/Lab217MX/Cerberus-A-USB-Watchdog
1•glitchboi•1m ago•0 comments

Why the 2026 World Cup Ball Has Deeper Seams

http://liveatthewitchtrials.blogspot.com/2026/06/the-2026-world-cup-football-is-big.html
1•speckx•1m ago•0 comments

What the Fuck Happened to Nerds

https://mrmarket.bearblog.dev/what-the-fuck-happened-to-nerds/
1•mrmarket•2m ago•0 comments

Philtrum – It Started with a Prompt

https://philtrum.app/
1•pencilcheck•4m ago•1 comments

The Token Value of $200/mo Plans

https://twitter.com/SemiAnalysis_/status/2064815044085318040
1•thedebuglife•4m ago•0 comments

The Token Value of $200/mo Plans

https://link.mail.beehiiv.com/ss/c/u001.LDkxbMa7NCxUGG7E2Yh3ABiuUAE5LTRLvOwLxg7TbRtWwRuK02qKlX8wK...
1•thedebuglife•4m ago•0 comments

AI is about to get fast, and it's never going to slow down

https://medium.com/@NMitchem/ai-is-about-to-get-fast-and-its-never-going-to-slow-down-78e13e794375
2•Mitchem•4m ago•0 comments

Bio input based, instead of vision based, physical AI for industrial bio

https://diggest.substack.com/p/creating-a-benchmark-for-physical
1•digvijay0401•5m ago•0 comments

Merman: headless Mermaid.js in Rust

https://github.com/Latias94/merman
1•nateb2022•5m ago•0 comments

Forget Zune. Forget Vista. Copilot Is Microsoft's Biggest Failure

https://www.youtube.com/watch?v=ER0jRB3nhK4
3•valeg•7m ago•0 comments

Understanding the rationale behind a rule when trying to circumvent it

https://devblogs.microsoft.com/oldnewthing/20260611-00/?p=112415
1•ibobev•7m ago•0 comments

Why do you say that a COM STA thread must pump messages?

https://devblogs.microsoft.com/oldnewthing/20260522-00/?p=112348
1•ibobev•8m ago•0 comments

Quantity leads to quality (the origin of a parable) (2020)

https://austinkleon.com/2020/12/10/quantity-leads-to-quality-the-origin-of-a-parable/
1•crescit_eundo•8m ago•0 comments

Learning to be a Tech Lead (2024)

https://miryeh.medium.com/learning-to-be-a-tech-lead-e22a0b4f01d5
1•mooreds•8m ago•0 comments

The tanks in Cushing, Oklahoma, are hitting bottom

https://www.cnn.com/2026/06/12/business/cushing-oil-inventory
4•mooreds•10m ago•0 comments

Why Artists Are Running Their Own Data Centers

https://southpole.blog/artists-running-their-own-data-centers/
1•berlianta•11m ago•0 comments

Can smartphones help explain the drop in birth rates?

https://text.npr.org/nx-s1-5851795
1•mooreds•11m ago•0 comments

India says it is working to stop water flowing into Pakistan

https://www.channelnewsasia.com/asia/india-pakistan-conflict-water-treaty-disagreement-6173811
1•vrganj•12m ago•0 comments

Verizon sent man a refurbished phone with MDM, then deleted his data remotely

https://arstechnica.com/tech-policy/2026/06/verizon-sent-man-a-refurbished-phone-with-mdm-then-de...
3•Brajeshwar•12m ago•0 comments

Amazon.ca is down – everything is out of stock

https://www.amazon.ca/Decker-CBG110SC-Electric-Smartgrind-Grinder/dp/B07SZ9FFT9/ref=lp_2224068011...
1•Callicles•12m ago•0 comments

When should we expect to meet aliens?

https://aliens.fyi
3•avhwl•13m ago•1 comments

Solving a chess puzzle with Claude and Prolog

https://www.johndcook.com/blog/2026/06/11/prolog-claude/
2•ibobev•14m ago•0 comments

Author Jane Yolen, 87, died. Writer of fantasy, sci-fi, and children's books

https://locusmag.com/2026/06/jane-yolen-1939-2026/
1•speckx•14m ago•0 comments

Agentic-Engineering-Handbook

https://github.com/keyuchen21/agentic-engineering-handbook
2•keyuchen2020•15m ago•0 comments

Nvidia Is Developing an AI Healthcare Model with Startup Abridge

https://www.wsj.com/cio-journal/nvidia-is-developing-an-ai-healthcare-model-with-startup-abridge-...
1•bookofjoe•16m ago•1 comments

Vykar is a fast, encrypted, deduplicated backup tool written in Rust

https://vykar.borgbase.com
2•delduca•17m ago•0 comments

Google Sues to Stop Chinese Cybercrime Group from Using Its A.I

https://www.nytimes.com/2026/06/12/technology/google-lawsuit-china-ai-scams.html
1•ChrisArchitect•18m ago•1 comments

Open Knowledge Format

https://cloud.google.com/blog/products/data-analytics/how-the-open-knowledge-format-can-improve-d...
1•berlianta•18m ago•0 comments

Show HN: Memoriq – Private AI Memory for ChatGPT, Claude, Gemini and Grok

https://memoriq.me/
2•giekaton•22m ago•0 comments

WASI 0.3.0 Released

https://github.com/WebAssembly/WASI/releases/tag/v0.3.0
5•mavdol04•23m ago•0 comments