Show HN: ScrapeCopilot – Notebook Code Interface + Puppeteer + AI Copilot

3•erichi•8mo ago
Hi HN, I’m Eric, and I’m building ScrapeCopilot, an AI assistant designed to eliminate friction in browser automation development.

Here is the link to the VS Code extension: https://marketplace.visualstudio.com/items?itemName=scrapeco...

I've built browser automations for more than 5 years, and the constant frustration has been the friction involved in getting working code, especially when debugging in headless mode or connecting to remote browsers.

When I started using LLMs to generate automation code, I found myself stuck in a repetitive loop: navigate to the desired page state, copy-paste HTML into the AI chat, and ask it to generate code. The worst part is that there was no easy way to run that generated code without losing the page state, forcing me to restart the browser session constantly. This wasted large amounts of time and mental energy. I built ScrapeCopilot to make this workflow seamless.

How it works:

ScrapeCopilot combines the power of a Jupyter-style notebook with a live Puppeteer browser session and integrated AI.

- Live Interactive Development: When you create an automation notebook, it initiates a fresh Puppeteer browser session. The page object is exposed directly to your notebook cells, allowing you to run any Puppeteer code against the live browser state and see the results instantly.

- AI-Powered Assistance: It integrates with GitHub Copilot (via the @scrapecopilot chat participant). The AI automatically sees the current page HTML, allowing it to generate highly relevant Puppeteer code based on your instructions directly within the chat.

- LLM Code Export: Once you've developed your automation logic, you can easily export the final, complete Puppeteer script based on your instructions.
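
To make the three points above concrete, here is a rough sketch of the idea, not ScrapeCopilot's actual implementation. `PageLike`, `NotebookSession`, `runCell`, `buildPrompt`, `exportScript`, and `FakePage` are hypothetical names made up for illustration. `PageLike` stands in for the two Puppeteer `Page` methods used here (`goto` and `content`) so the sketch runs without a browser, and the export step simply joins the successful cells instead of asking an LLM to write the final script.

```typescript
// Hypothetical stand-in for the small slice of Puppeteer's Page used below.
interface PageLike {
  goto(url: string): Promise<unknown>;
  content(): Promise<string>;
}

// A notebook cell is a function that receives the shared page object.
type Cell = (page: PageLike) => Promise<unknown>;

// Holds one long-lived page so each cell sees the state left by earlier cells.
class NotebookSession {
  private readonly page: PageLike;
  private readonly history: string[] = [];

  constructor(page: PageLike) {
    this.page = page;
  }

  // Run one cell against the shared page; the browser session is not restarted.
  async runCell(source: string, cell: Cell): Promise<unknown> {
    const result = await cell(this.page);
    this.history.push(source); // reached only if the cell did not throw
    return result;
  }

  // Build a chat prompt that carries the current page HTML (truncated),
  // so nothing has to be copy-pasted by hand.
  async buildPrompt(instruction: string, maxChars = 20000): Promise<string> {
    const html = (await this.page.content()).slice(0, maxChars);
    return `Current page HTML:\n${html}\n\nTask: ${instruction}`;
  }

  // Simplified export: concatenate the source of every successful cell.
  exportScript(): string {
    return this.history.join("\n");
  }
}

// In-memory fake page for trying the sketch without launching a browser.
class FakePage implements PageLike {
  private url = "about:blank";

  async goto(url: string): Promise<void> {
    this.url = url;
  }

  async content(): Promise<string> {
    return `<html><body>${this.url}</body></html>`;
  }
}
```

With real Puppeteer, the `page` returned by `browser.newPage()` should fit `PageLike` structurally, since it has both `goto` and `content`. The point of the design is that the page outlives any single cell, so a failed or regenerated snippet can be rerun without navigating back to the same page state.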

This tool saves me hours daily, but even more importantly, it improves the developer experience in browser automation, which is a frustrating area.

I believe ScrapeCopilot can complement existing browser automation tools and frameworks by providing an interactive AI-assisted development experience.

Current Status & Future Plans:

- The extension currently works within VS Code. It will work in Cursor, but without chat support initially. I'm actively working on integrating a backend server to enable full chat functionality with Cursor.

- Currently, the key workflow assumes that you create a new browser automation step by step, using code cells. But in my work I spend half of my time fixing existing automations, so my focus now is on adapting the extension for debugging and fixing existing code.

- Playwright support is also on the list.

Check out the short videos:

- Demo: Headless False - https://scrapecopilot.ai/assets/demo-headless-false-Dhc_jeNR...

- Demo: Headless True - https://scrapecopilot.ai/assets/demo-headless-true-PRQndDxP...

I'd love to hear your thoughts, feedback, and any suggestions!
