frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: ScrapeCopilot – Notebook Code Interface + Puppeteer + AI Copilot

3•erichi•1y ago
Hi HN, I’m Eric, and I’m building ScrapeCopilot, an AI assistant designed to eliminate friction in browser automation development.

Here is the link to VS Code extension - https://marketplace.visualstudio.com/items?itemName=scrapeco...

I've built browser automations for more than 5 years, and the constant frustration was always the sheer friction involved in getting working code – especially when debugging in headless mode or connecting to remote browsers.

When I started using LLMs to generate automation code, I found myself stuck in a repetitive loop: navigate to the desired page state, copy-paste HTML into the AI chat, and ask it to generate code. The worst part is that there was no easy way to run that generated code without losing the page state, forcing me to restart the browser session constantly. This wasted large amounts of time and mental energy. I built ScrapeCopilot to make this workflow seamless.

How it works:

ScrapeCopilot combines the power of a Jupyter-style notebook with a live Puppeteer browser session and integrated AI.

- Live Interactive Development: When you create an automation notebook, it initiates a fresh Puppeteer browser session. The page object is exposed directly to your notebook cells, allowing you to run any Puppeteer code against the live browser state and see the results instantly.

- AI-Powered Assistance: It integrates with GitHub Copilot (via the @scrapecopilot chat participant). The AI automatically sees the current page HTML, allowing it to generate highly relevant Puppeteer code based on your instructions directly within the chat.

- LLM Code Export: Once you've developed your automation logic, you can easily export the final, complete Puppeteer script based on your instructions.

This tool saves me hours daily, but even more importantly, it improves the developer experience in browser automation which is frustrating area.

I believe ScrapeCopilot can complement existing browser automation tools and frameworks by providing an interactive AI-assisted development experience.

Current Status & Future Plans:

- The extension currently works within VS Code. It will work in Cursor, but without chat support initially. I'm actively working on integrating a backend server to enable full chat functionality with Cursor.

- Currently the key workflow assumes that you create a new browser automation step by step, using code cells. But in my work I spend half of the time fixing existing automations, so my focus now is trying to adapt extension for debugging and fixing existing code.

- Playwright support is also on the list.

Check out short videos: - Demo: Headless False - https://scrapecopilot.ai/assets/demo-headless-false-Dhc_jeNR... - Demo: Headless True - https://scrapecopilot.ai/assets/demo-headless-true-PRQndDxP....

I'd love to hear your thoughts, feedback, and any suggestions!

AI enthusiasts in a race against time, AI skeptics in a race against entropy

https://charity.wtf/2026/06/15/ai-demands-more-engineering-discipline-not-less-xpost/
1•The_Fox•1m ago•0 comments

An LLM agent that emits typed intent

https://github.com/gabert/ontocortex
1•gabert•3m ago•0 comments

Straw: Compress big infra into one md file – 99.5% LLM token reduction

https://github.com/ilyesarf/straw/
1•ilyesarf•3m ago•0 comments

U.S. health spending on pace to hit $6T

https://www.statnews.com/2026/06/24/health-care-spending-up-7-point-3-percent-6-trillion-dollars-...
1•brandonb•4m ago•0 comments

Jest/Vitest interactive course (runs in the browser)

https://howtotestfrontend.com/courses/jest-vitest-fundamentals
1•howToTestFE•6m ago•1 comments

Show HN: Dspyer – self-correcting, optimizable LLM steps for DSPy and LangGraph

https://github.com/theramkm/dspyer
1•ramkm•6m ago•0 comments

You Increased Your Prices – Did It Help or Hurt?

1•kingmailer•12m ago•0 comments

Lost Indiana Jones Adventure Discovered [video]

https://www.youtube.com/watch?v=HhTUUmQKmFU
1•austinallegro•13m ago•0 comments

Taiwan Chip Firm ASE Expands for AI Boom

https://fivetakes.news/taiwans-ase-expands-capacity-to-meet-ai-demand
1•mmeirovich•13m ago•0 comments

Bitwarden icons bidirectional C2 channel

https://thecontractor.io/bitwarden-c2/
1•bialyalibaba•13m ago•0 comments

Slop Paralysis

https://elijahpotter.dev/articles/slop-paralysis
2•chilipepperhott•13m ago•0 comments

Show HN: Slick, a desktop client mod for Slack

https://github.com/3kh0/slick
1•Agreed3750•15m ago•0 comments

Astryx – open-source design system customizable and agent ready

https://astryx.atmeta.com/
5•peterhunt•16m ago•0 comments

TopoGlyph: A dual-encoding topological language

https://topoglyph.net
1•zwyld•16m ago•1 comments

Earth to Cosmic Clusters

https://www.facebook.com/share/r/14kyEg4LWNd/
1•Asheed•16m ago•0 comments

Export controls for Fable are too late to slow proliferation

https://dualuse.dev/posts/export-controls-on-fable
1•lebovic•20m ago•1 comments

How your generosity made Weblate better for everyone

https://antennapod.org/de/blog/2026/06/weblate
1•ericdanielski•22m ago•0 comments

I built a fleet-scale inference control plane using Crossplane

https://blog.crossplane.io/building-modelplane/
1•negz•23m ago•1 comments

Elastic Layoffs?

4•nunocoracao•23m ago•2 comments

Google – Alphabet's Sour Soup

1•IAMAGINIT•25m ago•0 comments

Dev shops sell you seniors, then staff the work with juniors

https://twoheads.net/dev-shops-sell-you-seniors/
5•hey-fk•26m ago•0 comments

Robusta's Reckoning: Vietnam's Coffee Boom Running Out of Forest, Water and Time

https://coffeewatch.org/vietnams-robustas-reckoning/
2•littlexsparkee•32m ago•0 comments

Pillars of an Autonomous Agentic System

https://sohit.substack.com/p/pillars-of-an-autonomous-agentic
1•sohitkeshri•32m ago•0 comments

Using the Gini Coefficient to Plan Edge Capacity

https://www.fastly.com/blog/using-gini-coefficient-plan-edge-capacity
5•bshanks•32m ago•0 comments

How Physicists Track and Trap the Elusive Neutrino

https://www.quantamagazine.org/how-physicists-track-and-trap-the-elusive-neutrino-20260624/
2•wasting_time•35m ago•0 comments

Code review powered by an LLM council

https://dromeas.ai/blog/code-review-evolved
1•manos-saratsis•36m ago•1 comments

Incoming: Vanguard On-Demand

https://www.riotgames.com/en/news/vanguard-on-demand
1•Nuthen•37m ago•1 comments

Submodular Context Selection as a Pluggable Engine for LLM Agents

https://arxiv.org/abs/2606.20047
1•Elof•39m ago•0 comments

A knowledge graph blog that turns an Obsidian vault of Markdown notes

https://github.com/halit/hblog-ng
1•nofool•39m ago•0 comments

Record Type Inference for Dummies

https://haskellforall.com/2026/06/record-type-inference-for-dummies
2•birdculture•42m ago•0 comments