Show HN: VibeScrape – Paste a URL and JSON schema, get working web scraper code

https://vibescrape.ai/
1•sourdesi•2h ago
Hey HN, I built VibeScrape — it takes a website URL and a JSON schema describing the data you want, then analyzes the page, writes real Python code to extract that data, and refines the code until the output is accurate.

While there are lots of tools these days (e.g. Firecrawl) that feed the entire HTML of a webpage to an LLM to extract data from it, this has always seemed like a really slow and expensive approach to me.

On the other hand, handwriting web-scraping code seems archaic at this point. This type of code is incredibly tedious to write, and immediately becomes throwaway code once the webpage's layout changes even a little.

VibeScrape aims to automate the process of writing this type of code.

1. Grabs the rendered HTML (the same view a browser sees).
2. Has an LLM extract data from the HTML into your target JSON schema (the "ground truth").
3. Generates Python scraper code to reproduce that "ground truth" output.
4. Runs the code and compares the results against the ground truth.
5. Refines the code automatically until the outputs match.
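
To make steps 3 to 5 concrete, here is a minimal sketch of what such a generate/run/compare/refine loop could look like. It is not VibeScrape's actual code: `refine_until_match`, `diff_fields`, and `fake_generate_code` are hypothetical names, the LLM call is replaced by a stub that returns a wrong scraper first and a corrected one once it sees mismatches, and the generated code is assumed to define a `scrape(html)` function that returns a dict.

```python
from __future__ import annotations

from typing import Any, Callable


def diff_fields(expected: dict[str, Any], actual: dict[str, Any]) -> list[str]:
    """Return the sorted keys whose values differ between the two records."""
    keys = sorted(set(expected) | set(actual))
    return [k for k in keys if expected.get(k) != actual.get(k)]


def refine_until_match(
    html: str,
    ground_truth: dict[str, Any],
    generate_code: Callable[[str, dict[str, Any], list[str]], str],
    max_rounds: int = 5,
) -> tuple[str | None, int]:
    """Generate scraper code, run it, and retry until its output matches."""
    feedback: list[str] = []  # mismatches or errors from the previous round
    for round_no in range(1, max_rounds + 1):
        code = generate_code(html, ground_truth, feedback)
        namespace: dict[str, Any] = {}
        try:
            # Assumption: a real system would run this in a sandbox,
            # not with a bare exec() in the host process.
            exec(code, namespace)
            output = namespace["scrape"](html)
            feedback = diff_fields(ground_truth, output)
        except Exception as exc:
            feedback = [f"error: {exc}"]
            continue
        if not feedback:
            return code, round_no
    return None, max_rounds


def fake_generate_code(
    html: str, ground_truth: dict[str, Any], feedback: list[str]
) -> str:
    """Stand-in for the LLM: wrong on the first try, fixed after feedback."""
    if not feedback:
        return "def scrape(html):\n    return {'title': ''}\n"
    return (
        "import re\n"
        "def scrape(html):\n"
        "    m = re.search(r'<h1>(.*?)</h1>', html)\n"
        "    return {'title': m.group(1) if m else ''}\n"
    )


sample_html = "<html><body><h1>Hello</h1></body></html>"
sample_truth = {"title": "Hello"}  # stands in for the LLM-extracted ground truth

final_code, rounds_used = refine_until_match(
    sample_html, sample_truth, fake_generate_code
)
print(f"matched: {final_code is not None}, rounds used: {rounds_used}")
```

Passing the previous round's mismatches back into the generator is one plausible way to drive the refinement; the post does not say what feedback the real system gives the model, or how many rounds it allows.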

I've found that letting the LLM control both code generation and iteration end to end works pretty well: it has produced working scraper code for many of the websites I've tested it on!

It still has some limitations in handling pagination, captchas, infinite scrolling, etc. I'm hoping to get some early feedback from the HN community to see if this is a valuable tool. There's a promo code FIRST5 on the site that gets you 5 credits for free, but I'm happy to give more credits to anyone who reaches out at contact@vibescrape.ai!

Thanks!

Emotions Recognized in Vocal Bursts

https://s3-us-west-1.amazonaws.com/vocs/map.html
1•rendx•59s ago•0 comments

A bestiary of single-file programming language implementations

https://github.com/marcpaq/b1fipl
1•fanf2•1m ago•0 comments

Veezow – See your site the way AI does

https://www.veezow.com/
1•Atbech•5m ago•0 comments

Chip Hall of Fame: Intel 8088 Microprocessor

https://spectrum.ieee.org/chip-hall-of-fame-intel-8088-microprocessor
1•stmw•6m ago•0 comments

The tiny library [the Bibliomotocarro] bringing books to remote villages (2019)

https://www.bbc.com/culture/article/20190125-the-tiny-library-bringing-books-to-remote-villages
1•toomuchtodo•7m ago•1 comments

The Multiverse (1995)

https://www.youtube.com/watch?v=SDZ454K_lBY
1•jinwoo68•9m ago•1 comments

China's analogue AI chip could work 1k times faster than Nvidia GPU: study

https://www.scmp.com/news/china/science/article/3329820/chinas-analogue-ai-chip-could-work-1000-t...
1•JSR_FDED•11m ago•1 comments

Milton Glaser I ♥ NY concept sketch

https://www.moma.org/collection/works/128649
1•geox•13m ago•0 comments

Clojure Runs ONNX AI Models Now – Join the AI Fun

https://dragan.rocks/articles/25/Clojure-Runs-ONNX-AI-Models-Now
2•savodj•14m ago•0 comments

Bind CVE-2025-40778: Cache poisoning attacks with unsolicited RRs

https://kb.isc.org/docs/cve-2025-40778
1•zdw•15m ago•0 comments

Hatch Act Guidance on Social Media [pdf]

https://osc.gov/Documents/Hatch%20Act/Advisory%20Opinions/Federal/Social%20Media%20Guidance.pdf
1•triska•15m ago•0 comments

Show HN: Hermes – Self-hosted video downloader

https://github.com/TechSquidTV/Hermes
3•TechSquidTV•22m ago•0 comments

KanDDDinsky 2025: Our First Time at Europe's Community-Driven DDD Conference

https://docs.eventsourcingdb.io/blog/2025/10/27/kandddinsky-2025-our-first-time-at-europes-commun...
1•goloroden•24m ago•0 comments

Cat Bombs More Prevalent Than Previously Thought (2013)

https://www.theatlantic.com/technology/archive/2013/02/update-cat-bombs-more-prevalent-than-previ...
2•gscott•26m ago•1 comments

Department of Labor creates "Project Firewall" to stop high skilled H-1B workers

https://bsky.app/profile/joshtpm.bsky.social/post/3m3yang3ndc25
2•ck2•26m ago•0 comments

Show HN: Doodl – Free Online Doodling Canvas (No Account Needed)

http://doodl.it.com/
1•tidalboot•28m ago•0 comments

Show HN: Zimic: TypeScript-first HTTP integrations

https://zimic.dev/
1•diego-aquino•30m ago•0 comments

Regional coordination can alleviate the cost burden of a low-carbon electricity

https://www.nature.com/articles/s41467-025-64093-8
1•wslh•30m ago•0 comments

Ask HN: Handwriting OCR Options?

1•giantg2•30m ago•1 comments

Unix Shell Basics

https://elixirestonia.github.io/2025-04-25-shell-novice/
1•mooreds•30m ago•0 comments

Halloween candy's getting lighter on the chocolate

https://www.marketplace.org/story/2025/10/24/halloween-candys-getting-lighter-on-the-chocolate
1•mooreds•30m ago•0 comments

Programming Languages as Languages (2014)

https://programmingzen.com/programming-languages-as-languages/
2•todsacerdoti•37m ago•0 comments

We Saved $500k per Year by Rolling Our Own "S3"

https://engineering.nanit.com/how-we-saved-500-000-per-year-by-rolling-our-own-s3-6caec1ee1143
2•mpweiher•37m ago•0 comments

Git commit hashes that spark joy

https://tylercipriani.com/blog/2024/09/29/subliminal-git-commits/
1•thcipriani•38m ago•1 comments

How Netflix's Nuclear War Movie Holds Up to the Real World

https://www.nytimes.com/2025/10/23/opinion/house-of-dynamite-bigelow-nuclear.html
2•bookofjoe•41m ago•1 comments

FIFA's 2026 ticket scheme is a late-capitalist hellscape

https://www.theguardian.com/football/2025/oct/11/fifa-2026-world-cup-tickets-dynamic-pricing-nft-...
4•PaulHoule•42m ago•1 comments

Connecticut needs to plan for its energy future

https://rhodeislandcurrent.com/2025/10/26/conn-needs-to-plan-for-its-energy-future-but-the-view-i...
1•chmaynard•42m ago•0 comments

ExecuTorch 1.0

https://pytorch.org/blog/introducing-executorch-1-0/
2•jonbaer•42m ago•0 comments

Show HN: iOS 26 broke my favorite feature. I built my own mobile browser

https://testflight.apple.com/join/Z9A3P1gg
2•GabrielMMMM•42m ago•1 comments

Physicists Find Hidden Quantum Mirrors That Trap Light in 2D Materials

https://scitechdaily.com/physicists-find-hidden-quantum-mirrors-that-trap-light-in-2d-materials/
1•westurner•45m ago•2 comments