Show HN: I built a Web-Scraper API that is 6-7x more efficient than current ones

https://scrapewithruno.com/
15•polaritymaking•1h ago
Runo is a web-scraping API that returns typed, structured JSON. You define a schema (field name, type, example value), and Runo fetches the page and returns the data. No HTML, no parsers, no post-processing.
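To make the "field name, type, example value" idea concrete, here is a minimal local sketch of what such a schema and a type check on the returned JSON could look like. This is not Runo's actual API: the `Field` class, the `conforms` helper, and the product fields are all hypothetical names chosen for illustration, and no network call is made.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class Field:
    """One schema entry: field name, expected type, and an example value (hypothetical shape)."""
    name: str
    type: type
    example: Any


# Hypothetical schema for a product page; these fields are assumptions, not from the post.
SCHEMA = [
    Field("title", str, "Blue Widget"),
    Field("price", float, 19.99),
    Field("in_stock", bool, True),
]


def conforms(record: dict, schema: list) -> bool:
    """Return True if the record has exactly the schema's fields, each with the exact expected type."""
    if set(record) != {f.name for f in schema}:
        return False
    # Use an exact type match so that a bool is not accepted where an int or float is expected.
    return all(type(record[f.name]) is f.type for f in schema)


# A made-up response body, standing in for the structured JSON such a service might return.
sample = {"title": "Red Widget", "price": 4.5, "in_stock": False}
print(conforms(sample, SCHEMA))
```

A check like this is the kind of post-processing a typed response would let a caller skip or reduce to a one-line assertion.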

Over the past few weeks, I have been building this nonstop. Currently, every scraper API out there solves the site-fetching problem but leaves the extraction of the actual data entirely to users. Runo makes that step disappear completely.

For Runo, I went ahead and added JS rendering, stealth mode, and full LLM extraction to make it fully functional and capable of scraping most, if not all, sites.

Another major problem with current web scrapers is that they charge per feature or bundle features into expensive credit tiers. A single large or JS-rendered request can cost 5-75 credits, which means the credits in their plans cover far fewer requests than they appear to. Runo is flat per request, no matter the site. At the Scale tier, Runo works out to $0.90 per 1,000 effective requests vs. around $6 for the nearest Firecrawl equivalent. My jaw dropped when I was testing Runo and came across these numbers.
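The arithmetic behind the "6-7x" figure in the title can be checked from the two prices quoted above. The sketch below also shows how a per-request credit cost of 5 to 75 shrinks a credit plan into "effective requests"; the 100,000-credit plan size is a made-up number for illustration, not a figure from any provider.

```python
# Prices quoted in the post, in USD per 1,000 effective requests.
RUNO_PER_1K = 0.90
COMPETITOR_PER_1K = 6.00  # the post says "around $6", so treat this as approximate

# Cost ratio between the two quoted prices.
ratio = COMPETITOR_PER_1K / RUNO_PER_1K


def effective_requests(credits: int, credits_per_request: int) -> int:
    """Number of whole requests a credit balance covers at a given per-request credit cost."""
    return credits // credits_per_request


# Hypothetical plan size, used only to illustrate the 5-75 credit range from the post.
PLAN_CREDITS = 100_000
best_case = effective_requests(PLAN_CREDITS, 5)
worst_case = effective_requests(PLAN_CREDITS, 75)

print(f"cost ratio: {ratio:.2f}x")
print(f"requests at 5 credits each: {best_case}")
print(f"requests at 75 credits each: {worst_case}")
```

With the quoted prices the ratio comes out to roughly 6.67, which is consistent with the 6-7x claim as long as the "around $6" comparison figure holds.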

I created a free tier of 500 requests/month, no credit card required. Take it for a spin and let me know what can be improved. I would love feedback.

Comments

eddy-sekorti•1h ago
Good. How much can be done in 500 requests/month? I want to try it for something.
rvz•1h ago
How does this work against sites that use Google's next-gen reCAPTCHA that uses hardware attestations?
drewrbaker•1h ago
Do a lot of scraping at work. Curious how you handle simulating UI interactions? Or if I can supply cookies in my requests?
superfrank•55m ago
I've done a lot of work in the scraping space for a side project I've worked on for years. At this point I've built my own tools, but over the years I've tested just about every paid scraping service that currently exists.

I think it would be really valuable to add some sort of tool that lets me enter a URL and see a subset of the data returned, a latency measurement, and a flag for whether the Enhanced Bot Detection was required to get that data.

I can't count the number of times I've signed up for a service to see if they could get data from a URL that's giving me problems, only to find that they couldn't scrape it either. It would be really helpful to know before I sign up what I need to pay for and that your tool can actually get the data I want. While it may result in fewer sign-ups for your site, I think it will result in the customers who do sign up being higher-value customers and potentially reduce your support burden.

jeromechoo•31m ago
You’re probably also looking for that tool to be available unauthed. And yeah agreed. We do this at Diffbot and the test drive is the 2nd highest visited page.
nlitened•48m ago
Now we’re the forum for captcha-evading scrapers, boys
djoldman•44m ago
> Runo extracts by meaning, not DOM position. Site redesigns and HTML changes won't ever break your pipeline.

Bold claim.

Removing the modem and GPS from my 2024 RAV4 hybrid

https://arkadiyt.com/2026/05/13/removing-the-modem-and-gps-from-my-rav4/
477•arkadiyt•5h ago•277 comments

Amazonbot is finally respecting robots.txt

https://xeiaso.net/notes/2026/amazonbot-respecting-robots-txt/
80•xena•2h ago•20 comments

First public macOS kernel memory corruption exploit on Apple M5

https://blog.calif.io/p/first-public-kernel-memory-corruption
166•quadrige•4h ago•25 comments

RTX 5090 and M4 MacBook Air: Can It Game?

https://scottjg.com/posts/2026-05-05-egpu-mac-gaming/
438•allenleee•7h ago•114 comments

New Nginx Exploit

https://github.com/DepthFirstDisclosures/Nginx-Rift
242•hetsaraiya•5h ago•57 comments

Tesla Wall Connector bootloader bypasses the firmware downgrade ratchet

https://www.synacktiv.com/en/publications/exploiting-the-tesla-wall-connector-from-its-charge-por...
35•p_stuart82•2h ago•0 comments

Work with Codex from Anywhere

https://openai.com/index/work-with-codex-from-anywhere/
85•mikeevans•2h ago•23 comments

A Few Words on DS4

https://antirez.com/news/165
13•caust1c•20m ago•1 comment

RISC-V Router

https://router.start9.com/
37•janandonly•2h ago•21 comments

Infracost (YC W21) Is Hiring Sr Dev Advocate to make agents cloud cost-aware

https://www.ycombinator.com/companies/infracost/jobs/NzwUQ7c-senior-developer-advocate
1•akh•1h ago

Wrap Go binaries in Python wheels

https://github.com/simonw/go-to-wheel
10•ankitg12•2d ago•4 comments

Porting 3D Movie Maker to Linux

https://benstoneonline.com/posts/porting-3d-movie-maker-to-linux/
45•speckx•3d ago•10 comments

HDD Firmware Hacking

https://icode4.coffee/?p=1465
106•jsploit•6h ago•9 comments

OVMS: Open source electric vehicle remote monitoring, diagnosis and control

https://www.openvehicles.com/home
10•BHSPitMonkey•59m ago•2 comments

The Biochemical Beauty of Retatrutide: How GLP-1s Work

https://acesounderglass.com/2025/10/13/the-biochemical-beauty-of-retatrutide-how-glp-1s-actually-...
25•surprisetalk•3d ago•14 comments

Claude for Legal

https://github.com/anthropics/claude-for-legal
50•Einenlum•1h ago•40 comments

The Power of a Free Popsicle (2018)

https://www.gsb.stanford.edu/insights/power-free-popsicle
54•NaOH•4h ago•23 comments

New arXiv policy: 1-year ban for hallucinated references

https://twitter.com/tdietterich/status/2055000956144935055
135•gjuggler•2h ago•14 comments

Int a = 5; a = a++ + ++a; a =? (2011)

https://gynvael.coldwind.pl/?id=372
75•e-topy•2d ago•131 comments

Computer Hobby Movement in Canada

https://museum.eecs.yorku.ca/exhibits/show/hobby_canada/hobby_canada
173•rbanffy•9h ago•59 comments

A message from President Kornbluth about funding and the talent pipeline

https://president.mit.edu/writing-speeches/video-transcript-message-president-kornbluth-about-fun...
552•dmayo•7h ago•620 comments

You Don't Align an AI, You Align with It

https://danieltan.weblog.lol/2026/05/you-dont-align-an-ai-you-align-with-it
80•danieltanfh95•4h ago•40 comments

Understanding the Linux Kernel: The Linux Kernel Startup

https://internals-for-interns.com/posts/linux-kernel-startup/
69•valyala•4h ago•11 comments

What's in a GGUF, besides the weights – and what's still missing?

https://nobodywho.ooo/posts/whats-in-a-gguf/
66•bashbjorn•5h ago•29 comments

WinUI 3 Performance: A Leap Forward

https://github.com/microsoft/microsoft-ui-xaml/discussions/11096
73•whatever3•3h ago•55 comments

AI is making me dumb

https://jpain.io/god-damn-ai-is-making-me-dumb/
359•Eighth•4h ago•221 comments

DIY open-source ultrasound hardware on the rp2040/rp2350

http://un0rick.cc/pic0rick
36•kelu124•4h ago•3 comments

Rewrite Bun in Rust has been merged

https://github.com/oven-sh/bun/pull/30412
435•Chaoses•14h ago•511 comments

Fossils show millipede and centipede ancestors evolved legs underwater

https://phys.org/news/2026-05-ancient-sea-fossils-millipede-centipede.html
69•gmays•3d ago•2 comments

60fps Video on a CGA? – The GlyphBlaster

https://martypc.blogspot.com/2026/05/60fps-video-on-cga-glyphblaster.html
67•tambourine_man•4d ago•9 comments