frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•10mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

How the Iran war threatens global food supply

https://www.npr.org/2026/03/20/nx-s1-5750812/how-the-iran-war-threatens-global-food-supply
1•kaycebasques•1m ago•0 comments

When Amazon and JD.com lock horns, it's shoppers that win

https://www.ft.com/content/c90aee36-1d5e-4a44-bc83-51f244ccd056
1•hhs•1m ago•0 comments

Fluux Messenger 0.14.0 – A Modern Cross Platform XMPP Client (TypeScript)

https://www.process-one.net/blog/fluux-messenger-0-14-0/
1•neustradamus•2m ago•0 comments

Tech Billionaires Renege on Giving Pledge

https://www.nytimes.com/2026/03/15/business/the-billionaire-backlash-against-a-philanthropic-drea...
1•theahura•2m ago•1 comments

Red kite with sausage roll snapped by photographer

https://www.bbc.com/news/articles/cx2g290ve2vo
1•mooreds•3m ago•0 comments

Weight-loss treatment is on the verge of a dramatic shift – again

https://www.cnn.com/2026/03/19/health/weight-loss-drugs-glp-1
1•mooreds•3m ago•0 comments

Another sodium-ion EV battery emerges in China with 4C fast charging in 11 mins

https://electrek.co/2026/03/20/sodium-ion-ev-battery-breakthrough-achieves-11-min-fast-charging/
1•breve•3m ago•0 comments

Cardea was the ancient Roman goddess of the hinge

https://en.wikipedia.org/wiki/Cardea
1•mooreds•3m ago•0 comments

We Crawled 479 Pages to Find What AI Platforms Cite – It's Not What SEO Says

https://aiplusautomation.com/blog/what-ai-platforms-actually-cite
1•anthonylee991•4m ago•0 comments

LibreOffice's native format ODF Python library, odfpy, is abandoned

https://github.com/eea/odfpy/issues/123
1•nogajun•5m ago•0 comments

Open Source becomes standard in Germany's Administration

https://www.heise.de/en/news/Administration-Open-Source-becomes-standard-11219712.html
1•doener•5m ago•0 comments

MacBook Neo, best repairability score in years, still 6/10

https://www.ifixit.com/News/116152/macbook-neo-is-the-most-repairable-macbook-in-14-years
1•hackerBanana•7m ago•0 comments

Cyberattack leaves drivers with breathalyzer systems unable to start vehicles

https://wgme.com/news/local/cyberattack-leaves-maine-drivers-with-breathalyzer-test-systems-unabl...
1•iamnothere•7m ago•0 comments

Show HN: Coding vs. Learning with LLMs

https://substack.com/@bxrne/p-191061382
1•bxrne•8m ago•0 comments

Elon Musk misled Twitter investors, jury finds

https://www.bbc.co.uk/news/articles/c62j3yl842eo
4•ColinWright•8m ago•0 comments

Google Keep Is Down

https://downdetector.com/status/google-keep/
2•ortusdux•12m ago•0 comments

For Banksy, crime does actually pay

https://www.nationalreview.com/2026/03/for-banksy-crime-does-actually-pay/
1•hhs•13m ago•0 comments

Rawq – semantic code search for AI agents (4x fewer wasted tokens, Rust, OSS)

https://github.com/auyelbekov/rawq
1•Yerzhigit•13m ago•0 comments

Kevin Lewis: My AI Coding Setup (March 2026)

https://lws.io/blog/my-ai-coding-setup-march-2026/
2•nadis•13m ago•0 comments

The Acqui-Hire Is No Longer a Distress Sale

https://www.heavybit.com/library/article/the-acqui-hire-is-no-longer-a-distress-sale
2•nadis•13m ago•0 comments

Production Is Where the Rigor Goes

https://www.honeycomb.io/blog/production-is-where-the-rigor-goes
1•kiyanwang•14m ago•0 comments

Elon Musk Misled Twitter Investors Before 2022 Buyout, Jury Says

https://www.bloomberg.com/news/articles/2026-03-20/elon-musk-misled-twitter-investors-before-2022...
6•toomanyrichies•16m ago•1 comments

Neugebauer Lutnick Confrontation hints at trouble with data center project

https://www.politico.com/news/2026/03/20/confrontation-ceo-and-lutnick-00838496
1•defrost•16m ago•0 comments

One Battle After Another: PTA and the Death of Revolutionary Cinema

https://old.reddit.com/r/TrueFilm/comments/1nv83g1/one_battle_after_another_paul_thomas_anderson_...
1•kaycebasques•17m ago•0 comments

How 30+ AI agent frameworks handle context rot, memory and tools

https://github.com/vasilyevdm/ai-agent-handbook
1•rocketrider•18m ago•0 comments

User Interface Hall of Fame (1999)

http://hallofshame.gp.co.at/mfame.htm
1•12_throw_away•18m ago•0 comments

Stash: Fast and easy local-first file sync for agents

https://github.com/telepath-computer/stash
4•stlhood•19m ago•0 comments

The Anti-Portfolio

https://www.bvp.com/anti-portfolio
1•gone35•21m ago•0 comments

After nearly $1M in donations,78‑year‑old DoorDash driver says he's not retiring

https://www.nbcdfw.com/news/national-international/doordash-driver-not-retiring-1m-donations/3998...
1•teleforce•23m ago•0 comments

AI Fatigue

https://thethinkingbuilder.substack.com/p/on-ai-fatigue
3•kondov•25m ago•0 comments