Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
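For readers who want to see steps 3 and 4 as code, here is a minimal Python sketch using only the standard library. It is not how Autonoly does it internally. The sample markup, the class names, and the helper names (`FIELDS`, `first_text`, `extract_rows`, `to_csv`) are all made up for illustration, and the real YC directory DOM looks different. Each field gets a list of candidate XPath patterns that are tried in order, which is one possible way to cope with listings whose structure differs slightly.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical, simplified markup standing in for the rendered page.
# The second listing nests its name one level deeper than the first.
SAMPLE = (
    "<div>"
    "<a class='company'><span class='name'>Acme</span>"
    "<span class='desc'>Rockets</span><span class='loc'>Austin, TX</span></a>"
    "<a class='company'><div><span class='name'>Bolt</span></div>"
    "<span class='desc'>Fast payments</span></a>"
    "</div>"
)

# Assumed field-to-XPath mapping; each list is tried in order.
FIELDS = {
    "name": ["./span[@class='name']", "./div/span[@class='name']"],
    "description": ["./span[@class='desc']"],
    "location": ["./span[@class='loc']"],
}


def first_text(node, paths):
    """Return the text of the first path that matches, or '' if none do."""
    for path in paths:
        found = node.find(path)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company listing found in the markup."""
    root = ET.fromstring(markup)
    return [
        {field: first_text(card, paths) for field, paths in FIELDS.items()}
        for card in root.findall(".//a[@class='company']")
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(FIELDS))
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE)))
```

Note that `xml.etree.ElementTree` supports only a subset of XPath and needs well-formed markup, so a real page would need a full HTML parser or the browser's own XPath engine.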

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
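To make the scroll-timing problem concrete, here is a rough Python sketch of a progressive scroll loop. It is a variation on what I described, not the exact implementation: instead of always running a fixed number of iterations, it caps the loop at 150 scrolls and also stops early once the item count has stopped growing. The function name, its parameters (`delay_s`, `patience`), and the `FakePage` stand-in are all hypothetical. With a real browser, `scroll_once` and `count_items` would wrap whatever the automation tool provides.

```python
import time
from typing import Callable


def scroll_until_loaded(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_scrolls: int = 150,
    delay_s: float = 0.5,
    patience: int = 3,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll up to max_scrolls times, pausing delay_s after each scroll.

    Stops early once the item count has not grown for `patience`
    consecutive scrolls. Returns the last observed item count.
    """
    last_count = count_items()
    stalled = 0
    for _ in range(max_scrolls):
        scroll_once()
        sleep(delay_s)  # give the page time to load the next batch
        current = count_items()
        if current > last_count:
            last_count, stalled = current, 0
        else:
            stalled += 1
            if stalled >= patience:
                break
    return last_count


class FakePage:
    """Stand-in for a browser page that loads `batch` items per scroll."""

    def __init__(self, total: int, batch: int) -> None:
        self.total, self.batch = total, batch
        self.loaded = 0
        self.scrolls = 0

    def scroll(self) -> None:
        self.scrolls += 1
        self.loaded = min(self.total, self.loaded + self.batch)

    def count(self) -> int:
        return self.loaded


if __name__ == "__main__":
    page = FakePage(total=1000, batch=40)
    loaded = scroll_until_loaded(page.scroll, page.count, sleep=lambda s: None)
    print(loaded, page.scrolls)
```

The trade-off is in `delay_s` and `patience`: too short a delay and a slow batch looks like the end of the list, too long and the run wastes time or hits a timeout.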

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
