frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•6mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Washington Court: Data Captured on Flock Safety Cameras Are Public Records

https://www.eff.org/deeplinks/2025/11/washington-court-rules-data-captured-flock-safety-cameras-a...
1•hn_acker•30s ago•1 comments

Jimmy Wales, Co-Founder of Wikipedia, quits interview angrily after one question

https://old.reddit.com/r/interesting/comments/1ox0lkz/jimmy_wales_cofounder_of_wikipedia_quits/
1•martinlaz•8m ago•0 comments

Facts that might mess with your sense of time

https://www.datawrapper.de/blog/historic-events-time-overlap
1•glitchdout•10m ago•0 comments

USDA head says 'everyone' on SNAP will now have to reapply

https://thehill.com/homenews/administration/5606715-agriculture-secretary-snap-reapply/
2•sipofwater•10m ago•1 comments

Developers Can Choose the Right AI Coding Tool Without Feeling Overwhelmed

https://practicalsecurity.substack.com/p/the-complete-2025-guide-to-ai-coding
1•atilla_bilgic•10m ago•0 comments

Show HN: OpEx, an agentic LLM toolkit for Elixir

https://github.com/kenforthewin/opex
2•kenforthewin•12m ago•0 comments

Fumadocs – open-source React.js docs framework

https://github.com/fuma-nama/fumadocs
1•el_hacker•12m ago•0 comments

Google DeepMind is using Gemini to train agents inside Goat Simulator 3

https://www.technologyreview.com/2025/11/13/1127921/google-deepmind-is-using-gemini-to-train-agen...
1•gnabgib•16m ago•0 comments

CAIDA Spoofer - minimize the Internet's susceptibility to spoofed DDoS attacks

https://www.caida.org/projects/spoofer/
1•pabs3•18m ago•0 comments

Networks vulnerable to IP spoofing for the last year

https://spoofer.caida.org/as_stats.php
1•pabs3•19m ago•0 comments

From ETL to AI(e)tl: Rethinking Data Pipelines for the AI Era

https://evanvolgas.substack.com/p/from-etl-to-aietl-rethinking-data
1•nthanki•21m ago•0 comments

Go's Sweet 16

https://go.dev/blog/16years
8•0xedb•22m ago•0 comments

Keeping dependencies up to date with Renovate

https://kiersnowski.dev/posts/renovate/
2•Ganji00•22m ago•1 comments

Intro to Routing: Mixture-of-Experts and Expert Choice

https://www.neelsomaniblog.com/p/intro-to-routing-mixture-of-experts
1•nsomani•23m ago•0 comments

AI Village

https://theaidigest.org/village
2•vinhnx•24m ago•1 comments

XAML and Flutter with Hot Reload

https://twitter.com/wieslawsoltes/status/1989445377356173444
1•wiso•28m ago•0 comments

Coding Trance Music from Scratch (Switch Angel) [video]

https://www.youtube.com/watch?v=iu5rnQkfO6M
2•aSig•30m ago•0 comments

New Glenn booster welds itself onto the deck after landing

https://old.reddit.com/r/SpaceXMasterrace/comments/1owfgoc/new_glenn_welds_itself_onto_the_deck_a...
6•huhtenberg•30m ago•0 comments

Obsidian IRC Client

https://github.com/ObsidianIRC/ObsidianIRC
1•rickcarlino•31m ago•0 comments

China's new aircraft supercarrier challenges U.S. dominance in Pacific

https://www.washingtonpost.com/world/2025/11/14/us-china-fujian-aircraft-carrier-pacific/
1•bookofjoe•31m ago•1 comments

Russia Produces More Artillery Than All of NATO Combined

https://aaronlee.substack.com/p/russia-is-producing-more-artillery
1•aajailee•33m ago•0 comments

Live Asteroid and Neo Visualization

https://untilanasteroid.com/
1•dimarco•34m ago•0 comments

Sam Altman celebrates ChatGPT following em dash formatting rules

https://arstechnica.com/ai/2025/11/forget-agi-sam-altman-celebrates-chatgpt-finally-following-em-...
1•fraXis•34m ago•0 comments

Grokipedia cites a Nazi forum and fringe conspiracy websites

https://indicator.media/p/grokipedia-cites-a-nazi-forum-and-fringe-conspiracy-websites
3•tastyface•35m ago•1 comments

Ask HN: Interviewing currently (or recently)? how have interviews changed?

2•akudha•35m ago•0 comments

A zero dependency deterministic placeholder text generator in TypeScript

https://github.com/lokicik/placetext
1•lokicik•37m ago•1 comments

UFC fighter rejects White House cage match: '"Hunger Games" type of f‑ing s‑‑‑'

https://thehill.com/blogs/in-the-know/5606562-ufc-fighter-slams-white-house-matches/
10•c420•38m ago•2 comments

SSL Configuration Generator

https://ssl-config.mozilla.org/
1•smartmic•40m ago•0 comments

Convergence vs. Consensus in Distributed Systems

https://gowthamk.github.io/convergence-vs-consensus.html
2•typsfty•45m ago•0 comments

All operational US LNG terminals have violated pollution limits

https://www.theguardian.com/us-news/2025/oct/29/liquefied-natural-gas-terminal-pollution-limits
2•PaulHoule•46m ago•0 comments