frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

How to Just Do a Thing

https://www.raptitude.com/2026/05/how-to-just-do-a-thing/
1•crescit_eundo•56s ago•0 comments

Langwag: Learn a Language from News Stories

https://langwag.com/en
1•danosull•2m ago•1 comments

Dell confirms its SupportAssist software causes Windows BSOD crashes

https://www.bleepingcomputer.com/news/software/dell-confirms-its-supportassist-software-causes-wi...
1•Brajeshwar•2m ago•0 comments

VectorSmuggle: Steganographic exfiltration in vector embedding stores

https://arxiv.org/abs/2605.13764
1•smugglereal•3m ago•0 comments

I cut my OpenRouter bill 43% by fixing my own prompts. Then shipped the tool

https://fixmyprompt.net/
1•chewymole•3m ago•0 comments

From latency to instant: Modernizing GitHub Issues navigation performance

https://github.blog/engineering/architecture-optimization/from-latency-to-instant-modernizing-git...
1•soheilpro•3m ago•0 comments

The Graveyard of the Internet

https://doublespeed.ai/
1•nwhnwh•3m ago•0 comments

Httpx2

https://tildeweb.nl/~michiel/httpx2.html
1•thm•6m ago•0 comments

The Agent Is a Workflow That Writes Itself

https://www.getauctor.com/blog/the-agent-is-a-workflow-that-writes-itself
4•tjbai•6m ago•0 comments

Interrogatory LLM

https://martinfowler.com/bliki/InterrogatoryLLM.html
1•soheilpro•6m ago•0 comments

Cuba says it has run out of fuel, blames U.S. embargo

https://www.upi.com/Top_News/World-News/2026/05/14/Cuba-says-oil-reserves-totally-drained/9311778...
2•thm•8m ago•0 comments

"This was a user-friendly computer."

https://unsung.aresluna.org/this-was-a-user-friendly-computer/
1•bobbiechen•9m ago•0 comments

All 11 onboard survived after plane made emergency landing near the Bahamas

https://www.theguardian.com/us-news/2026/may/14/florida-atlantic-plane-rescue-fuel-low
1•tosh•10m ago•0 comments

Magnetic switch for multimodal, chip-free, and batteryless airway sensing

https://www.science.org/doi/10.1126/sciadv.aed3998
1•PaulHoule•10m ago•0 comments

Restoring the Flow: A Milestone in the Revival of the Everglades

https://e360.yale.edu/features/picayune-strand-restoration-project
1•Brajeshwar•11m ago•0 comments

'The Most Bipartisan Issue Since Beer': Opposition to Data Centers

https://www.nytimes.com/2026/05/01/us/politics/liberals-conservatives-data-centers.html
2•Teever•11m ago•0 comments

Performance per engineer doubled at six big tech companies, and the metric used

https://medium.com/@navigara/performance-per-engineer-doubled-at-six-big-tech-companies-heres-how...
1•alienll•12m ago•0 comments

Turning recorded D&D sessions into comics

https://haffi112.github.io/2026/05/14/dnd-comics/
1•haffi112•12m ago•0 comments

Req Is All You Need

https://benreinhart.com/blog/req-is-all-you-need/
2•ezekg•13m ago•0 comments

I Prefer Djot over Markdown

https://www.karl.berlin/djot.html
1•Tomte•14m ago•0 comments

Apple's Security Has Been Tough to Crack. Mythos Helped Find a Way In

https://www.wsj.com/tech/ai/anthropic-mythos-apple-macos-bug-339da403
2•jbredeche•14m ago•0 comments

You didn't start a business to write content

https://www.muana.page/
1•oktra_dev•14m ago•0 comments

Desperate Trump Taps "Tim Apple," Jensen Huang, Elon Musk to Attend Xi Summit

https://arstechnica.com/tech-policy/2026/05/desperate-trump-taps-tim-apple-jensen-huang-elon-musk...
3•rbanffy•16m ago•0 comments

The SpaceX IPO is upending the stock market

https://www.axios.com/2026/05/14/spacex-musk-sp-stocks
1•Brajeshwar•21m ago•1 comments

Cyber lack of security and AI governance

https://thezvi.substack.com/p/cyber-lack-of-security-and-ai-governance
1•maxutility•21m ago•0 comments

I let Claude autonomously create a graphic novel

https://twitter.com/deepwhitman/status/2054466408835350985
1•bilater•23m ago•1 comments

Extracting the Root VCEK Seed on EPYC Milan via a Software-Only Attack

https://arxiv.org/abs/2605.12990
1•Timofeibu•25m ago•0 comments

RTX 5090 and M4 MacBook Air: Can It Game?

https://scottjg.com/posts/2026-05-05-egpu-mac-gaming/
3•allenleee•25m ago•0 comments

Fully-functional RTX 3070 16GB gets frankensteined into existence

https://www.tomshardware.com/pc-components/gpus/fully-functional-rtx-3070-16gb-gets-frankensteine...
2•Timofeibu•25m ago•0 comments

Help People Up the Stairs

https://reactiverobot.com/writing/stairs.html
1•ref-matt•26m ago•0 comments