frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Facebook is paying people overseas promoting Alberta separatism

https://www.cbc.ca/news/canada/facebook-overseas-alberta-separtism-9.7223966
1•vrganj•23s ago•0 comments

Productivity Effects Across Generations of AI Coding Tools

http://muratbuffalo.blogspot.com/2026/06/writing-code-vs-shipping-code.html
1•ingve•5m ago•0 comments

A game's homemade crypto fell to a DIY supercomputer

https://www.ud2.rip/blog/towerunite/
1•vmfunc•8m ago•0 comments

Siri AI for iPhones and iPads will be delayed indefinitely in the EU

https://www.engadget.com/2189932/siri-ai-for-iphones-and-ipads-will-be-delayed-indefinitely-in-th...
1•adwmayer•10m ago•0 comments

QuillOS: The only Swift-first OS after macOS

https://quillOS.cloud/
1•ljlolel•11m ago•2 comments

Do Better Research with NotebookLM

https://blog.google/innovation-and-ai/products/notebooklm/better-research-notebooklm/
1•nkko•16m ago•0 comments

Is There a Link Between Listening to Music and Mental Health?

https://www.aesthetics.mpg.de/en/newsroom/news/news-article/article/is-there-a-link-between-liste...
1•XzetaU8•16m ago•0 comments

SpaceX CFO telecom analyst discuss

https://twitter.com/elonmusk/status/2064196509780893957
1•__patchbit__•19m ago•0 comments

Suprised to see the open data sources on internet

1•akd29121988•20m ago•0 comments

Stop Asking Claude to Agree with You

https://www.questionpro.com/engineering/engineering/developer%20tools/ai%20&%20machine%20learning...
1•skyDoesWork38•28m ago•0 comments

NASA's X-59 Aircraft Flies Supersonic for First Time

https://www.nasa.gov/aeronautics/x-59-first-supersonic-flight/
3•divbzero•32m ago•0 comments

SpaceX offers details on orbital data center satellites

https://spacenews.com/spacex-offers-details-on-orbital-data-center-satellites/
1•MrBuddyCasino•34m ago•0 comments

Show HN: I created an app to copy OTP from Google Voice to your macOS Clipboard

https://github.com/ptrinh/Notiful
1•ptrinh•40m ago•0 comments

iPhone almost like a birth control device, fertility rates falling after 2007

https://www.indiatoday.in/technology/news/story/iphone-almost-like-a-birth-control-device-fertili...
1•rustoo•42m ago•0 comments

Ask HN: Do you need go-to-market strategy at early stage?

1•2ero_wf•47m ago•0 comments

Built to benefit everyone: our plan By Sam Altman and Jakub Pachocki

https://openai.com/index/built-to-benefit-everyone-our-plan/
1•echan00•50m ago•1 comments

Show HN: Clawcall – give your self-hosted OpenClaw agent inbound phone calls

https://github.com/CODEANDTRUST/clawcall
1•pakbry•52m ago•0 comments

L'Affaire Siloxane

https://mceglowski.substack.com/p/laffaire-siloxane
1•idlewords•53m ago•0 comments

Make Something Wonderful

https://joshuawold.com/make-something-wonderful/
1•ethanplant•59m ago•0 comments

Vulnerability and malware checks in UV: uv audit, malware check in uv add, sync

https://astral.sh/blog/uv-audit
3•Terretta•1h ago•1 comments

OxyJen v0.5: a deterministic graph runtime for AI workflows

https://github.com/11divyansh/OxyJen
1•bdivyansh11•1h ago•0 comments

The Capability Curve Has No Memory

https://medium.com/@vektormemory/the-capability-curve-has-no-memory-7c5fe5cde09f
1•vektormemory•1h ago•1 comments

ThumbLoop: Thumbnails Which Get Clicks

https://loop-tube.com/blog/how-to-make-youtube-thumbnails
1•yashness•1h ago•0 comments

Apple Investors Give Lukewarm Reaction to New Siri, AI Platform

https://www.bloomberg.com/news/articles/2026-06-08/apple-unveils-next-generation-of-ai-platform-i...
1•petethomas•1h ago•0 comments

Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon

https://tridao.me/blog/2026/gram-newton-schulz/
2•jxmorris12•1h ago•0 comments

Siri AI at WWDC 2026

https://simonwillison.net/2026/Jun/8/wwdc/
2•lumpa•1h ago•0 comments

I built a free car lease transfer marketplace after the paid ones burned me

https://www.trademylease.com
2•mknweb•1h ago•0 comments

CRDTs merge concurrent edits. Why not concurrent creation?

https://loro.dev/blog/mergeable-containers
4•czx111331•1h ago•0 comments

OpenLTM – Local, self-decaying memory for AI coding agents

https://github.com/RohiRIK/OpenLtm
2•RohiRik•1h ago•0 comments

What Apple Knows About AI That Silicon Valley Won't Admit

https://www.thealgorithmicbridge.com/p/what-apple-knows-about-ai-that-silicon
5•CharlesW•1h ago•3 comments