frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•4mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

DNA cassette tape can store every song ever recorded

https://www.newscientist.com/article/2495758-dna-cassette-tape-can-store-every-song-ever-recorded/
1•Maf1•1m ago•0 comments

IETF Draft: Authenticated Transfer Repo and Sync Specification

https://www.ietf.org/archive/id/draft-holmgren-at-repository-00.txt
1•diggan•1m ago•1 comments

Is AI video the future of storytelling, or just the end of stories?

1•jamessmithe•4m ago•0 comments

Lesser-Known but C Language Facts That Might Surprise You

1•whyandgrowth•7m ago•0 comments

Cheese cave fungi reveal how genetic mutations drive rapid evolutionary change

https://phys.org/news/2025-09-cheese-cave-fungi-reveal-genetic.html
1•pseudolus•7m ago•0 comments

The Startup Killer Nobody Talks About: Domain Negotiations

https://www.brandhunt.com/
1•brandhunt•11m ago•1 comments

Meta-abstraction in the physical and social sciences (2021)

http://edwardfeser.blogspot.com/2021/03/meta-abstraction-in-physical-and-social.html
1•danielam•14m ago•0 comments

For the climate, little things don't add up

https://andymasley.substack.com/p/for-the-climate-little-things-dont
1•NavinF•14m ago•0 comments

Carbon emissions from oil giants directly linked to deadly heatwaves

https://www.theguardian.com/environment/2025/sep/10/link-oil-giants-heatwaves-research-legal-liab...
2•Anon84•15m ago•0 comments

Evorca: Fast and Minimal PlmDCA in Jax

https://github.com/suzuki-2001/evorca
1•ss-13•17m ago•1 comments

Algebraic Types are not Scary

https://blog.aiono.dev/posts/algebraic-types-are-not-scary,-actually.html
2•Bogdanp•29m ago•0 comments

A Technical Analysis on the Chinese Great Firewall [pdf]

https://interseclab.org/wp-content/uploads/2025/09/The-Internet-Coup_September2025.pdf
1•zdkaster•33m ago•0 comments

I Am Trapped in Insta-Purgatory with No Recourse

http://blog.mjb.me.uk/2025/09/i-am-trapped-in-insta-purgatory-with-no.html
4•mjb8086•33m ago•1 comments

Free Image Composer for Online News and Social Media OG Image

https://imgcomposer.com/
2•bruceduk•35m ago•0 comments

Register your Web3 Identity in Blockchain world

https://invite.mec.me/en-US?type=download&code=zebt0svb
1•metaearth•41m ago•1 comments

LLM rerankers for production RAG: tips and tricks

https://fin.ai/research/using-llms-as-a-reranker-for-rag-a-practical-guide/
4•mathcircler•41m ago•0 comments

'You're going about your day and suddenly see a little Godzilla'

https://www.theguardian.com/environment/2025/sep/15/little-godzilla-bangkok-reckons-with-its-gian...
2•n1b0m•42m ago•0 comments

The Free Dividends Fallacy

https://www.bloomberg.com/graphics/2025-gen-z-dividend-investing-etfs
1•basilesimon•43m ago•0 comments

I Put ChatGPT into Jail and Let Him Code Anyays

https://www.indiehackers.com/post/why-i-put-chatgpt-into-jail-and-let-him-code-anyays-4e2b915f15
2•scrumbuddy_ai•45m ago•1 comments

Boss of degrading sex-trade ring in Dubai's glamour districts unmasked by BBC

https://www.bbc.com/news/articles/cx2r9y3kxy9o
3•Teever•48m ago•0 comments

ROMA: Meta-agents with task decomposition, backed by benchmark wins

https://github.com/sentient-agi/ROMA
2•mustaphah•49m ago•0 comments

Coders End, from Typers to Thinkers

https://etsd.tech/posts/coders-end/
1•elieteyssedou•49m ago•0 comments

Ask HN: What's new in operating systems these days?

4•yu3zhou4•52m ago•1 comments

D. B. Cooper

https://en.wikipedia.org/wiki/D._B._Cooper
2•thunderbong•52m ago•0 comments

Perl 7 FAQ

https://gist.github.com/Grinnz/be5db6b1d54b22d8e21c975d68d7a54f
3•TheWiggles•53m ago•1 comments

Ashen-wow, pure Vanilla World of Warcraft server

https://ashen-wow.space/
1•flgue•53m ago•1 comments

Anthropic Economic Index: Tracking AI's Role in the US and Global Economy

https://www.anthropic.com/research/economic-index-geography
3•perks_12•54m ago•0 comments

Detecting click fraud with only 1px

https://www.tirreno.com/bat/?post=2025-09-15
1•reconnecting•55m ago•0 comments

RustGPT: A pure-Rust transformer LLM built from scratch

https://github.com/tekaratzas/RustGPT
60•amazonhut•57m ago•17 comments

Amish Men Live Longer

https://plainanabaptistjournal.org/article/id/6590/
4•johntfella•58m ago•0 comments