Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
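For readers who want to see the scroll step in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation: the function name, the callback parameters, and the early-exit `patience` rule are all assumptions added for illustration. Only the roughly 150 iterations and the idea of a delay between scrolls come from the description above.

```python
import time
from typing import Callable


def progressive_scroll(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_iterations: int = 150,      # "about 150 times" per the post
    delay_seconds: float = 0.5,     # assumed value; the post only says "calibrated"
    patience: int = 5,              # assumption: stop early after this many stalled rounds
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll repeatedly, pausing after each scroll so new items can load.

    Returns the number of items visible when the loop ends.
    """
    last_count = count_items()
    stalled = 0
    for _ in range(max_iterations):
        scroll_once()
        sleep(delay_seconds)  # give the page time to fetch and render the next batch
        current = count_items()
        if current > last_count:
            last_count = current
            stalled = 0
        else:
            stalled += 1
            if stalled >= patience:
                break  # nothing new has loaded for several rounds
    return last_count
```

With a browser driver such as Selenium, `scroll_once` could wrap something like `driver.execute_script("window.scrollTo(0, document.body.scrollHeight)")` and `count_items` could count the matched listing elements. Both wrappers are left out here so the sketch stays runnable without a browser.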

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
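One common way to cope with listings whose DOM structure varies is to try several XPath patterns per field and take the first one that matches. The sketch below shows that idea with Python's standard library only. The sample markup, class names, and field paths are hypothetical and do not reflect YC's real page structure, and `xml.etree.ElementTree` supports only a subset of XPath, so a real browser-based tool would have more expressive selectors available.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical markup: the second listing nests its name one level deeper
# and has no location, mimicking entries that vary slightly in structure.
SAMPLE = """<div>
  <a class="company"><span class="name">Acme</span><span class="desc">Rockets</span><span class="loc">Austin, TX</span></a>
  <a class="company"><div class="hdr"><span class="name">Globex</span></div><span class="desc">Energy</span></a>
</div>"""

# For each output column, XPath patterns to try in order (all assumed).
FIELDS = {
    "name": ["./span[@class='name']", ".//span[@class='name']"],
    "description": ["./span[@class='desc']", ".//span[@class='desc']"],
    "location": ["./span[@class='loc']", ".//span[@class='loc']"],
}


def first_match(node, paths):
    """Return the stripped text of the first pattern that matches, else ''."""
    for path in paths:
        text = node.findtext(path)
        if text and text.strip():
            return text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company listing found in the markup."""
    root = ET.fromstring(markup)
    return [
        {field: first_match(card, paths) for field, paths in FIELDS.items()}
        for card in root.iterfind(".//a[@class='company']")
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(FIELDS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE)))
```

Missing fields come out as empty strings rather than raising an error, which keeps every CSV row the same width even when a listing lacks a location.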

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
