frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•11mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Original Volkov Commander Sources

https://github.com/ddanila/vc
1•begoon•1m ago•0 comments

Mapping Notable Geo-Triangles

https://nickscip.xyz/projects/triangles/
1•nickscip•1m ago•0 comments

Open source desktop app for studying online courses

https://github.com/tonhowtf/omniget
1•axiomdata316•4m ago•0 comments

Show HN: Security Scanner for Agent Skills and MCP

https://github.com/snyk/agent-scan
1•lirantal•4m ago•0 comments

Lyra 2.0: Explorable Generative 3D Worlds

https://research.nvidia.com/labs/sil/projects/lyra2/
1•jonbaer•5m ago•1 comments

Past Ferrari Models, 1947–2023

https://www.ferrari.com/en-US/auto/past-model
1•NaOH•5m ago•0 comments

Show HN: A Privacy tool that finds and hides sensitive data in phtots/videos

https://www.scanon.ai/
1•lotuslabs•6m ago•0 comments

A GitHub for Maintainers

https://nesbitt.io/2026/05/02/a-github-for-maintainers.html
1•milkglass•6m ago•1 comments

Public Runtime for Convera for LLM's

https://github.com/cjparadise79/CONVERA-PUBLIC
1•cjparadise•8m ago•1 comments

Ableton Live MCP

https://github.com/bschoepke/ableton-live-mcp
1•bschoepke•9m ago•0 comments

MCP-x-Mac-Seed – An AI agent that discovers Mac apps and writes its own tools

https://github.com/reverendish/mcp-x-mac-seed
1•ishsitotombe•11m ago•0 comments

BYOMesh – New LoRa mesh radio offers 100x the bandwidth

https://partyon.xyz/@nullagent/116499715071759135
1•nullagent•11m ago•0 comments

What Happens When the World Is Fatherless

https://elaynekalila.substack.com/p/what-happens-when-the-world-is-fatherless
1•rendx•13m ago•0 comments

Show HN: Orchestrate Dockerized Claude Code sessions from your issue tracker

https://github.com/smithy-ai/smithy-ai
1•t0mas88•16m ago•0 comments

Should You Be a Carpenter? [video]

https://www.youtube.com/watch?v=RJyPVLMyyuA
1•DeathArrow•16m ago•0 comments

Caisi Evaluation of DeepSeek V4 Pro

https://www.nist.gov/news-events/news/2026/05/caisi-evaluation-deepseek-v4-pro
1•chvid•18m ago•0 comments

The clients you didn't know you lost

https://techlex.net/the-clients-you-didnt-know-you-lost/
1•basket278•20m ago•0 comments

A Dark-Money Campaign Is Paying Influencers to Frame Chinese AI as a Threat

https://www.wired.com/story/super-pac-backed-by-openai-and-palantir-is-paying-tiktok-influencers-...
2•chvid•20m ago•1 comments

LLMs Are Not a Higher Level of Abstraction

https://www.lelanthran.com/chap15/content.html
2•lelanthran•23m ago•0 comments

A framework agnostic platform to manage local agents from your phone

https://onepilotapp.com
2•elearia•26m ago•0 comments

Musk spars with OpenAI atty in trial over OpenAI's evolution from a nonprofit

https://apnews.com/article/musk-altman-openai-nonprofit-trial-bdbe85d62c2b678458fe68148eb6fba5
2•1vuio0pswjnm7•26m ago•1 comments

We Caught Prompt Security Leaking API Keys

https://www.youtube.com/watch?v=cZLdWtcSE04
1•acorn221•26m ago•0 comments

I Recreated the Apple Lisa Computer Inside an FPGA – The LisaFPGA Project

https://www.youtube.com/watch?v=8jNQDcpHc68
4•cyrc•29m ago•0 comments

Questions of US interventionism as 25story Juárez surveillance tower scrutinized

https://english.elpais.com/international/2026-05-03/amid-questions-of-us-interventionism-in-mexic...
2•c420•31m ago•0 comments

FCC votes to ban all Chinese labs from certifying electronics sold in the US

https://www.tomshardware.com/tech-industry/fcc-votes-to-ban-all-chinese-labs-from-certifying-elec...
3•jonbaer•33m ago•3 comments

Elon Musk Says AI 'Smarter Than Humans' Next Year During OpenAI Testimony

https://www.newsweek.com/elon-musk-vs-sam-altman-feud-explained-as-openai-trial-begins-11886815
3•1vuio0pswjnm7•33m ago•2 comments

PHP King Extension and KingRT Video Call App

https://kingrt.com/
1•bold_iggl•34m ago•1 comments

Space War

http://cleancoder.com/space-war
2•evo_9•35m ago•0 comments

Collaborative Editing in CodeMirror

https://marijnhaverbeke.nl/blog/collaborative-editing-cm.html
2•luu•35m ago•0 comments

Show HN: Local semantic memory for coding agents

https://github.com/Chadi00/thr
1•chadiiek•37m ago•0 comments