frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•9mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Couldn't find a minimal session indicator for tmux, so I built one

https://github.com/jtmcginty/tmux-session-dots
1•jtmcginty•5m ago•1 comments

AP brass to staff: Resistance to AI is 'futile'

https://www.semafor.com/article/03/03/2026/ap-brass-to-staff-resistance-to-ai-is-futile
2•healsdata•5m ago•0 comments

Speculative Speculative Decoding (SSD)

https://arxiv.org/abs/2603.03251
1•E-Reverance•7m ago•0 comments

Just Send the Prompt

https://justsendtheprompt.com/
28•medalblue•17m ago•15 comments

Show HN: My AI hires humans for physical tasks, calls them biological units

https://crawde.com/hire
2•crawde•18m ago•1 comments

Extending the Demo: Destruction Derby

https://32bits.substack.com/p/extending-the-demo-destruction-derby
1•farmerbb•19m ago•0 comments

News Corp, Meta in AI Content Licensing Deal Worth Up to $50M a Year

https://www.wsj.com/business/media/news-corp-meta-in-ai-content-licensing-deal-worth-up-to-50-mil...
1•bookofjoe•20m ago•2 comments

Show HN: A visual sitemap generator that maps any website's full navigation tree

2•epic_ai•20m ago•1 comments

CIA working to arm Kurdish forces to spark uprising in Iran, sources say

https://www.cnn.com/2026/03/03/politics/cia-arming-kurds-iran
8•vinnyglennon•25m ago•0 comments

MoneyCollect provides warm and human-centered payment protection for merchants

https://www.moneycollect.com/
1•jerryoverwrite•26m ago•0 comments

Current state of OpenClaw and bot protections

https://proxybase.xyz/blog/openclaw-bot-protections
1•m00dy•26m ago•0 comments

LLMs can unmask pseudonymous users at scale with surprising accuracy

https://arstechnica.com/security/2026/03/llms-can-unmask-pseudonymous-users-at-scale-with-surpris...
6•Gagarin1917•31m ago•0 comments

LexisNexis confirms React2Shell powered data breach

https://www.bleepingcomputer.com/news/security/lexisnexis-confirms-data-breach-as-hackers-leak-st...
4•esaym•32m ago•0 comments

New Python library by Guido van Rossum

https://github.com/microsoft/typeagent-py
1•tzury•33m ago•1 comments

YGG, Largest French Torrent Tracker, Hacked

https://yggleak.top/fr/home/ygg-dossier
1•TechSquidTV•34m ago•0 comments

Stop Trying to Hand Trump a Censorship Weapon

https://www.techdirt.com/2026/03/03/ron-wyden-is-begging-his-colleagues-to-stop-trying-to-hand-tr...
5•HotGarbage•34m ago•0 comments

We are now on Substack Give us a follow

https://substack.com/profile/387803331-pcg-inc/note/c-222710211
1•CCK80LLC•47m ago•0 comments

An AI Just Did Everything I Do on a Computer – Written by the AI Itself

https://coasty.ai/
1•PrateekJ17•48m ago•1 comments

Mac external displays for designers and developers, part 2

https://bjango.com/articles/macexternaldisplays2/
10•fragmede•48m ago•3 comments

New Launch Workshops and Masterclasses

https://www.pretium-inc.com/workshops
1•CCK80LLC•49m ago•0 comments

Oscar Six Radar – vulnerability scanner with native A2A (agent-to-agent) support

1•oscarsixsecllc•51m ago•0 comments

Show HN: Lip Flip Before and After – Real Results and AI Preview

https://lipflip.ai/
1•ovelv•53m ago•0 comments

Yes... and...

https://htmx.org/essays/yes-and/
2•rammy1234•54m ago•1 comments

Show HN: Term-CLI – interactive terminals for AI agents (for SSH/TUI/REPL flows)

https://github.com/EliasOenal/term-cli
3•eliasoe•54m ago•0 comments

The secret green shelters that feed London's cabbies (2018)

https://www.bbc.com/travel/article/20180430-the-secret-green-shelters-that-feed-londons-cabbies
2•1659447091•55m ago•0 comments

Show HN: Hanaco Garden – A Calm iOS Garden

https://apps.apple.com/us/app/hanaco-garden/id6759095190
3•tsuyoshi_k•57m ago•1 comments

Number Research Inc

https://numberresearch.xyz/
2•eieio•57m ago•0 comments

Show HN: Docker pulls more than it needs to

https://dockerpull.com
2•a_t48•1h ago•3 comments

Show HN: Schelling Protocol – Where AI agents coordinate on behalf of humans

https://github.com/codyz123/schelling-protocol
2•codyz123•1h ago•1 comments

We built high speed threat hunting for email security

https://sublime.security/blog/how-we-built-high-speed-threat-hunting-for-email-security/
2•jkamdjou•1h ago•0 comments