Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•10mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
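
For anyone who wants to see the scroll step as code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation. The `scroll_once` callback is a hypothetical stand-in for whatever the browser driver does (scroll to the bottom, then count the loaded listings), and the early exit when the count stops growing is an added assumption, not something described in the steps above.

```python
import time
from typing import Callable


def progressive_scroll(
    scroll_once: Callable[[], int],
    max_iterations: int = 150,   # cap taken from the "about 150 times" above
    delay_seconds: float = 0.5,  # assumed delay; tune per site
    stable_rounds: int = 3,      # assumed: stop after this many rounds with no growth
) -> int:
    """Scroll until the item count stops growing or the cap is reached.

    scroll_once is a hypothetical callback that scrolls the page once and
    returns how many items are currently loaded.
    """
    last_count = 0
    unchanged = 0
    for _ in range(max_iterations):
        count = scroll_once()
        if count == last_count:
            unchanged += 1
            if unchanged >= stable_rounds:
                break  # nothing new loaded for several rounds in a row
        else:
            unchanged = 0
            last_count = count
        time.sleep(delay_seconds)  # give lazy-loaded content time to arrive
    return last_count
```

With a real driver, `scroll_once` would wrap the scroll call and a count of matching listing elements. Passing it in as a callback keeps the loop testable without a browser.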

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
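
To illustrate the varying-DOM problem, here is a small Python sketch that tries several XPath patterns per field and writes the result as CSV. The markup, class names, and patterns are all invented for the example and are not YC's actual DOM. It uses the standard library's `xml.etree.ElementTree`, which supports only a subset of XPath and needs well-formed markup, so a real page would call for a proper HTML parser.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Invented sample markup: the second entry nests the name and has no location.
SAMPLE = """
<div>
  <a class="company">
    <span class="name">Acme</span>
    <span class="desc">Rockets</span>
    <span class="loc">Austin, TX</span>
  </a>
  <a class="company">
    <div class="header"><span class="name">Globex</span></div>
    <span class="desc">Energy</span>
  </a>
</div>
"""

# Hypothetical patterns, tried in order until one matches.
FIELDS = {
    "name": ["./span[@class='name']", "./div/span[@class='name']"],
    "description": ["./span[@class='desc']"],
    "location": ["./span[@class='loc']"],
}


def first_match(node, patterns):
    """Return the text of the first pattern that matches, else an empty string."""
    for pattern in patterns:
        found = node.find(pattern)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company listing found in the markup."""
    root = ET.fromstring(markup.strip())
    return [
        {field: first_match(node, patterns) for field, patterns in FIELDS.items()}
        for node in root.findall(".//a[@class='company']")
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELDS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE)))
```

Missing fields come back as empty strings rather than raising, so one oddly structured entry does not abort the whole run.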

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
