frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•11mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Fully Featured Audio DSP Firmware for the Raspberry Pi Pico

https://github.com/WeebLabs/DSPi
1•BoingBoomTschak•1m ago•1 comments

EgoNet: A Peer-to-Peer Digital Existence System – In Homage to Satoshi Nakamoto

https://zenodo.org/records/19633431
1•R_Horiguchi•2m ago•0 comments

Autonomous weapons are a game-changer

https://www.economist.com/special-report/2018/01/25/autonomous-weapons-are-a-game-changer
2•andsoitis•3m ago•0 comments

In Search of the Missing Artist

https://www.abc.net.au/news/2026-04-25/in-search-of-the-missing-artist-jean-paul-mangin/106593220
1•colinprince•6m ago•0 comments

NASA Releases Powerful LAVA Software to US Aerospace Industry

https://www.nasa.gov/aeronautics/nasa-releases-powerful-lava-software-to-us-aerospace-industry/
1•happy-go-lucky•7m ago•0 comments

The Benchmark Gap: 1,472 runs show coding-agent context changes outcomes

https://github.com/dorukardahan/benchmark-gap
1•dorukardahan•7m ago•1 comments

A New Type of Neuroplasticity Rewires the Brain After a Single Experience

https://www.quantamagazine.org/a-new-type-of-neuroplasticity-rewires-the-brain-after-a-single-exp...
1•Brajeshwar•8m ago•0 comments

Ask HN: Anyone managed to get Google trends API?

1•visox•9m ago•0 comments

WHO approves first Malaria drug for babies

https://www.who.int/news/item/24-04-2026-who-prequalifies-first-ever-malaria-treatment-for-newbor...
1•tchalla•16m ago•0 comments

FaceX – Face embedding in 3ms with handwritten C and AVX2,no dependencies

https://github.com/facex-engine/facex
1•bauratynov•18m ago•0 comments

HEALPix

https://en.wikipedia.org/wiki/HEALPix
1•hyperific•18m ago•0 comments

Under Blackout Threat, Wikimedia Reaches Compromise with Indonesia

https://www.barrons.com/news/under-blackout-threat-wikimedia-reaches-compromise-with-indonesia-e3...
4•exploraz•19m ago•1 comments

Scam Quantum Fud Headlines

https://decrypt.co/365444/bitcoin-q-day-draws-nearer-quantum-researcher-breaks-simplified-key
2•zeptonix•19m ago•1 comments

Show HN: A games website with a swipe feature like TikTok

https://www.brainafkgames.online/
2•sambex•22m ago•0 comments

Subvert's AI Policy

https://subvert.fm/ai-policy/
3•simonpure•22m ago•0 comments

Can Google Win the AI Hardware Race Through TPUs?

https://google-ai-race.pagey.site/
3•freakynit•24m ago•0 comments

Show HN: Limen – modern, composable auth for Go

https://limenauth.dev/blog/introducing-limen
3•brianiyoha•28m ago•0 comments

The output doesn't matter: Thoughts on Aristotle's Craftsmen in the age of LLMs

https://lambdacreate.com/posts/the-output-doesnt-matter
7•durrendal•30m ago•0 comments

Berkshire attracts interest as it slips further behind the S&P 500

https://www.cnbc.com/2026/04/25/berkshire-attracts-interest-as-it-slips-further-behind-the-sp-500...
5•episec•31m ago•0 comments

Jailbreaking a robot vacuum to run Tailscale and Valetudo

https://tailscale.com/blog/tailscale-sucks
7•theorchid•34m ago•0 comments

Polaroid's Showman (2023)

https://www.cabinetmagazine.org/kiosk/allen_jonathan_isenbart_jan_03_march_2023.php
6•XzetaU8•37m ago•0 comments

The World's Most Complex Machine

https://worksinprogress.co/issue/the-worlds-most-complex-machine/
5•mellosouls•42m ago•0 comments

Behind-the-Scenes of MacBook Neo Introduction Video

https://www.youtube.com/shorts/y4DnsCzJTRQ
60•0xbg•44m ago•2 comments

2× – nine months later: We did it

https://ideas.fin.ai/p/2x-nine-months-later
5•jamesblonde•45m ago•0 comments

The Tsundoku Trap: Why AI Makes You Start Everything and Finish Nothing

https://blog.danielvaughan.com/the-tsundoku-trap-why-ai-makes-you-start-everything-and-finish-not...
10•dvaughan•46m ago•0 comments

Building Code.overheid.nl Together

https://developer.overheid.nl/blog/2026/04/24/we-gaan-samen-code-overheid-bouwen
6•doughnutstracks•47m ago•0 comments

ALDI Eliminating an Additional 44 Ingredients – ALDI US

https://corporate.aldi.us/newsroom/news/aldi-eliminating-an-additional-44-ingredients
5•bilsbie•48m ago•0 comments

Show HN: Memweave CLI – search your AI agent's memory from the shell

https://github.com/sachinsharma9780/memweave
5•r2d2_•49m ago•0 comments

Civic-SLM is a domain-specialized fine-tune of Qwen2.5-7B for U.S. govt data

https://itsmeduncan.com/civic-slm/
7•itsmeduncan•51m ago•0 comments

Rotating Space Habitats

https://blog.engora.com/2026/04/rotating-space-habs.html
4•Vermin2000•51m ago•1 comments