frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Linux Eliminates the Strncpy API After Six Years of Work, 360 Patches

https://www.phoronix.com/news/Linux-7.2-Drops-strncpy
1•simonpure•1m ago•0 comments

You're Accountable for the Team. You're Not in Charge of It

https://yourdevteamcoach.com/blog/youre-accountable-for-the-team-youre-not-in-charge-of-it
1•sea-gold•3m ago•1 comments

The UK will scan asylum seekers' faces for age checks

https://www.wired.com/story/facial-age-estimate-uk-asylum-seekers/
2•Lihh27•8m ago•0 comments

Trump says he no longer views Anthropic as a threat after G7 meeting

https://thenextweb.com/news/trump-anthropic-not-national-security-threat-axios-interview
8•billybuckwheat•10m ago•0 comments

Incorrectly Generated RSA Keys: How I Learned to Recover Lost Plaintexts

https://academic.oup.com/comjnl/article-abstract/66/6/1342/6995423?redirectedFrom=fulltext
1•rbanffy•11m ago•0 comments

Seeing the world in radio waves with the QuadRF

https://hackaday.com/2026/06/20/seeing-the-world-in-radio-waves-with-the-quadrf/
3•ikbdsk•13m ago•0 comments

RFC 9958: Post-Quantum Cryptography for Engineers

https://datatracker.ietf.org/doc/html/rfc9958
2•hasheddan•13m ago•0 comments

Solo Founder Sales Playbook –sales tools for technical founders who hate selling

https://getsalesspark.com/
1•erichensley•14m ago•0 comments

Workflow builder built with just Sandboxes

https://twitter.com/karatzas_thomas/status/2061840456992919997
1•szaneer•14m ago•0 comments

Pulse – a local dashboard for Claude Code, approve tool calls from your phone

https://github.com/nikitadoudikov/claude-pulse
1•nikitadvd•14m ago•0 comments

MosaicLeaks: Can your research agent keep a secret?

https://huggingface.co/blog/ServiceNow/mosaicleaks
1•gmays•14m ago•0 comments

The Tech Billionaire Plan to Destroy Democracy – GIL Duran – TMR [video]

https://www.youtube.com/watch?v=stPijjCneXM
2•wturner•16m ago•0 comments

'Helmsniff'; a Helm Security Scanner

https://github.com/VahidR/helmsniff
1•vahid_r•18m ago•0 comments

Why the EU rewrote its landmark AI law

https://www.theparliamentmagazine.eu/news/article/why-the-eu-rewroteits-landmark-ai-law
1•thinkingemote•19m ago•0 comments

UK Home Office launches £75M 'PoliceAI' to capitalise on artificial intelligence

https://www.publictechnology.net/2026/06/15/public-order-justice-and-rights/home-office-launches-...
3•thinkingemote•20m ago•0 comments

Scaling a Monolith to 1M LOC: 113 Pragmatic Lessons

https://www.semicolonandsons.com/articles/scaling-a-monolith-to-1m-loc-113-pragmatic-lessons
1•jackkinsella•24m ago•0 comments

Show HN: Amiqo – a private app to text the friends you're drifting from

https://amiqo.life/
1•kyle11•24m ago•0 comments

GPUs and RAM Are in Short Supply, but the Real Bottleneck for AI Is Electricians

https://www.nextplatform.com/compute/2026/05/28/gpus-and-ram-are-in-short-supply-but-the-real-bot...
1•Gooblebrai•26m ago•0 comments

Ask HN: Do you use Claude Code, Codex, or something else?

3•JohnDSDev•26m ago•1 comments

Alice. Alice Is Impatient

https://brooker.co.za/blog/2026/06/19/waiting.html
2•birdculture•28m ago•0 comments

Cooren – I turned my family's dinner-voting app into a coordination API

https://github.com/McLeod-Interactive-Group-LLC/cooren-api
1•smac-mig•29m ago•0 comments

Agent-Native Code Hosting

https://gitlawb.com/
1•panikadak•30m ago•0 comments

Europe's Making Fewer Cars and Lots of Them Are Chinese

https://www.bloomberg.com/news/features/2026-06-19/stellantis-volkswagen-eye-risky-partnerships-w...
2•tchalla•30m ago•0 comments

World Cup tourists aren't leaving tips – and NYC restaurants are fighting back

https://nypost.com/2026/06/20/us-news/world-cup-tourists-arent-leaving-tips-and-nyc-restaurants-a...
2•donsupreme•30m ago•3 comments

AutoJack: A single page can RCE the host running your AI agent

https://www.microsoft.com/en-us/security/blog/2026/06/18/autojack-single-page-rce-host-running-ai...
4•p_stuart82•31m ago•0 comments

A Love Story

https://pudding.cool/2026/06/love-story/
1•simonebrunozzi•31m ago•1 comments

StoryScope: Investigating Idiosyncrasies in AI Fiction

https://arxiv.org/abs/2604.03136
2•amai•31m ago•0 comments

Show HN: Internal agent systems like Ramp Inspect for your company

https://brainbaselabs.com
1•egrigokhan•32m ago•0 comments

Show HN: OpenSpend – Invoicing for creators, popup shops and small businesses

https://openspend.riamu.io/
1•openspend•42m ago•0 comments

I built an offline tool to stabilize TV audio because nothing else worked

https://github.com/AdBusterOfficial/Adbuster--WinApp
2•Bo_Amigo_910•42m ago•0 comments