frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•8mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

What's on HTTP?

https://whatsonhttp.com/
1•elixx•1m ago•0 comments

Tumblr removed from Apple App Store over abuse images

https://www.bbc.com/news/technology-46275138
4•dmschulman•6m ago•0 comments

NASA ends space mission early due to astronaut medical condition

https://www.bbc.com/news/articles/cd9e2y7nkv8o
1•DarkContinent•10m ago•0 comments

Jane Street's Ron Minsky on the Future of Programming (2023)

https://signalsandthreads.com/future-of-programming/
2•weinzierl•13m ago•0 comments

Iran Goes Dark as Government Cuts Itself Off from Internet

https://www.kentik.com/analysis/iran-goes-dark-as-government-cuts-itself-off-from-internet/
1•m-hodges•14m ago•0 comments

Scientists Create Robots Smaller Than a Grain of Sand

https://www.wsj.com/science/scientists-create-robots-smaller-than-a-grain-of-sand-c3081fd0
1•Bostonian•14m ago•1 comments

Securely sending query parameters in HTTP headers

https://github.com/dickhardt/redirect-headers
1•mooreds•16m ago•0 comments

Waymo getting a ticket. It drove off with the ticket on the windshield

https://old.reddit.com/r/Austin/comments/1q7t4e4/waymo_getting_a_ticket_while_i_was_inside_it/
2•m-hodges•18m ago•0 comments

iOS 26 Shows Unusually Slow Adoption Months After Release

https://www.macrumors.com/2026/01/08/ios-26-shows-unusually-slow-adoption/
3•latexr•22m ago•3 comments

Study casts doubt on potential for life on Europa

https://www.reuters.com/science/study-casts-doubt-potential-life-jupiters-moon-europa-2026-01-06/
2•paulpauper•23m ago•0 comments

AI #150: While Claude Codes

https://thezvi.substack.com/p/ai-150-while-claude-codes
1•paulpauper•24m ago•0 comments

Vegetarians, spam, spite programming, and drug names

https://dynomight.substack.com/p/shorts-7
1•paulpauper•25m ago•0 comments

My Daily Lesson in Hacker News Etiquette

1•jannesblobel•29m ago•1 comments

OrbitHQ turns SEO audits and analytics into actionable tasks

https://tryorbithq.com/
1•astralshard•29m ago•1 comments

Valve: Linux hit another all-time high

https://www.gamingonlinux.com/2026/01/valve-amended-the-steam-survey-for-december-2025-linux-actu...
2•sergiotapia•31m ago•0 comments

ChatGPT for Healthcare

https://openai.com/index/openai-for-healthcare
1•tylerrobinson•33m ago•1 comments

Functional programming at the type level in TypeScript

https://github.com/gvergnaud/hotscript
1•RyanZhuuuu•33m ago•0 comments

Who Was Caroline Haslett?

https://www.bbc.co.uk/bitesize/articles/z3rxm39
1•susam•35m ago•0 comments

Effect Institute

https://www.effect.institute/
1•handfuloflight•36m ago•0 comments

Show HN: Legit, Open source Git-based Version control for AI agents

3•jannesblobel•44m ago•0 comments

Canadian statutory severance and termination pay calculator

https://canadaemploymentrules.ca/
1•cerdotca•45m ago•1 comments

Why Are Grok and X Still Available in App Stores?

https://www.wired.com/story/x-grok-app-store-nudify-csam-apple-google-content-moderation/
16•alwillis•46m ago•14 comments

Job postings evaluator against your resume (Chrome extension)

https://github.com/alikh31/job-ad-evaluator
1•alikhoramshahi•47m ago•0 comments

I built an AI agent that deploys a PR to production

2•amouehsan•48m ago•0 comments

Non-Traditional Profiling: "you can just put whatever you want in a jitdump"

https://www.mgaudet.ca/technical/2026/1/8/non-traditional-profiling
1•matt_d•49m ago•0 comments

Running a real consumer app on a 70B LLM at sub-cent cost per scan

https://www.cornstarch.ai/
1•rs1996•50m ago•1 comments

The Shaggs

https://en.wikipedia.org/wiki/The_Shaggs
5•jethronethro•50m ago•0 comments

NBA's new AI stat measures defensive gravity

https://www.nba.com/news/intro-to-gravity-stat-nba-2025
1•cyr0dj0hn•51m ago•0 comments

Reflection-Driven Control for Trustworthy Code Agents

https://arxiv.org/abs/2512.21354
1•PaulHoule•52m ago•0 comments

I built a fake chat generator in 18 hours because the existing ones all suck

https://messagesy.xyz/
1•hristoff•52m ago•1 comments