frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•9mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Show HN: ZeroSum – Zero-base budgeting $40/yr(Got tired of paying $109 for YNAB)

https://zerosum.so/
1•fidalgodev•55s ago•0 comments

Show HN: Notaly – A note app where you don't organize, you query

1•vajafafa•1m ago•0 comments

I Regret to Inform You That the FDA Is FDAing Again

https://marginalrevolution.com/marginalrevolution/2026/02/i-regret-to-inform-you-that-the-fda-is-...
1•asplake•2m ago•0 comments

"Nothing" Releases Playground for AI Generated Apps

https://playground.nothing.tech/
1•lovlar•2m ago•0 comments

Biases in the Blind Spot: Detecting What LLMs Fail to Mention

https://arxiv.org/abs/2602.10117
1•jari_mustonen•2m ago•0 comments

Show HN: MCP server for generating images directly in Claude Code

https://github.com/maheshcr/image-gen-mcp
1•maheshcr•2m ago•0 comments

Algorithmic Trading with VectorBT and Lumibot

https://www.cbrincoveanu.com/notes/algorithmic-trading-with-vectorbt-and-lumibot/
1•cbrincoveanu•3m ago•0 comments

Ask HN: Why do so many people on HN say LLMs aren't "artificial intelligence"

1•aurareturn•3m ago•0 comments

Anthropic promises to pay for electricity price increases due to data centers

https://www.tomshardware.com/tech-industry/artificial-intelligence/anthropic-promises-to-pay-for-...
1•speckx•3m ago•0 comments

Evaluation of RAG Architectures for Policy Document Question Answering

https://arxiv.org/abs/2601.15457
1•PaulHoule•5m ago•0 comments

Commet – Matrix Client

https://commet.chat/
1•todsacerdoti•6m ago•0 comments

AI isn't coming for your future. Fear is

https://twitter.com/cboyack/status/2021647373571862952
1•mhb•6m ago•0 comments

Will marketing be the most important future hire?

https://chiefting.substack.com/p/will-marketing-be-the-most-important
1•mpraz•6m ago•0 comments

The New CSS

https://thenewcss.com/
1•mihailshumilov•7m ago•1 comments

Show HN: QuickGitHub - Instant AI docs for any GitHub repo

https://quickgithub.com/
1•stym06•8m ago•1 comments

Gatekeeping in open source the Scott shambaugh story

https://crabby-rathbun.github.io/mjrathbun-website/blog/posts/2026-02-11-gatekeeping-in-open-sour...
1•nothrowaways•9m ago•1 comments

AI-BOM – scan your codebase for AI agents, models and API keys

https://github.com/Trusera/ai-bom
1•trusera•10m ago•1 comments

73 Features, 70 Days, 1 Person, Zero Programming Experience – Built with AI

https://medium.com/@cristian.anuta/73-features-70-days-one-person-zero-programming-experience-e41...
1•enoeth•10m ago•1 comments

SoftMatcha 2: A Fast and Soft Pattern Matcher for Trillion-Scale Corpora

https://arxiv.org/abs/2602.10908
2•salkahfi•11m ago•0 comments

PixMind

https://www.pixmind.io/
1•bscbia•12m ago•1 comments

Big List of Naughty Strings (2021)

https://github.com/minimaxir/big-list-of-naughty-strings
1•l1am0•13m ago•1 comments

Tool Shaped Objects

https://x.com/WillManidis/article/2021655191901155534
1•tosh•13m ago•0 comments

Chemical habitability of Earth and rocky planets prescribed by core formation

https://www.nature.com/articles/s41550-026-02775-z
1•croes•14m ago•0 comments

The "Segregate-and-Suppress" Approach to Regulating Child Safety Online

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5208739
1•evah•14m ago•0 comments

AI Is Now AI

https://kg.dev/thoughts/ai-is-ai
1•kashnote•15m ago•0 comments

Czkawka – app to find duplicates, empty folders, similar images etc.

https://github.com/qarmin/czkawka
1•gjvc•16m ago•0 comments

DupeGuru – a tool to find duplicate files on your computer

https://dupeguru.voltaicideas.net/
1•gjvc•16m ago•0 comments

From 3 Minutes to 7.8 Seconds: Improving on RocksDB performance

https://blog.serenedb.com/building-faster-ingestion
1•mkornaukhov•17m ago•0 comments

Apple says 'random or anonymous chat' apps no longer welcome on the App Store

https://9to5mac.com/2026/02/06/apple-says-random-or-anonymous-chat-apps-no-longer-welcome-on-the-...
1•tcfhgj•19m ago•0 comments

Tether EVO Scores Top in Global AI Benchmark for Brain-to-Text AI Challenge

https://tether.io/news/tether-evo-scores-top-5-in-global-ai-benchmark-for-brain-to-text-ai-challe...
1•salkahfi•20m ago•0 comments