Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•8mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
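
For readers who want to see step 2 in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation. The function name `scroll_until_loaded`, the `scroll_once` and `count_items` callbacks, the default delay, and the early stop after a few rounds without new items are all assumptions added for illustration; the post only states roughly 150 iterations with calibrated delays.

```python
import time
from typing import Callable


def scroll_until_loaded(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_iterations: int = 150,   # the post mentions about 150 iterations
    delay_seconds: float = 0.5,  # assumed value; the real delay was tuned by hand
    patience: int = 3,           # assumed: give up after this many rounds with no new items
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll repeatedly until the item count stops growing or the cap is reached.

    scroll_once and count_items are hypothetical hooks that a browser
    automation layer would supply (scroll the page, count loaded listings).
    """
    last_count = count_items()
    stalled_rounds = 0
    for _ in range(max_iterations):
        scroll_once()
        sleep(delay_seconds)  # give the page time to load the next batch
        current = count_items()
        if current > last_count:
            last_count = current
            stalled_rounds = 0
        else:
            stalled_rounds += 1
            if stalled_rounds >= patience:
                break  # nothing new for several rounds, assume the list is complete
    return last_count
```

With a real browser driver, `scroll_once` would scroll the page down and `count_items` would count the listing elements currently in the DOM. Passing `sleep` as a parameter keeps the loop easy to exercise without real waiting.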

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
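
One common way to cope with listings whose DOM structure varies is to try several XPath patterns per field, in order, and take the first one that yields text. The Python sketch below illustrates that idea and the CSV output from step 4. It assumes made-up markup, made-up company data, and hypothetical patterns in `FIELD_PATTERNS`; none of these come from YC's actual page or from Autonoly. It uses the limited XPath subset in Python's standard `xml.etree.ElementTree`, so it needs well-formed markup.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical patterns: each column lists XPath candidates, tried in order.
FIELD_PATTERNS = {
    "name": [".//span[@class='name']", ".//a/span"],
    "description": [".//span[@class='description']", ".//p"],
    "location": [".//span[@class='location']"],
}

# Made-up sample markup; the second listing is structured differently.
SAMPLE = """<div>
  <div class="company">
    <span class="name">Acme</span>
    <span class="description">Rockets, fast</span>
    <span class="location">Berlin</span>
  </div>
  <div class="company">
    <a><span>Globex</span></a>
    <p>Energy</p>
  </div>
</div>"""


def first_text(node: ET.Element, patterns: list[str]) -> str:
    """Return the stripped text of the first pattern that matches, else ''."""
    for pattern in patterns:
        text = node.findtext(pattern)
        if text and text.strip():
            return text.strip()
    return ""


def extract_rows(markup: str, listing_xpath: str = ".//div[@class='company']") -> list[dict[str, str]]:
    """Build one dict per listing, filling each column via its fallback patterns."""
    root = ET.fromstring(markup)
    return [
        {column: first_text(node, patterns) for column, patterns in FIELD_PATTERNS.items()}
        for node in root.findall(listing_xpath)
    ]


def rows_to_csv(rows: list[dict[str, str]]) -> str:
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELD_PATTERNS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(rows_to_csv(extract_rows(SAMPLE)))
```

A missing field becomes an empty cell rather than an error, which keeps every row aligned with the header when an entry lacks, say, a location.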

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Can the prescription drug leucovorin treat autism? History says, probably not

https://www.npr.org/sections/shots-health-news/2026/01/22/nx-s1-5684294/leucovorin-autism-folic-f...
1•pseudolus•49s ago•0 comments

Davos Stops Pretending

https://messaging-custom-newsletters.nytimes.com/dynamic/render
1•doener•1m ago•0 comments

For the Children: A short story about the endgame of EU Chat Control

https://gigaprojects.online/post/1
1•giga_private•2m ago•1 comments

An Adversarial Coding Test

https://runjak.codes/posts/2026-01-21-adversarial-coding-test/
1•birdculture•4m ago•0 comments

Go Developer Survey 2025: How Gophers Use AI Tools, Editors, and Cloud Platforms

https://go.dev/blog/survey2025
1•Lwrless•4m ago•0 comments

Ask HN: What's the current best local/open speech-to-speech setup?

1•dsrtslnd23•6m ago•0 comments

A Multi-Entry Control Flow Graph Design Conundrum

https://bernsteinbear.com/blog/multiple-entry/
1•chunkles•9m ago•0 comments

Bernstein vs. United States

https://en.wikipedia.org/wiki/Bernstein_v._United_States
1•u1hcw9nx•11m ago•0 comments

Show HN: Workmux – Parallel development in tmux with Git worktrees

https://workmux.raine.dev/
1•rane•11m ago•0 comments

Show HN: 9 years building an open-source financial platform

https://github.com/finmars-platform/finmars-core
1•ogreshnev•12m ago•0 comments

Ask HN: What 'AI feature' created negative ROI in production?

1•kajolshah_bt•13m ago•0 comments

TigerBeetle's Stablecoin Mistake

https://www.news.alvaroduran.com/tigerbeetle-stablecoin-mistake/
1•ohduran•13m ago•0 comments

What Will You Do When AI runs Out of Money and Disappear?

https://louwrentius.com/what-will-you-do-when-ai-will-run-out-of-money-and-disappear.html
1•louwrentius•15m ago•0 comments

Why is software still built like billions don't exist in 2026?

4•yerushalayim•17m ago•1 comments

Is Polish Scrabble the most difficult in the world? [video]

https://www.youtube.com/watch?v=aTIOHwT0FnY
1•nathell•17m ago•0 comments

Post-Agentic Code Forges

https://sluongng.substack.com/p/post-agentic-code-forges
1•todsacerdoti•18m ago•0 comments

In-memory analog computing for non-negative matrix factorization

https://www.nature.com/articles/s41467-026-68609-8
1•martinlaz•23m ago•0 comments

RT Superconductivity at 298K in Ternary LaScH System at High-Pressure Conditions

https://arxiv.org/abs/2510.01273
1•fluffybuns•25m ago•0 comments

Show HN: Waifu2x.live – Free AI image upscaler (2x/4x) & video generation

1•Nancy1230•25m ago•1 comments

Campaigner launches £1.5B legal action in UK against Apple over wallet's ...

https://www.theguardian.com/technology/2026/jan/23/campaigner-launches-legal-action-against-apple...
1•chrisjj•27m ago•1 comments

Anthropic: AI Is Transforming Jobs, Not Replacing Them

https://www.forbes.com/sites/anishasircar/2026/01/23/ai-is-transforming-jobs-not-replacing-them-a...
1•hochmartinez•28m ago•1 comments

AI Boosts Research Careers but Flattens Scientific Discovery

https://spectrum.ieee.org/ai-science-research-flattens-discovery
1•pseudolus•28m ago•0 comments

Google must face consumer antitrust lawsuit over search dominance, US judge rules

https://www.reuters.com/legal/government/google-must-face-consumer-antitrust-lawsuit-over-search-...
2•pseudolus•29m ago•0 comments

Do We Still Need Tech Blogs in the Era of GenAI?

https://blog.mrcroxx.com/posts/do-we-still-need-tech-blogs-in-the-era-of-gen-ai/
1•MrCroxx•30m ago•0 comments

Show HN: Simple esp-idf and esp-matter version manager

https://github.com/matterizelabs/espvm
1•abu-matterize•31m ago•0 comments

Booting a PC from a Vinyl Record

https://boginjr.com/it/sw/dev/vinyl-boot/
1•yesturi•31m ago•0 comments

Show HN: Kite – lightweight production-ready agentic AI framework with Ollama

https://github.com/thienzz/Kite
1•thienzz•33m ago•1 comments

Resisting the Rule of the Rich: Protecting Freedom from Billionaire Power

https://www.oxfamamerica.org/explore/research-publications/resisting-the-rule-of-the-rich/
1•decimalenough•34m ago•0 comments

Show HN: OPC Skills – 9 AI agent skills for solopreneurs (Claude Code, Cursor)

https://opc.dev/
1•Zephyr0x•34m ago•0 comments

Sonic Booms and Seismic Waves Can Reveal Where Space Junk Crash-Lands

https://www.nytimes.com/2026/01/22/science/space-junk-seismographs.html
1•_____k•38m ago•0 comments