Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•8mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
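To make step 2 concrete, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's actual implementation: `scroll_once` and `count_items` are hypothetical hooks that whatever browser driver you use would supply, and the early exit when the item count stops growing is my own addition on top of the fixed ~150 iterations described above.

```python
import time
from typing import Callable


def scroll_until_loaded(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_iterations: int = 150,
    delay_seconds: float = 1.0,
    patience: int = 5,
) -> int:
    """Scroll repeatedly and return the number of items loaded at the end.

    scroll_once and count_items are hypothetical hooks: one scrolls the page
    to the bottom, the other counts the listing elements currently rendered.
    """
    last_count = count_items()
    stalled = 0
    for _ in range(max_iterations):
        scroll_once()
        time.sleep(delay_seconds)  # give lazy-loaded entries time to render
        current = count_items()
        if current > last_count:
            last_count = current
            stalled = 0
        else:
            stalled += 1
            # Assumption: several scrolls with no new items means the list ended.
            if stalled >= patience:
                break
    return last_count
```

The delay and patience values are placeholders. In practice they would need the same kind of tuning the post describes, since a delay that is too short can look like the end of the list.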

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
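One way to cope with entries whose DOM differs slightly is to try several candidate paths per field and take the first match. The sketch below illustrates that idea with Python's standard library. The markup, class names, and paths are invented for the example (the real YC directory structure is not shown here), and `xml.etree.ElementTree` supports only a subset of XPath and requires well-formed markup, so a real browser-based run would use the automation tool's own XPath engine instead.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Invented sample markup: the second entry nests its name differently
# and has no location, mimicking small variations between listings.
SAMPLE = """
<div>
  <a class="company">
    <span class="name">Acme</span>
    <span class="location">Austin, TX</span>
    <span class="description">Rockets for coyotes</span>
  </a>
  <a class="company">
    <div><span class="name">Globex</span></div>
    <span class="description">Hydroponics</span>
  </a>
</div>
"""

# Hypothetical candidate paths per field, tried in order.
FIELD_PATHS = {
    "name": ["./span[@class='name']", "./div/span[@class='name']"],
    "location": ["./span[@class='location']"],
    "description": ["./span[@class='description']"],
}


def first_text(node, paths):
    """Return the text of the first path that matches, or '' if none do."""
    for path in paths:
        found = node.find(path)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company listing found in the markup."""
    root = ET.fromstring(markup.strip())
    return [
        {field: first_text(card, paths) for field, paths in FIELD_PATHS.items()}
        for card in root.findall(".//a[@class='company']")
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELD_PATHS))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Calling `to_csv(extract_rows(SAMPLE))` yields a header plus one row per listing, with an empty cell where a field is missing rather than a failed row.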

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
