frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•8mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Show HN: Orange Elephant – a browser extension to add annotations to HN users

https://github.com/sebastianlay/orange-elephant
2•sebastianlay•2m ago•0 comments

Quantum mechanics: A full breakdown of Bell's inequality

https://foundational.site/bells-inequality/
1•yonitate•2m ago•0 comments

eBay bans illicit automated shopping amid rapid rise of AI agents

https://arstechnica.com/information-technology/2026/01/ebay-bans-illicit-automated-shopping-amid-...
1•mikece•3m ago•0 comments

U.S. murder rate hits lowest level since 1900, report says

https://www.axios.com/2026/01/22/murder-rate-century-low
1•Jimmc414•5m ago•0 comments

She built an AI bot of her mother to help her grieve

https://restofworld.org/2026/james-multon-love-machines-book-ai-companions/
1•xve•7m ago•0 comments

Accidentally rm rfd a production server

https://old.reddit.com/r/cscareerquestions/comments/1qjsfv8/accidentally_rm_rfd_a_production_server/
1•vatsachak•7m ago•0 comments

AI code review needs specialized agents, not bigger models

https://www.qodo.ai/blog/the-next-generation-of-ai-code-review-from-isolated-to-system-intelligence/
1•timbilt•9m ago•0 comments

FreeBSD Is a No-Go for KDE's Plasma Login Manager

https://itsfoss.com/news/plasma-login-manager-drops-freebsd/
1•mikece•9m ago•0 comments

First Ethiopian wolf ever captured, nursed and returned to the wild

https://theconversation.com/africas-rarest-carnivore-the-story-of-the-first-ethiopian-wolf-ever-c...
2•PaulHoule•9m ago•0 comments

What will tech jobs look like in 2026?

https://restofworld.org/2026/tech-jobs-2026-ai-layoffs-hybrid-work/
1•NDAjam•11m ago•0 comments

Microsoft Outlook products appear to be down (it's not just you)

https://downdetector.com/status/outlook/
1•mnehring•13m ago•1 comments

Tesla begins public unsupervised Robotaxi rides

https://twitter.com/aelluswamy/status/2014398853991301538
1•dillona•13m ago•0 comments

I Learned the First Rule of ARIA the Hard Way

https://css-tricks.com/i-learned-the-first-rule-of-aria-the-hard-way/
2•mooreds•13m ago•0 comments

Ageing promotes microglial accumulation of slow-degrading synaptic proteins

https://www.nature.com/articles/s41586-025-09987-9
1•bookofjoe•14m ago•0 comments

Siri will be a chatbot in iOS 27

https://9to5mac.com/2026/01/21/apple-reportedly-replacing-siri-interface-with-actual-chatbot-expe...
1•naves•16m ago•0 comments

Why does SSH send 100 packets per keystroke?

https://eieio.games/blog/ssh-sends-100-packets-per-keystroke/
23•eieio•16m ago•2 comments

In Davos, the AI bubble is always someone else's problem

https://www.axios.com/2026/01/22/in-davos-the-ai-bubble-is-always-someone-elses-problem
2•zerosizedweasle•17m ago•0 comments

Rust's Golden Rule: The Signature Is the Contract

https://steveklabnik.com/writing/rusts-golden-rule/
1•zahrevsky•18m ago•0 comments

Scribe reduces SWE-bench token usage by 30% with no loss of accuracy

https://sibylline.dev/articles/2026-01-22-scribe-swebench-benchmark/
1•CuriouslyC•21m ago•0 comments

LLM Agents Solve the Table Merging Problem

https://futuresearch.ai/deep-merge-tutorial/
9•ddp26•22m ago•1 comments

Coffee, Tea, and Bone Density in Older Women: A 10-Year Study

https://www.mdpi.com/2072-6643/17/23/3660
1•gnabgib•23m ago•0 comments

The A.I. Startup Soap Opera Riveting Silicon Valley

https://www.nytimes.com/2026/01/22/technology/thinking-machines-ai-startup-openai.html
2•philip1209•25m ago•0 comments

The History of Light

https://beatingthehydra.substack.com/p/the-history-of-light
2•Kotlopou•27m ago•1 comments

Money for nothing? Gas Town meme coin:unsolicited

https://newsletterhunt.com/emails/217139
2•worik•29m ago•0 comments

Batocera.linux

https://batocera.org/
1•thunderbong•32m ago•0 comments

The Magic Behind UUID in Swift, How Your App Generates Unique Identifiers

https://www.swiftdifferently.com/blog/swift/the-magic-behind-uuid-in-swift
2•maguszin•33m ago•0 comments

Trump's 'unpredictable' policies to fuel multiyear shift from US, Pimco says

https://www.ft.com/content/9b2f8903-4350-45a5-a915-a58b6f9b35fb
3•toomuchtodo•33m ago•1 comments

Show HN: AI Search Index – Track which AI bots crawl your website

https://www.aisearchindex.com
2•ihmissuti•35m ago•0 comments

Sony and TCL Sign MoU for Strategic Partnership in Home Entertainment Field

https://www.sony.co.jp/en/news-release/202601/26-0120E/
1•saikatsg•37m ago•0 comments

6 MOQ Players You Need to Know About: Pros and Cons

https://www.red5.net/blog/6-moq-players-you-need-to-know-about/
1•mondainx•39m ago•0 comments