frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•6mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

A Clever Approach to AI Drawing I Accidentally Discovered – Bear. Best

https://bear.best/en/blog/a-clever-approach-to-ai-drawing/
1•BearBest•1m ago•0 comments

BusyBox-Only Linux

https://github.com/chirsz-ever/busybox-linux
1•chirsz•2m ago•0 comments

Versioning Events Without Breaking Everything

https://docs.eventsourcingdb.io/blog/2025/12/08/versioning-events-without-breaking-everything/
1•goloroden•3m ago•0 comments

IBM nears $11B deal to acquire Confluent

https://finance.yahoo.com/news/ibm-nears-roughly-11-billion-031139352.html
2•theanonymousone•6m ago•0 comments

Show HN: CatalystAlert – Free biotech catalyst calendar (985 companies tracked)

https://catalystalert.io
2•nykodev•6m ago•1 comments

Show HN: Brick Starter – .NET SaaS starter kit to ship production apps faster

https://www.brickstarter.net
1•plakhlani2•8m ago•1 comments

Russian authorities have imposed restrictions on FaceTime

https://apnews.com/article/russia-internet-crackdown-facetime-restrictions-06301be480510b18ae0203...
2•chmaynard•10m ago•0 comments

Kinesis Advantage2

https://danishpraka.sh/posts/kinesis-advantage2/
1•prakashdanish•18m ago•0 comments

Show HN: Peargent – A Simple Python Framework for Building AI Agents

https://github.com/Quanta-Naut/peargent
1•Quanta-Naut•21m ago•1 comments

Indian boy, aged 3, becomes youngest rated chess player in history

https://www.nytimes.com/athletic/6869534/2025/12/07/youngest-chess-player-age-india/
2•NewCzech•21m ago•1 comments

Finnix

https://en.wikipedia.org/wiki/Finnix
2•fuzztester•23m ago•1 comments

Fifty Years of Retracted Medical Publications from 1975 to 2024

https://jkms.org/DOIx.php?id=10.3346/jkms.2025.40.e300
1•XzetaU8•24m ago•0 comments

Apple Taps Meta Lawyer as General Counsel in Latest Shake-Up

https://www.bloomberg.com/news/articles/2025-12-04/apple-taps-top-meta-lawyer-as-general-counsel-...
1•mgh2•34m ago•1 comments

Estimate Trend at a Point in a Noisy Time Series

https://github.com/finite-sample/incline
1•neehao•35m ago•0 comments

Publishing Malicious VS Code Extensions: Bypassing VS Code Marketplace Analysis

https://mazinahmed.net/blog/publishing-malicious-vscode-extensions/
1•mazen160•41m ago•0 comments

IBM to Acquire Confluent for $11B

https://www.bloomberg.com/news/articles/2025-12-08/ibm-close-to-buying-confluent-in-11-billion-de...
3•marc__1•44m ago•2 comments

Dewy: Continuous deployments for VPS and bare metal, no K8s required

https://github.com/linyows/dewy
1•linyows•46m ago•1 comments

EVs 80% Worse Consumer Reports Lied – ICE Cars Are Failing at Record Levels

https://www.youtube.com/watch?v=f2kYoahAw5U
1•xbmcuser•47m ago•0 comments

2FAS Pass: Local-First Password Manager

https://2fas.com/pass/
1•thunderbong•47m ago•0 comments

Kazakhstan, France collaborate to boost aviation training capacity

https://qazinform.com/news/kazakhstan-france-collaborate-to-boost-aviation-training-capacity-4d2486
1•Bolat14•52m ago•0 comments

Earth needs energy. Atlanta's Super Soaker creator may have a solution

https://www.seattletimes.com/business/earth-needs-energy-atlantas-super-soaker-creator-may-have-a...
1•Gaishan•54m ago•0 comments

FiwixOS 3.5 Released

https://www.fiwix.org/news/20251115.html
1•coolcoder613•55m ago•0 comments

GeneralGiist – A Global Forum Built for Real, Unfiltered Conversations

1•cimaa•59m ago•1 comments

How to Use Git Worktree for Claude Code Development

https://medium.com/@naveensky/how-to-use-git-worktree-for-claude-code-development-43dfbd554b21
1•naveensky•59m ago•0 comments

Funerary figurines found in royal tomb identifies Pharoah

https://www.sciencealert.com/trove-of-225-exceptional-egyptian-figurines-solves-long-standing-mys...
1•Gaishan•1h ago•0 comments

The Forge Tier List

https://theforgetierlist.com/
1•quchao•1h ago•2 comments

Cybersecurity Must Block AI Browsers for Now

https://www.gartner.com/en/documents/7211030
1•gnabgib•1h ago•0 comments

CDC advisory panel delays vote on hepatitis B vaccines

https://www.nbcnews.com/health/health-news/cdc-advisory-panel-delays-vote-hepatitis-b-vaccines-rc...
1•gmays•1h ago•0 comments

Block all AI browsers for the foreseeable future: Gartner

https://www.theregister.com/2025/12/08/gartner_recommends_ai_browser_ban/
2•defrost•1h ago•0 comments

Show HN: I added coins to Dino Game

https://dinosaurgame.app/
2•coolwebtoolsguy•1h ago•1 comments