frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•8mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

AI Generated Music Barred from Bandcamp

https://old.reddit.com/r/BandCamp/comments/1qbw8ba/ai_generated_music_on_bandcamp/
1•cdrnsf•1m ago•0 comments

Notre-Dame sees record number of visitors, one year on from reopening

https://www.rfi.fr/en/france/20260105-notre-dame-sees-record-number-of-visitors-one-year-on-from-...
2•gnabgib•4m ago•0 comments

The rapid rise and slow decline of Sam Altman

https://garymarcus.substack.com/p/the-rapid-rise-and-slow-decline-of
1•treadump•4m ago•0 comments

DevOps'ish Returns

https://buttondown.com/devopsish/archive/devopsish-returns/
1•oaf357•4m ago•0 comments

Verizon to stop automatic unlocking of phones as FCC ends 60-day unlock rule

https://arstechnica.com/tech-policy/2026/01/fcc-lets-verizon-lock-phones-for-longer-making-it-har...
1•cdrnsf•4m ago•0 comments

Can Philanthropy Fast-Track a Flagship Telescope?

https://www.universetoday.com/articles/can-philanthropy-fast-track-a-flagship-telescope
1•rbanffy•5m ago•0 comments

Claude Code Questionnaires

https://djharper.dev/post/2026/01/10/claude-code-questionnaires/
1•speckx•5m ago•0 comments

Apple-1 Computer Prototype Board #0 Auction

https://www.rrauction.com/auctions/lot-detail/350902407346003-apple-1-computer-prototype-board-0-...
1•qingcharles•6m ago•0 comments

Semiconductor Fabs II: The Operation

https://nomagicpill.substack.com/p/semiconductor-fabs-ii-the-operation
1•paulpauper•6m ago•0 comments

Notes on Afghanistan

https://mattlakeman.org/2026/01/05/notes-on-afghanistan/
1•paulpauper•6m ago•0 comments

North America's Elevator Problem [video]

https://www.youtube.com/watch?v=Or1_qVdekYM
1•lisper•7m ago•0 comments

Show HN: XAI-style recursive FOL logic tree engine with GUI (2002)

https://TreeOfKnowledge.eu
1•JAnicaTZ•7m ago•0 comments

The Boundary Problem

https://jonasmoman.substack.com/p/the-boundary-problem
1•paulpauper•7m ago•0 comments

Segmented Turning Designer

https://cdelker.bitbucket.io/segbowl/
1•DriftRegion•9m ago•0 comments

How to Design Python AI Projects That Don't Fall Apart

https://www.decodingai.com/p/how-to-design-python-ai-projects
1•rbanffy•10m ago•0 comments

Differences Between Lead Roles and How to Find Your Right Path

https://newsletter.eng-leadership.com/p/differences-between-lead-roles-and
1•rbanffy•10m ago•0 comments

The truth behind the 2026 J.P. Morgan Healthcare Conference

https://www.owlposting.com/p/the-truth-behind-the-2026-jp-morgan
1•abhishaike•11m ago•0 comments

Show HN: Pick a point on Earth and see the last/next total solar eclipse there

https://findmyeclipse.com
1•spookyuser•12m ago•0 comments

3D Anatomy Viewer

https://www.talktoanatomy.com/
1•ashahrourr•12m ago•0 comments

Publishers fear AI search summaries and chatbots mean 'end of traffic era'

https://www.theguardian.com/media/2026/jan/12/publishers-fear-ai-search-summaries-and-chatbots-me...
2•bookofjoe•13m ago•0 comments

Scott Adams, comic strip author of 'Dilbert' dies at 68

https://apnews.com/article/scott-adams-dies-dilbert-cartoonist-ccddff117f962854cb70d973c3075544
1•Physkal•13m ago•0 comments

Contrakit: Predicting Model Hallucination Before Training

https://github.com/off-by-some/contrakit/tree/main/examples/hallucinations
1•off-by-some•15m ago•0 comments

Global Stocks Are Projected to Return 11% in the Next 12 Months

https://www.goldmansachs.com/insights/articles/global-stocks-are-projected-to-return-11-percent-i...
1•0xedb•16m ago•0 comments

Show HN: cubic 2.0 – improving our AI code reviewer (3x more accurate,2x faster)

https://www.cubic.dev/blog/cubic-2.0
2•pomarie•16m ago•0 comments

Is AI the Answer?

https://www.siliconpublishing.com/blog/surfing-the-data-wave-is-ai-really-the-answer/
2•maxdunn1•17m ago•0 comments

Show HN: Simple browser game to teach AI transformation concepts to small biz

https://aiquest.futureu.co/
1•Renjit•19m ago•1 comments

Using Proxies to Hide Secrets from Claude Code

https://www.joinformal.com/blog/using-proxies-to-hide-secrets-from-claude-code/
1•drewgregory•21m ago•1 comments

Nitrous oxide for the treatment of depression: a systematic review

https://www.thelancet.com/journals/ebiom/article/PIIS2352-3964(25)00467-0/fulltext
1•geox•21m ago•0 comments

Vibe coded terminal rendered Counter Strike 1.6 clone

https://old.reddit.com/r/vibecoding/comments/1qblg0x/i_vibe_coded_a_terminal_rendered_counter_str...
1•idanb•22m ago•0 comments

Microplastic exposure alters sperm snRNA and affects offspring metabolic health

https://academic.oup.com/jes/advance-article/doi/10.1210/jendso/bvaf214/8383852?login=false
2•PaulHoule•25m ago•0 comments