frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•9mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Tux.wf Beta 2.0 – Free Short URLs and Sub-Domain Hosting (Community-Driven)

https://tux.wf
1•tuxyz•35s ago•1 comments

Star collapse into a black hole without a supernova

https://www.sciencedaily.com/releases/2026/02/260213223855.htm
1•wglb•55s ago•0 comments

CiderPress: Turn your voice memos into living knowledge

https://github.com/appstart-one/ciderpress/blob/main/README.md
1•hutch777•3m ago•1 comments

A Journey into a Mainframe

https://blog.hermesloom.org/p/a-journey-into-a-mainframe
1•sigalor•4m ago•0 comments

Don't switch context when waiting for AI

https://silentsand.me
1•brotmitkot•6m ago•0 comments

Prediction markets should take their cue from 007

https://www.ft.com/content/3480ba0f-0524-426a-b166-4c1055d3c8b1
1•hhs•7m ago•0 comments

Palette: Generate on‑brand marketing visuals from a website URL

https://thepalette.app/
1•liftof•7m ago•1 comments

Show HN: LaunchFast – Ship your Next.js SaaS in days, not months

https://github.com/Wittlesus/launchfast-starter
1•wittlesus•9m ago•0 comments

VS Code becomes multi-agent command center for developers

https://thenewstack.io/vs-code-becomes-multi-agent-command-center-for-developers/
1•scubakid•12m ago•0 comments

Why education innovation fails to scale, and what can be done about it: research

https://www.hoover.org/press/new-hoover-institution-research-reveals-why-education-innovation-fai...
1•hhs•13m ago•0 comments

Dying Every Six Hours

https://sammyjankis.com/essay.html
1•_vaporwave_•18m ago•0 comments

How LiveATC Went Live

https://www.ainonline.com/aviation-news/air-transport/2025-12-15/how-liveatc-went-live
1•dangle1•24m ago•0 comments

Show HN: Silo – Every Git branch gets its own localhost

https://github.com/silo-rs/silo
1•junhsss•29m ago•0 comments

Show HN: Lore

https://github.com/dgpc/LORE
1•daave•30m ago•0 comments

Nanotrace: A nanosecond-scale profiler using Intel PT

https://omar.yt/posts/nanotrace-a-nanosecond-scale-profiler-using-intel-pt
1•omarroth•31m ago•0 comments

Canada Has a Secessionist Movement on Its Hands. Its Supporters Thank Trump

https://www.wsj.com/world/americas/alberta-canada-independence-7549e240
4•JumpCrisscross•31m ago•4 comments

finding projects worth doing

https://usize.github.io/blog/2026/advice-00.html
1•plaidthunder•34m ago•0 comments

Researcher skeptical of 'Havana syndrome' tested secret weapon on himself

https://www.washingtonpost.com/national-security/2026/02/14/havana-syndrome-cia-norway-experiment/
7•bookofjoe•36m ago•2 comments

A 3% Rule for Budget Deficits Would Be a Good Start

https://www.advisorperspectives.com/articles/2026/02/13/a-3-rule-for-budget-deficits-would-be-a-g...
1•RickJWagner•38m ago•2 comments

Valentine's Day gift for Winter Olympics athletes – more condoms

https://news.sky.com/story/valentines-day-gift-for-winter-olympics-athletes-more-condoms-13507752
2•austinallegro•39m ago•1 comments

Show HN: Webcam eye-tracking to verify meditation, with money on the line

https://heartful.day/
1•louison11•42m ago•0 comments

Show HN: Nerve: Stitches all your data sources into one mega-API

https://playground.get-nerve.com/
1•mprast•42m ago•0 comments

Show HN: Modo – Manage reusable Claude Code config presets from the CLI

https://github.com/lennacodes/modo
1•lennacodes•44m ago•1 comments

Show HN: We recover SLA credits your cloud vendors owe you

https://reclaimsla.com
2•arberx•44m ago•0 comments

MathArena: Evaluating LLMs on uncontaminated math questions

https://matharena.ai/?view=problem&comp=aime--aime_2026
2•GaggiX•45m ago•0 comments

LLM Alignment/Hallucinations Can't Be Fixed – Proof

https://github.com/moketchups/permanently-jailbroken
2•MoKetchups•46m ago•0 comments

I structured Dario Amodei's philosophy into an open-source book

1•leading-AI•46m ago•1 comments

Drone footage shows turnout in Toronto rally of Iranians [video]

https://www.youtube.com/shorts/l8chv_AQszA
2•ukblewis•47m ago•0 comments

Show HN: Limitless – C/C++ infinitely large number storage

https://github.com/tgergo1/limitless
1•tgergo1•50m ago•0 comments

Show HN: FoodCraft – AI cooking assistant that adapts recipes to your diet/goals

https://foodcraft.app/en
1•Ekimo•51m ago•0 comments