Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•9mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
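For anyone who wants to picture step 2 in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation, which is built without hand-written code. The function name, the `patience` early-exit rule, the delay value, and the `FakePage` stand-in are all assumptions made for illustration. In a real browser driver, `scroll_once` would wrap whatever scroll call that driver offers and `count_items` would count the listing elements currently in the DOM.

```python
import time
from typing import Callable


def progressive_scroll(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_iterations: int = 150,   # the post mentions roughly 150 iterations
    delay_seconds: float = 0.5,  # assumed value; the post only says "calibrated"
    patience: int = 5,           # assumed: stop after this many rounds with no new items
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll repeatedly, pausing after each scroll, and return the item count reached."""
    seen = count_items()
    stale_rounds = 0
    for _ in range(max_iterations):
        scroll_once()
        sleep(delay_seconds)  # give the page time to load the next batch
        current = count_items()
        if current > seen:
            seen = current
            stale_rounds = 0
        else:
            stale_rounds += 1
            if stale_rounds >= patience:
                break  # nothing new for a while, so assume the list is complete
    return seen


class FakePage:
    """Hypothetical stand-in for a browser page that loads a batch per scroll."""

    def __init__(self, total: int, batch: int) -> None:
        self.total = total
        self.batch = batch
        self.loaded = batch

    def scroll(self) -> None:
        self.loaded = min(self.total, self.loaded + self.batch)

    def count(self) -> int:
        return self.loaded
```

Passing `sleep` as a parameter keeps the loop testable without real waiting, for example `progressive_scroll(page.scroll, page.count, sleep=lambda s: None)` with a `FakePage(total=1000, batch=40)`.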

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
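One common way to cope with listings whose DOM structure differs slightly is to try several XPath patterns per field and take the first that matches. The sketch below shows that idea with Python's standard library only. The markup, class names, and XPath expressions are hypothetical, not YC's actual DOM and not the selectors used in Autonoly. `xml.etree.ElementTree` supports only a subset of XPath and needs well-formed markup, so a real page would more likely be queried through the browser or an HTML-aware parser.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical markup: the second listing nests the name in an extra wrapper
# and has no location, to mimic entries whose structure varies.
SAMPLE = """
<div>
  <a class="company"><span class="name">Acme</span><span class="desc">Rockets</span><span class="loc">Austin, TX</span></a>
  <a class="company"><div class="header"><span class="name">Bolt</span></div><span class="desc">Fasteners</span></a>
</div>
"""

# For each output column, XPath patterns to try in order (all assumed).
FIELD_XPATHS = {
    "name": ["./span[@class='name']", "./div[@class='header']/span[@class='name']"],
    "description": ["./span[@class='desc']"],
    "location": ["./span[@class='loc']"],
}


def first_match(node, xpaths):
    """Return the text of the first pattern that matches, or an empty string."""
    for xpath in xpaths:
        found = node.find(xpath)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company listing found in the markup."""
    root = ET.fromstring(markup.strip())
    return [
        {field: first_match(card, xpaths) for field, xpaths in FIELD_XPATHS.items()}
        for card in root.findall(".//a[@class='company']")
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELD_XPATHS))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

A missing field becomes an empty cell instead of an error, so one oddly structured listing does not abort the whole export.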

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
