frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•6mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

3 things to know about Ironwood, our latest TPU

https://blog.google/products/google-cloud/ironwood-google-tpu-things-to-know/
1•zdw•52s ago•0 comments

Someone at YouTube Needs Glasses: The Prophecy Has Been Fulfilled

https://jayd.ml/2025/11/10/someone-at-youtube-needs-glasses-prophecy-fulfilled.html
1•jaydenmilne•1m ago•1 comments

Building road signs at home using a Cricut Machine

https://annanay.dev/build-a-signboard/
1•annanay•2m ago•0 comments

Monty – a sensorimotor learning system following the principles of the neocortex

https://github.com/thousandbrainsproject/tbp.monty
1•stefanv•11m ago•0 comments

20x less peak RAM in PyTorch memory budget solver

https://jedrzej.maczan.pl/2025_11_21_dp_knapsack_sliding_hirschberg
1•yu3zhou4•13m ago•0 comments

Arabic Loanwords in the Neo-Aramaic Dialect of Ankawa

https://books.openbookpublishers.com/10.11647/obp.0209/ch16.xhtml
1•marysminefnuf•13m ago•0 comments

Mumbai families suffer as datacentres keep the city hooked on coal

https://www.theguardian.com/technology/2025/nov/24/mumbai-datacentres-coal-air-pollution
1•Teever•14m ago•0 comments

An Affordable Voice Assistant That Won't Spy on You (2022)

https://blog.tjll.net/build-your-own-voice-asistant/
1•wonger_•15m ago•0 comments

Ask HN: Is Techmeme getting paid to boost certain articles?

2•dabockster•19m ago•1 comments

Let go of StackOverflow; communities must take ownership

https://ahelwer.ca/post/2025-11-25-stackoverflow/
1•tensegrist•19m ago•0 comments

Google steers Americans looking for health care into "junk insurance"

https://pluralistic.net/2025/11/25/open-season/
2•hn_acker•20m ago•0 comments

The Penicillin Myth

https://www.asimov.press/p/penicillin-myth
1•fanf2•23m ago•0 comments

Mapping the Brain's Sense of What Goes on Inside the Body

https://www.nytimes.com/2025/11/25/science/brain-neuroscience-interoception.html
2•tysone•27m ago•0 comments

OpenStreetMap scales to serve users worldwide with Fastly

https://www.fastly.com/customers/openstreetmap
2•raybb•28m ago•0 comments

New Version of the MCP Specification

https://modelcontextprotocol.io/specification/2025-11-25
1•mooreds•29m ago•0 comments

Show HN: Rs-Utcp, a Rust Implementation of the Universal Tool Calling Protocol

https://github.com/universal-tool-calling-protocol/rs-utcp
1•juanviera23•29m ago•0 comments

NVMe driver for Windows 2000, targeting both x86 and Alpha AXP platforms

https://github.com/techomancer/nvme2k
1•zdw•30m ago•0 comments

Techrights the Name Turns 15

https://techrights.org/n/2025/09/20/Techrights_the_Name_Turns_15.shtml
1•amcclure•30m ago•0 comments

Why are static site generators so complicated to use?

https://thejollyteapot.com/2024/10/15/why-are-static-site-generators-so-complicated-to-use/
2•speckx•34m ago•0 comments

The Silent War Between AI and Blockchain for the Future of Trust

https://thinkmintmedia.blogspot.com/2025/11/the-silent-war-between-ai-and.html
1•iamtech•38m ago•0 comments

Ask HN: If Unix gets more popular would you use it instead of Linux?

1•catstor•38m ago•4 comments

Rightwing Influencers Fall for Mamdani's 'Arabic Numerals' in School Troll [video]

https://www.youtube.com/watch?v=zujUUI0WEvk
3•absqueued•39m ago•0 comments

Stop Putting Your Passwords into Random Websites (Yes, Seriously, You Are the PR

https://labs.watchtowr.com/stop-putting-your-passwords-into-random-websites-yes-seriously-you-are...
2•Deeg9rie9usi•39m ago•0 comments

Playing Safe with AI

https://declanbright.com/software/playing-safe-with-ai/
1•dclnbrght•40m ago•1 comments

The State of AI Agent Frameworks in 2025

https://devnavigator.com/2025/11/20/the-state-of-ai-agent-frameworks-in-2025/
1•BerislavLopac•41m ago•0 comments

Amazon CTO Werner Vogels predicts the Renaissance developer era

https://www.aboutamazon.com/news/aws/werner-vogels-amazon-cto-predictions-2026
1•jonbaer•42m ago•0 comments

Swift Standard Library Type Graph (2020)

https://arthurhammer.de/2020/01/swift-stdlib-type-graph/
1•joecobb•42m ago•0 comments

My theories on why Apple's software quality has degraded so much

https://forums.macrumors.com/threads/my-theories-on-why-apples-software-quality-has-degraded-so-m...
2•f_allwein•42m ago•0 comments

Show HN: Validating "Scratch for AI agents" before building

https://www.orchastra.org/
1•harjjotsinghh•43m ago•0 comments

Private Credit's Sketchy Marks Get Warning Shot from Wall Street's Top Cop

https://www.bloomberg.com/news/articles/2025-11-25/private-credit-s-sketchy-marks-get-warning-sho...
2•zerosizedweasle•46m ago•0 comments