Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
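The scroll step in item 2 can be sketched roughly as follows. This is not Autonoly's actual implementation, which the post does not show; it is a minimal Python sketch under stated assumptions. The names progressive_scroll, scroll_once, count_items and FakePage are hypothetical, and the early stop after a few scrolls with no new items is an addition of this sketch (the post only describes a fixed loop of about 150 iterations with delays).

```python
import time
from typing import Callable


def progressive_scroll(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_iterations: int = 150,      # the post mentions roughly 150 iterations
    delay_seconds: float = 0.5,     # assumed value; the post only says "calibrated"
    patience: int = 3,              # assumption: stop after 3 scrolls with no new items
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll repeatedly until the item count stops growing or the cap is hit."""
    last_count = count_items()
    stalled = 0
    for _ in range(max_iterations):
        scroll_once()
        sleep(delay_seconds)        # give lazy-loaded entries time to render
        current = count_items()
        if current == last_count:
            stalled += 1
            if stalled >= patience:
                break
        else:
            stalled = 0
            last_count = current
    return last_count


class FakePage:
    """Hypothetical stand-in for a browser page with infinite scroll."""

    def __init__(self, total: int, batch: int) -> None:
        self.total = total
        self.batch = batch
        self.loaded = min(total, batch)

    def scroll(self) -> None:
        # Each scroll reveals one more batch, up to the total.
        self.loaded = min(self.total, self.loaded + self.batch)

    def count(self) -> int:
        return self.loaded
```

With a real browser driver, scroll_once and count_items would wrap whatever scroll and element-count calls that driver provides. Passing them in as callables keeps the loop itself testable without a browser.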

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
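One way to cope with listings whose structure varies is to try several XPath patterns per field and take the first that matches. The sketch below illustrates that idea only; the markup, class names and patterns are invented for the example and are not YC's real DOM or Autonoly's selectors. It uses the limited XPath subset in Python's standard library, so a real page would need a proper HTML parser with fuller XPath support.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample markup: the second card uses a different
# tag for the name and has no location at all.
SAMPLE = """<div>
  <a class="company"><span class="name">Alpha</span><span class="location">Paris</span><p class="desc">Tools</p></a>
  <a class="company"><h3 class="name">Beta</h3><p class="desc">Robots</p></a>
</div>"""

# For each output column, an ordered list of fallback patterns (all assumed).
FIELD_PATTERNS = {
    "name": [".//span[@class='name']", ".//h3[@class='name']"],
    "location": [".//span[@class='location']"],
    "description": [".//p[@class='desc']"],
}


def first_text(node, patterns):
    """Return the text of the first pattern that matches, or an empty string."""
    for pattern in patterns:
        found = node.find(pattern)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company card, using the fallback patterns per field."""
    root = ET.fromstring(markup)
    cards = root.findall(".//a[@class='company']")
    return [
        {field: first_text(card, patterns) for field, patterns in FIELD_PATTERNS.items()}
        for card in cards
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELD_PATTERNS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Calling to_csv(extract_rows(SAMPLE)) yields a header line followed by one line per card. A missing field becomes an empty cell rather than dropping the row.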

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
