Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•10mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
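For readers who want to see the scroll step in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation (the platform is no-code). The function name, the `patience` early-stop rule, the 0.5 second delay, and the `FakePage` stand-in are all assumptions made for illustration; only the figure of about 150 iterations comes from the post. In a real run, `scroll_once` and `count_items` would wrap whatever browser driver you use.

```python
import time
from typing import Callable


def progressive_scroll(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_iterations: int = 150,      # "about 150" per the post
    delay_seconds: float = 0.5,     # assumed value; tune per site
    patience: int = 5,              # assumed early-stop rule, not from the post
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll until max_iterations is hit or no new items load for `patience` rounds."""
    last_count = count_items()
    stale_rounds = 0
    for _ in range(max_iterations):
        scroll_once()
        sleep(delay_seconds)        # give lazy-loaded entries time to render
        current = count_items()
        if current > last_count:
            last_count = current
            stale_rounds = 0
        else:
            stale_rounds += 1
            if stale_rounds >= patience:
                break               # nothing new is loading; stop early
    return last_count


class FakePage:
    """Hypothetical stand-in for a browser page: each scroll loads one more batch."""

    def __init__(self, total: int, batch: int) -> None:
        self.total = total
        self.batch = batch
        self.loaded = 0

    def scroll(self) -> None:
        self.loaded = min(self.total, self.loaded + self.batch)

    def count(self) -> int:
        return self.loaded


if __name__ == "__main__":
    page = FakePage(total=1000, batch=40)
    loaded = progressive_scroll(page.scroll, page.count, delay_seconds=0.0)
    print(f"loaded {loaded} entries")
```

Passing the scroll and count operations in as callables keeps the loop independent of any particular automation library and makes the timing logic easy to test without a browser.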

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
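One common way to cope with listings whose DOM varies is to try several XPath patterns per field and take the first that matches. The sketch below illustrates that idea on a made-up, well-formed fragment; the tag names, class names, and fallback patterns are hypothetical and do not reflect YC's actual markup or the selectors used in the post. It relies on the limited XPath subset in Python's standard `xml.etree.ElementTree`, so a real page would more likely need a browser driver or a full XPath engine.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical markup: the second entry nests its name and has no location.
SAMPLE = """
<div>
  <a class="company">
    <span class="name">Acme</span>
    <span class="desc">Rockets</span>
    <span class="loc">Austin, TX</span>
  </a>
  <a class="company">
    <div><span class="name">Beta</span></div>
    <span class="desc">Robots</span>
  </a>
</div>
"""

# Each field lists candidate XPath patterns, tried in order (all assumed).
FIELDS = {
    "name": [".//span[@class='name']"],
    "description": [".//span[@class='desc']"],
    "location": [".//span[@class='loc']", ".//span[@class='location']"],
}


def first_text(node, paths):
    """Return the text of the first pattern that matches, or '' if none do."""
    for path in paths:
        found = node.find(path)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company entry, tolerating missing fields."""
    root = ET.fromstring(markup.strip())
    return [
        {field: first_text(card, paths) for field, paths in FIELDS.items()}
        for card in root.findall(".//a[@class='company']")
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELDS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE)))
```

Returning an empty string for a missing field keeps every CSV row the same width, so a listing without a location does not shift the columns of later rows.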

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
