Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•8mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe-coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do: creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
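For readers who want to see the shape of step 2, here is a minimal sketch of a progressive scroll loop. It is not Autonoly's implementation: the function name, the `patience` early-stop rule, and the one-second default delay are all assumptions for illustration. The browser is abstracted behind two callables, so the loop itself has no driver dependency.

```python
import time
from typing import Callable


def scroll_until_loaded(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_scrolls: int = 150,   # the post mentions roughly 150 iterations
    delay_s: float = 1.0,     # assumed delay; the post only says "calibrated"
    patience: int = 3,        # assumed: stop after this many scrolls with no new items
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll until the item count stops growing or max_scrolls is reached.

    Returns the number of items loaded when the loop ends.
    """
    last_count = count_items()
    stalled = 0
    for _ in range(max_scrolls):
        scroll_once()
        sleep(delay_s)  # give the page time to fetch and render the next batch
        current = count_items()
        if current > last_count:
            last_count = current
            stalled = 0
        else:
            stalled += 1
            if stalled >= patience:
                break  # no growth for several scrolls: treat the list as complete
    return last_count
```

With a real driver, `scroll_once` would typically run a script that scrolls to the bottom of the page and `count_items` would count the elements matching the listing selector. The early stop is a design choice on top of the fixed iteration count described above: it avoids waiting out all 150 delays once the directory has finished loading.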

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
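One common way to cope with listings whose DOM structure varies is to try several XPath patterns per field and take the first that matches. The sketch below shows that idea together with the CSV step. The class names, tag names, and function names are hypothetical (the post does not show YC's markup or Autonoly's selectors), and it uses the limited XPath subset in Python's standard library, which needs well-formed markup; a real scraper would more likely use the browser's own XPath engine.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical XPath variants per field, tried in order.
# The real YC markup is not shown in the post.
FIELD_XPATHS = {
    "name": [".//span[@class='name']", ".//h3"],
    "description": [".//span[@class='tagline']", ".//p"],
    "location": [".//span[@class='location']"],
}


def first_match(node, xpaths):
    """Return stripped text from the first XPath that yields text, else ''."""
    for xp in xpaths:
        found = node.find(xp)
        if found is not None and found.text and found.text.strip():
            return found.text.strip()
    return ""


def extract_rows(markup, item_xpath=".//a[@class='company']"):
    """Parse well-formed markup and return one dict per company listing."""
    root = ET.fromstring(markup)
    return [
        {field: first_match(item, xps) for field, xps in FIELD_XPATHS.items()}
        for item in root.findall(item_xpath)
    ]


def to_csv(rows):
    """Serialize the extracted rows to CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(FIELD_XPATHS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

A field with no matching pattern comes back as an empty string rather than raising, so one unusual entry does not abort the whole export.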

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task: data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
