Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•11mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
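For anyone who wants to picture step 2 in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation. The function name, the `patience` early-exit, the 0.5 second delay, and the `FakePage` stand-in are all assumptions made up for illustration; only the roughly 150 iterations and the idea of a delay between scrolls come from the steps above.

```python
import time
from typing import Callable


def progressive_scroll(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_iterations: int = 150,      # "about 150" from the post
    delay_seconds: float = 0.5,     # assumed value; the post only says "calibrated"
    patience: int = 5,              # assumed: give up after 5 scrolls with no new items
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll repeatedly until the item count stops growing or the cap is hit.

    Returns the number of items visible when the loop ends.
    """
    last_count = count_items()
    stalled = 0
    for _ in range(max_iterations):
        scroll_once()
        sleep(delay_seconds)        # give lazy-loaded content time to render
        current = count_items()
        if current > last_count:
            last_count = current
            stalled = 0
        else:
            stalled += 1
            if stalled >= patience:
                break               # nothing new for a while: assume the list is complete
    return last_count


class FakePage:
    """Hypothetical stand-in for a browser page: each scroll reveals `batch` more items."""

    def __init__(self, total: int, batch: int) -> None:
        self.total = total
        self.batch = batch
        self.loaded = min(batch, total)
        self.scrolls = 0

    def scroll(self) -> None:
        self.scrolls += 1
        self.loaded = min(self.loaded + self.batch, self.total)

    def count(self) -> int:
        return self.loaded


if __name__ == "__main__":
    page = FakePage(total=1000, batch=40)
    loaded = progressive_scroll(page.scroll, page.count, sleep=lambda s: None)
    print(loaded, page.scrolls)
```

With a real browser driver such as Selenium, `scroll_once` could wrap `driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")` and `count_items` could count the elements matched by the listing selector. The early exit is a design choice: it avoids spending all 150 iterations once the directory has finished loading.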

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
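One common way to cope with listings whose DOM varies slightly is to try several XPath patterns per field and take the first that matches. The sketch below illustrates that idea with Python's standard library only. The class names, the fallback tags, and the two sample entries are invented for the example and do not reflect YC's actual markup or Autonoly's selectors. `xml.etree.ElementTree` supports only a subset of XPath and needs well-formed input, so a real scraper would more likely rely on the browser's own XPath engine or a library such as lxml.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical selectors: each field lists XPath patterns to try in order.
FIELD_XPATHS = {
    "name": [".//span[@class='company-name']", ".//h3"],
    "description": [".//span[@class='tagline']", ".//p"],
    "location": [".//span[@class='location']"],
}

# Invented sample markup: the second entry uses a different structure
# and has no location, mimicking entries that vary slightly.
SAMPLE = """
<div>
  <a class="company"><span class="company-name">Example Co</span><span class="tagline">Widgets as a service</span><span class="location">Berlin</span></a>
  <a class="company"><h3>Sample Labs</h3><p>Tools for testing</p></a>
</div>
"""


def first_text(node: ET.Element, xpaths: list[str]) -> str:
    """Return the stripped text of the first pattern that matches, else ''."""
    for xpath in xpaths:
        found = node.find(xpath)
        if found is not None and found.text and found.text.strip():
            return found.text.strip()
    return ""


def extract_rows(markup: str, item_xpath: str = ".//a[@class='company']") -> list[dict]:
    """Build one dict per listing, filling missing fields with empty strings."""
    root = ET.fromstring(markup)
    return [
        {field: first_text(item, xpaths) for field, xpaths in FIELD_XPATHS.items()}
        for item in root.findall(item_xpath)
    ]


def to_csv(rows: list[dict]) -> str:
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELD_XPATHS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE)))
```

Keeping the patterns in a table makes it cheap to add another fallback when a new entry layout shows up, and a missing field becomes an empty CSV cell instead of a failed run.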

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
