Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•8mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
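For anyone who wants to picture step 2 in code, here is a minimal sketch of a progressive scroll loop. It is not Autonoly's implementation. The function name, the `patience` early-stop rule, and the `FakePage` stand-in are all assumptions made for illustration; only the 150-iteration cap and the idea of a delay between scrolls come from the post. In a real run, `scroll_once` and `count_items` would be backed by whatever browser automation library you use.

```python
import time
from typing import Callable


def progressive_scroll(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_iterations: int = 150,      # cap taken from the post
    delay_seconds: float = 0.5,     # assumed value; tune per site
    patience: int = 5,              # assumed early-stop rule, not from the post
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll repeatedly and return the number of items loaded.

    Stops at max_iterations, or earlier if `patience` consecutive
    scrolls load nothing new.
    """
    last_count = count_items()
    stalled = 0
    for _ in range(max_iterations):
        scroll_once()
        sleep(delay_seconds)  # give the page time to fetch the next batch
        current = count_items()
        if current > last_count:
            last_count = current
            stalled = 0
        else:
            stalled += 1
            if stalled >= patience:
                break
    return last_count


class FakePage:
    """Hypothetical stand-in for a browser page: each scroll loads one batch."""

    def __init__(self, total: int, batch: int = 10) -> None:
        self.total = total
        self.batch = batch
        self.loaded = 0
        self.scrolls = 0

    def scroll(self) -> None:
        self.scrolls += 1
        self.loaded = min(self.total, self.loaded + self.batch)

    def count(self) -> int:
        return self.loaded


if __name__ == "__main__":
    page = FakePage(total=1000)
    loaded = progressive_scroll(page.scroll, page.count, sleep=lambda s: None)
    print(loaded, page.scrolls)
```

Passing the scroll and count operations in as callables keeps the loop independent of any particular browser driver, and lets the timing logic be exercised without a network.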

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
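To make the "DOM structure varies slightly" problem concrete, here is a rough sketch of one way to handle it: try several XPath patterns per field and take the first that matches, then write the rows to CSV. The sample markup, class names, and field list are invented for this example and do not reflect YC's actual page structure or Autonoly's selectors. It uses Python's standard-library `xml.etree.ElementTree`, which supports only a subset of XPath and requires well-formed markup, so a real scraper would more likely evaluate the expressions in the browser.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical markup: the second listing nests its name and uses <p> for
# the description, mimicking small structural differences between entries.
SAMPLE = """
<div>
  <a class="company">
    <span class="name">Acme</span>
    <span class="desc">Rockets</span>
    <span class="loc">Austin, TX</span>
  </a>
  <a class="company">
    <div class="header"><span class="name">Globex</span></div>
    <p class="desc">Energy</p>
  </a>
</div>
"""

# For each output column, a list of XPath patterns tried in order.
FIELDS = {
    "name": [".//span[@class='name']"],
    "description": [".//span[@class='desc']", ".//p[@class='desc']"],
    "location": [".//span[@class='loc']"],
}


def first_match(node, xpaths):
    """Return stripped text from the first pattern that matches, else ''."""
    for xpath in xpaths:
        text = node.findtext(xpath)
        if text and text.strip():
            return text.strip()
    return ""


def extract_rows(markup):
    """Parse the markup and build one dict per company listing."""
    root = ET.fromstring(markup)
    cards = root.findall(".//a[@class='company']")
    return [
        {field: first_match(card, xpaths) for field, xpaths in FIELDS.items()}
        for card in cards
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELDS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE)))
```

Missing fields come out as empty strings rather than raising, so one oddly structured listing does not abort the whole export.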

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
