frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•7mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Intel's 'Panther Lake' Core Ultra Laptop Chips Are Ready for Prime Time

https://www.pcmag.com/news/intel-panther-lake-core-ultra-laptop-chips-details-ces-2026
1•taubek•13s ago•0 comments

Give it Up, Turn it Loose

https://thinkhuman.com/give-it-up-turn-it-loose/
1•jamesgill•1m ago•0 comments

Triton Extensions: a framework for developing and building compiler extensions

https://github.com/triton-lang/triton-ext
1•matt_d•1m ago•0 comments

Seized by US: why so much interest in a rusty tanker in the Atlantic?

https://www.theguardian.com/world/2026/jan/07/marinera-seized-tanker-atlantic-us-uk-russia
1•n1b0m•1m ago•0 comments

Show HN: I built a tool to stop pretending I understood research papers

https://papersplain.com
1•jjoe•2m ago•0 comments

The Psychology of Stranger Things

https://allaboutpsychology.substack.com/p/the-psychology-of-stranger-things
1•rendx•2m ago•0 comments

Show HN: FightHOAFines – An AI agent that reads bylaws to dispute HOA violations

https://fighthoafines.com/
1•todaycompanies•4m ago•1 comments

The Giant Hoax of Shadow of the Colossus [video]

https://www.youtube.com/watch?v=NvGZLMUx7AM
1•crtasm•6m ago•0 comments

OpenCore Legacy Patcher – Experience macOS just like before

https://github.com/dortania/OpenCore-Legacy-Patcher
1•petethomas•6m ago•0 comments

NPM to Implement Staged Publishing After Turbulent Shift Off Classic Tokens

https://socket.dev/blog/npm-to-implement-staged-publishing
1•feross•6m ago•0 comments

WikiFlix

https://wikiflix.toolforge.org/#/
1•Tomte•7m ago•0 comments

Fun with Algebraic Effects – From Toy Examples to Hardcaml Simulations

https://blog.janestreet.com/fun-with-algebraic-effects-hardcaml/
1•i_don_t_know•7m ago•0 comments

Show HN: Clean normalised US equity fundamentals via API (free tier available)

https://finqual.app
2•myztika•10m ago•1 comments

Solving a snaky math problem with Mathematica

https://leancrew.com/all-this/2025/12/solving-a-snaky-math-problem-with-mathematica/
2•surprisetalk•11m ago•0 comments

Single Sign on for Furries

https://cendyne.dev/posts/2025-08-15-single-sign-on-for-furries.html
5•surprisetalk•11m ago•1 comments

Claude Opus 4.5 disappears suddenly from GitHub Copilot

https://github.com/orgs/community/discussions/181266
4•tantona•11m ago•1 comments

FAA signs radar deals to drag US air traffic control out of the 1980s

https://www.theregister.com/2026/01/07/faa_radar_atc_deals/
1•holysoles•12m ago•0 comments

Hardening eBPF for Runtime Security: Lessons from Datadog Workload Protection

https://www.datadoghq.com/blog/engineering/ebpf-workload-protection-lessons/
1•tanelpoder•13m ago•0 comments

DeepSeek-R1 paper updated from 22 pages to 86 with additional details

https://old.reddit.com/r/LocalLLaMA/comments/1q6c9wc/deepseekr1s_paper_was_updated_2_days_ago/
2•ksymph•13m ago•1 comments

Reflections on the Caplan-Bruenig Poverty Debate

https://www.betonit.ai/p/reflections-on-the-caplan-bruenig
1•paulpauper•14m ago•0 comments

Show HN: KyubiSweep – Fast, local secret scanner written in Go (visual reports)

https://github.com/tanmayshahane/kyubisweep
1•tanmay_shahane•15m ago•1 comments

Watch me run malware from NPM [video]

https://www.youtube.com/watch?v=GqnFNNcycxQ
1•todsacerdoti•16m ago•0 comments

Getting started with Claude for software development

https://steveklabnik.com/writing/getting-started-with-claude-for-software-development/
1•steveklabnik•18m ago•0 comments

NotepadNext – Cross-platform reimplementation of Notepad++

https://github.com/dail8859/NotepadNext
1•ethanpil•18m ago•0 comments

Kafka Inc

https://libertiesjournal.com/online-articles/kafkainc/
1•Caiero•19m ago•0 comments

FlashInfer-Bench: Building the Virtuous Cycle for AI-Driven LLM Systems

https://arxiv.org/abs/2601.00227
1•matt_d•21m ago•0 comments

A modular marketing command center built with autonomous workflows

https://flippa.com/12205760-vect-ai-is-an-autonomous-marketing-command-center-where-ai-agents-pla...
3•WoWSaaS•21m ago•0 comments

Predict Your House Price

https://www.bloomberg.com/opinion/newsletters/2026-01-06/predict-your-house-price
1•feross•22m ago•0 comments

Show HN: Sumoffy (macOS) – Offline Document Intelligence You Can Trust

https://rokontech.gumroad.com/l/sumoffy
2•rokontech•23m ago•0 comments

Vect AI: treating marketing execution as software, not a stack of tools

https://vect.pro/
2•MMAFRAZ•24m ago•1 comments