Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•9mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory.
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load).
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing.
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV.
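For readers who want to see the scroll step in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation. `scroll_once` and `count_items` are hypothetical callables that would wrap whatever browser driver is in use, and the early exit after a few scrolls with no new entries is my own addition on top of the fixed ~150 iterations described above.

```python
import time
from typing import Callable


def scroll_until_loaded(
    scroll_once: Callable[[], None],   # hypothetical: scrolls the page down one step
    count_items: Callable[[], int],    # hypothetical: returns how many listings are in the DOM
    max_scrolls: int = 150,            # upper bound, matching the ~150 iterations above
    delay_s: float = 1.0,              # assumed delay; the real value needs calibrating
    patience: int = 3,                 # assumed: stop after this many scrolls with no growth
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll until the item count stops growing or max_scrolls is reached."""
    last_count = count_items()
    stalled = 0
    for _ in range(max_scrolls):
        scroll_once()
        sleep(delay_s)  # give lazily loaded entries time to render
        current = count_items()
        if current > last_count:
            last_count = current
            stalled = 0
        else:
            stalled += 1
            if stalled >= patience:
                break  # nothing new after several scrolls: assume the list is complete
    return last_count
```

Passing the driver calls in as functions keeps the loop independent of any particular automation library, and it means the timing logic can be exercised against a fake page before pointing it at a real one.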

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
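One common way to cope with entries whose structure varies is to try several XPath patterns per field and take the first that matches. The Python sketch below illustrates that idea under stated assumptions: the markup in `SAMPLE`, the class names, and the field list are all invented for illustration and are not YC's real DOM or the selectors I used. It relies on the standard library's `xml.etree.ElementTree`, which supports only a subset of XPath and requires well-formed markup, so a real page would need a full HTML-aware XPath engine.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Invented sample markup: the second entry uses a different tag for the name
# and has no location, to mimic entries whose structure varies.
SAMPLE = """
<div id="results">
  <a class="company"><span class="name">Acme</span><span class="desc">Rockets</span><span class="loc">Austin, TX</span></a>
  <a class="company"><h3>Globex</h3><span class="desc">Energy</span></a>
</div>
"""

# Hypothetical selectors: each field lists XPath candidates in priority order.
FIELDS = {
    "name": [".//span[@class='name']", ".//h3"],
    "description": [".//span[@class='desc']"],
    "location": [".//span[@class='loc']"],
}


def first_match(node, xpaths):
    """Return the text of the first XPath candidate that matches, else ''."""
    for xp in xpaths:
        found = node.find(xp)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company card, tolerating missing fields."""
    root = ET.fromstring(markup.strip())
    return [
        {field: first_match(card, xpaths) for field, xpaths in FIELDS.items()}
        for card in root.findall(".//a[@class='company']")
    ]


def to_csv(rows):
    """Serialize the extracted rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(FIELDS))
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

Calling `to_csv(extract_rows(SAMPLE))` yields a header plus one line per entry. A field that no candidate matches comes out as an empty cell instead of raising, which makes gaps easy to spot in the output.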

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
