frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•7mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Can I use HTTPS RRs?

https://www.netmeister.org/blog/https-caniuse.html
1•fanf2•4m ago•0 comments

How much AI do we need, really?

https://newsletter.alastairrushworth.com/p/how-much-ai-do-we-need-really
1•alastairr•7m ago•0 comments

Tell HN: Cloudflare now censors Polymarket in Germany

2•baobabKoodaa•8m ago•1 comments

The secretive world of North Korean science fiction (2023)

https://arstechnica.com/culture/2023/08/the-strange-secretive-world-of-north-korean-science-fiction/
2•doener•15m ago•0 comments

Windows 3.1 in the Browser

https://www.pcjs.org/software/pcx86/sys/windows/3.10/
2•memalign•16m ago•0 comments

Show HN: Hands on tutorial for open source contribution

https://github.com/firstcontributions/first-contributions
2•promptmike•20m ago•0 comments

New Solitaire Gaming Website

https://www.trysolitaire.com
1•ssmallya•23m ago•0 comments

Show HN: League of Legends AI Assistant (OpenSource)

https://github.com/sorena-ai/LeagueAiCoach
1•legalcriminal•26m ago•0 comments

Gemini with Thinking 3 Pro can't script multi-line string replacement

1•YouAreWRONGtoo•30m ago•0 comments

Can We Really Claim That Civilization is on the Steady Path of Progress?

https://lithub.com/can-we-really-claim-that-civilization-is-on-the-steady-path-of-progress/
3•robtherobber•37m ago•0 comments

Commonplace Book

https://en.wikipedia.org/wiki/Commonplace_book
2•tosh•38m ago•0 comments

Sperm donor with cancer-causing gene fathers nearly 200 children

https://scienceclock.com/sperm-donor-carrying-rare-cancer-causing-gene-fathers-nearly-200-children/
1•ashishgupta2209•38m ago•0 comments

Surgical Masks and Viral Transmission

https://rodgercuddington.substack.com/p/surgical-masks-and-viral-transmission
2•freespirt•38m ago•0 comments

Ask HN: Is there a local dev tool you wish existed because of a repeating issue?

1•johnbros•40m ago•0 comments

Revolutionary gene therapy brings hope of leukaemia cure [video]

https://www.youtube.com/watch?v=IuWFVWwesSE
1•mgh2•46m ago•0 comments

Flow depression treatment now FDA approved

https://www.flowneuroscience.com/fda-approved-lp-2/
1•antfarm•46m ago•0 comments

Oilwell is a wellness app to help you embrace climate chaos

https://oilwell.app/
2•doener•48m ago•0 comments

Show HN: This week we shipped 'Surfaces' on rynk.io

https://twitter.com/farsn_/status/1999764184729551073
1•thefarseen•49m ago•0 comments

Breaking Down Trump's 2025 National Security Strategy

https://www.brookings.edu/articles/breaking-down-trumps-2025-national-security-strategy/
1•thomassmith65•49m ago•0 comments

Ask HN: is Archive.is a Kremlin Asset?

4•leoh•57m ago•1 comments

Kpython – A MicroPython Sidecar for the Linux Kernel (Experimental)

https://github.com/pymergetic/kpython
2•kpython•1h ago•1 comments

What is "involution", China's race-to-the-bottom competition trend

https://www.reuters.com/business/autos-transportation/what-is-involution-chinas-race-to-the-botto...
3•bill38•1h ago•0 comments

50 Years of Proof Assistants

https://lawrencecpaulson.github.io/2025/12/05/History_of_Proof_Assistants.html
1•thunderbong•1h ago•0 comments

Show HN: Clai – Unixlike CLI context feeder for LLMs. Now with recursive tooling

https://github.com/baalimago/clai
1•baalimago•1h ago•0 comments

Technology Radar

https://www.thoughtworks.com/radar
1•pykello•1h ago•0 comments

Public Prompt License (PPL) – prompt-native licensing for LLM prompts

https://shipfail.github.io/public-prompt-license/
1•huan42•1h ago•0 comments

Hetz Demo – build HTML tables online, copy code instantly (no signup)

https://hetz.ct.ws/demo-table/
1•aminekhd•1h ago•3 comments

The military's new AI says boat strike 'unambiguously illegal'

https://san.com/cc/the-militarys-new-ai-says-hypothetical-boat-strike-scenario-unambiguously-ille...
6•saubeidl•1h ago•0 comments

Android PSA: have at least 2 ways of taking your files (especially pictures) out

https://old.reddit.com/r/DataHoarder/comments/1hvowp3/psa_have_at_least_2_ways_of_taking_your_files/
1•sipofwater•1h ago•4 comments

Computer Animator and Amiga fanatic Dick Van Dyke turns 100

7•ggm•1h ago•3 comments