frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•7mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Nintendo Switch 2 RAM prices rise 41%, NAND flash up 8% – shares nosedive

https://www.tomshardware.com/video-games/nintendo/nintendo-switch-2-ram-prices-rise-41-percent-na...
1•speckx•2m ago•0 comments

SQLite JSON at Full Index Speed Using Generated Columns

https://www.dbpro.app/blog/sqlite-json-virtual-columns-indexing
1•upmostly•3m ago•0 comments

Breaking Paragraphs into Lines [pdf]

https://gwern.net/doc/design/typography/tex/1981-knuth.pdf
1•Smaug123•3m ago•1 comments

The Best Big Media Merger Is No Merger at All

https://www.eff.org/deeplinks/2025/12/best-big-media-merger-no-merger-all
1•hn_acker•3m ago•0 comments

Thousands Tell the Patent Office: Don't Hide Bad Patents from Review

https://www.eff.org/deeplinks/2025/12/thousands-tell-patent-office-dont-hide-bad-patents-review
2•hn_acker•4m ago•0 comments

Fedora: Open-source repository for long-term digital preservation

https://fedorarepository.org/
1•cernocky•5m ago•0 comments

I can't draw, so I made a website (to vent UK politics)

https://royalphallicsociety.uk/
1•ykurtov•9m ago•1 comments

Why Isn't Online Age Verification Just Like Showing Your ID in Person?

https://www.eff.org/deeplinks/2025/12/why-isnt-online-age-verification-just-showing-your-id-person
2•hn_acker•11m ago•0 comments

Flipping the NIS2 Switch: What Germany Implementation Means for 2026 Compliance

https://www.mofo.com/resources/insights/251208-flipping-the-nis2-switch-what-germanys-implementation
1•rbanffy•11m ago•0 comments

How to Run Ministral 3 with an AMD GPU on Windows

https://www.50-nuances-octets.fr/en/posts/ministral-3-gpu-amd-windows/
2•Sykursen•12m ago•0 comments

Why Text in Vampire Survivors Used to Look Weird

https://jslegenddev.substack.com/p/vampire-survivors-text-weird
1•ibobev•13m ago•0 comments

Next-gen supersonic jet engine gets a less glamorous job

https://newatlas.com/energy/boom-supersonic-engine-data-center/
1•breve•15m ago•0 comments

Genetic study reveals hidden links between psychiatric conditions

https://www.nature.com/articles/d41586-025-04037-w
3•bookofjoe•16m ago•1 comments

Dyalog APL: Our (Not So) Secret Ingredient [video]

https://www.youtube.com/watch?v=hnz6wjshRNc
3•pillowshift•18m ago•1 comments

Why Nobody Reads Anymore (and What That Says About Us)

https://mackleen.substack.com/p/why-nobody-reads-anymore-and-what
1•speckx•18m ago•0 comments

It's end-of-year concert season. Why do kids struggle with performance anxiety?

https://medicalxpress.com/news/2025-12-year-concert-season-kids-struggle.html
2•PaulHoule•19m ago•0 comments

FDA Approves First Gene Therapy for WAS. First Gene Therapy from a Non-Profit

https://www.fda.gov/news-events/press-announcements/fda-approves-first-gene-therapy-treatment-wis...
1•amarcheschi•19m ago•0 comments

A Chinese whistleblower living in the US is being hunted by Beijing with US tech

https://apnews.com/article/whistleblower-china-surveillance-tech-silicon-valley-adbd0bcfbb0892bfc...
6•alsdkfjas•20m ago•0 comments

A Plan for 5-10%* Faster Free-Threaded JIT by Python 3.16

https://fidget-spinner.github.io/posts/faster-jit-plan.html
1•Qem•22m ago•0 comments

Optimizing Mannequin

https://real-mrbeam.github.io/2025/12/11/Optimizing-Mannequin.html
1•ibobev•26m ago•0 comments

Chinese foundry SMIC achieves 5nm production without EUV tools

https://www.techpowerup.com/344000/chinese-smic-achieves-5-nm-production-on-n-3-node-without-euv-...
9•jsheard•28m ago•0 comments

Show HN: Work Simulation for developer evaluation instead of DSA and take-homes

https://imported-lush-slug.clueso.site/share/130f9789-54b9-41e4-b305-e5ecfe5e27fa
2•rishitchat•29m ago•0 comments

TikTok algorithm favors mental health content, analysis finds

https://www.washingtonpost.com/technology/interactive/2025/tiktok-algorithm-mental-health/
1•reaperducer•32m ago•0 comments

Chatbot-powered toys rebuked for discussing sexual, dangerous topics with kids

https://arstechnica.com/gadgets/2025/12/chatbot-powered-toys-rebuked-for-discussing-sexual-danger...
2•smurda•32m ago•0 comments

A New History of Arabia, Written in Stone (2018)

https://www.newyorker.com/culture/culture-desk/a-new-history-of-arabia-written-in-stone
1•janandonly•34m ago•0 comments

Climate Nobel Prize

https://climatenobelprize.org/
3•Vinnl•34m ago•0 comments

Magic Eye Tetris

https://www.lutanho.net/play/magiceyetetris5.html
1•JelteF•36m ago•0 comments

Show HN: Epstein's emails reconstructed in a message-style UI (OCR and LLMs)

https://github.com/Toon-nooT/epsteins-phone-reconstructed
7•toon-noot•37m ago•0 comments

How to break free from smart TV ads and tracking

https://arstechnica.com/gadgets/2025/12/the-ars-technica-guide-to-dumb-tvs/
13•fleahunter•37m ago•3 comments

Air passengers exposed to high levels of ultrafine particle pollution

https://www.theguardian.com/environment/2025/dec/12/air-passengers-extremely-high-levels-ultrafin...
3•belter•37m ago•0 comments