frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•6mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Study: Déjà Vu is spatial familiarity, not prediction illusion

https://www.psypost.org/new-psychology-research-sheds-light-on-the-mystery-of-deja-vu/
1•DrierCycle•1m ago•0 comments

AutoBlinds: Smart home device that moves roller shades up and down

https://github.com/apanteleev/autoblinds
1•klaussilveira•1m ago•0 comments

Read Something Wonderful (About Biology)

https://read.asimov.com
1•mailyk•4m ago•0 comments

In Praise of DHH

https://okayfail.com/2025/in-praise-of-dhh.html
1•birdculture•5m ago•0 comments

Nano Banana Pro has been released, come and try it for free

https://nanobananaproai.io
1•sinpor1•5m ago•1 comments

Show HN: Fulfilled – Non-custodial financial co-pilot for goal optimization

https://matthew-glossops-workspace.share.arcade.software/share/iiL0WyFF1O1iSlSi1TGg
1•mattglossop•6m ago•0 comments

Switching to Rust's own mangling scheme on nightly

https://blog.rust-lang.org/2025/11/20/switching-to-v0-mangling-on-nightly/
2•ingve•6m ago•0 comments

32-Bit Integer Multiplication on Tenstorrent

https://www.jasondavies.com/2025/tenstorrent-multiply-int32/
1•jasondavies•7m ago•0 comments

Eastern Shipbuilding Suspends Work on Coast Guard's Offshore Patrol Cutters

https://gcaptain.com/eastern-shipbuilding-suspends-work-on-coast-guards-offshore-patrol-cutter-pr...
1•speckx•9m ago•0 comments

How Visa Actually Works

https://nandinfinitum.com/posts/visa/
1•nanfinitum•12m ago•0 comments

Florida nonprofit news reporters ask board to investigate their editor's AI use

https://www.niemanlab.org/2025/11/florida-nonprofit-news-reporters-ask-board-to-investigate-their...
1•danso•12m ago•0 comments

Show HN: Yonoma – Behavior based email automation for SaaS

3•vimall_10•14m ago•1 comments

Practice on Long Behavior Sequence Modeling in Tencent Advertising

https://arxiv.org/abs/2510.21714
1•PaulHoule•14m ago•0 comments

Systems design 3: LLMs and the semantic revolution

https://apenwarr.ca/log/20251120
1•goranmoomin•15m ago•0 comments

Show HN: Tangent – Open-source security data pipeline

https://github.com/telophasehq/tangent
2•ethanblackburn•16m ago•1 comments

The HTML Tags Everybody Hated (2017)

https://thehistoryoftheweb.com/blink-marquis-tag/
1•freedomben•16m ago•0 comments

1984 Swedish Hotline – World's First Social Network

https://medium.com/@RetroTechShow/1984-swedish-hotline-possibly-the-worlds-first-accessible-socia...
1•michalpleban•16m ago•0 comments

Against Apologising

https://cjlm.ca/posts/against-apologising/
1•speckx•16m ago•1 comments

More than half of UK novelists believe AI will replace their work

https://www.theguardian.com/books/2025/nov/20/more-than-half-of-uk-novelists-believe-ai-will-repl...
1•bookofjoe•16m ago•1 comments

Show HN: Code Mode for MCP in MCP-use's client

1•pzullo•18m ago•0 comments

Windows 1 was released 40 years ago

https://videocardz.com/newz/windows-1-was-released-40-years-ago
2•speckx•19m ago•0 comments

The Firefly and the Pulsar

https://www.centauri-dreams.org/2025/11/20/the-firefly-and-the-pulsar/
2•JPLeRouzic•20m ago•0 comments

MacKenzie Scott Gives $700M to Historically Black Colleges

https://www.nytimes.com/2025/11/17/us/hbcus-mackenzie-scott-donations.html
2•bookofjoe•20m ago•1 comments

Show HN: A step-by-step guide for push notifications on iOS, Android, + Rails

https://newsletter.masilotti.com/p/hotwire-native-deep-dive-push-notifications
1•joemasilotti•22m ago•0 comments

Jimdo use LangChain to power personalized business guidance at scale

https://blog.langchain.com/customers-jimdo/
2•yaaan•22m ago•0 comments

Pornhub Is Urging Tech Giants to Enact Device-Based Age Verification

https://www.wired.com/story/pornhub-is-urging-tech-giants-to-enact-device-based-age-verification/
3•basisword•23m ago•3 comments

US jobs saw surprising jump in September after slow summer

https://www.bbc.com/news/articles/cvg423n377lo
2•onemoresoop•24m ago•0 comments

GoDaddy launches ANS API and standards site for verifiable agent identity

https://aboutus.godaddy.net/newsroom/news-releases/press-release-details/2025/GoDaddy-advances-tr...
2•tmuhlestein•26m ago•1 comments

The lowercase aesthetic as cultural liberation

https://www.humaninvariant.com/blog/lowercase
1•radeeyate•26m ago•0 comments

UK Government Unveils England's First Ever Men's Health Strategy

https://www.gov.uk/government/news/government-unveils-englands-first-ever-mens-health-strategy
1•robtherobber•26m ago•0 comments