
Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•7mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
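For readers who want to see roughly what step 2 looks like in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation: `progressive_scroll`, `FakePage`, and every parameter name are hypothetical, and the early exit when no new items appear is my own addition on top of the fixed ~150 iterations described above. The scroll and count actions are passed in as callables, so the loop could be wired to whatever browser driver is in use.

```python
import time
from typing import Callable


def progressive_scroll(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_scrolls: int = 150,      # upper bound, matching the ~150 iterations in the post
    delay_s: float = 1.0,        # assumed pause that lets lazy-loaded items render
    patience: int = 3,           # assumed: stop after this many scrolls with no growth
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll until max_scrolls is hit or the item count stops growing."""
    last_count = count_items()
    stale_rounds = 0
    for _ in range(max_scrolls):
        scroll_once()
        sleep(delay_s)
        current = count_items()
        if current > last_count:
            last_count = current
            stale_rounds = 0
        else:
            stale_rounds += 1
            if stale_rounds >= patience:
                break
    return last_count


class FakePage:
    """Stand-in for a real browser page: each scroll reveals one more batch."""

    def __init__(self, total: int, batch: int) -> None:
        self.total = total
        self.batch = batch
        self.loaded = 0
        self.scrolls = 0

    def scroll(self) -> None:
        self.scrolls += 1
        self.loaded = min(self.total, self.loaded + self.batch)

    def count(self) -> int:
        return self.loaded
```

With a real driver, `scroll_once` would typically run a scroll-to-bottom script and `count_items` would count the elements matched by the listing selector. The `sleep` argument exists only so the delay can be skipped in tests.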

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
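One common way to cope with DOM structure that varies between entries is to try several candidate XPath expressions per field and keep the first that matches. The sketch below illustrates that idea, plus the CSV step, using only the Python standard library. The markup, class names, and paths are invented for illustration and are not YC's actual DOM or Autonoly's selectors. `xml.etree.ElementTree` supports only a small XPath subset and needs well-formed input, so a real scraper would run its XPath inside the browser or through a full HTML parser.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical markup: the second entry nests its name one level deeper
# and has no location, mimicking the structural variation described above.
SAMPLE = """
<div>
  <a class="company">
    <span class="name">Acme</span>
    <span class="desc">Rockets</span>
    <span class="loc">Austin, TX</span>
  </a>
  <a class="company">
    <div class="header"><span class="name">Bolt</span></div>
    <span class="desc">Payments</span>
  </a>
</div>
"""

# Candidate paths per field, tried in order (all assumed for this example).
FIELD_PATHS = {
    "name": ["./span[@class='name']", "./div/span[@class='name']"],
    "description": ["./span[@class='desc']"],
    "location": ["./span[@class='loc']"],
}


def first_match(node: ET.Element, paths: list[str]) -> str:
    """Return the text of the first path that matches, or '' if none do."""
    for path in paths:
        found = node.find(path)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup: str) -> list[dict[str, str]]:
    """Build one dict per company listing found in the markup."""
    root = ET.fromstring(markup.strip())
    return [
        {field: first_match(card, paths) for field, paths in FIELD_PATHS.items()}
        for card in root.findall(".//a[@class='company']")
    ]


def to_csv(rows: list[dict[str, str]]) -> str:
    """Serialize the rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(FIELD_PATHS))
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()
```

Missing fields come back as empty strings rather than raising, so one oddly structured listing does not abort the whole export.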

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
