
Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•11mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory.
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load).
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing.
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV.
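For readers who want to see the scroll step in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation: the function name, the `patience` early-stop rule, and the `FakePage` stand-in are all assumptions made for illustration. In a real run, `scroll_once` and `count_items` would be supplied by whatever browser automation driver is in use.

```python
import time
from typing import Callable


def scroll_until_loaded(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_scrolls: int = 150,   # the post mentions roughly 150 iterations
    delay_s: float = 0.5,     # hypothetical delay; the post only says "calibrated"
    patience: int = 3,        # assumption: stop after 3 scrolls with no new items
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll repeatedly and return the number of items loaded.

    Stops early once the item count has not grown for `patience`
    consecutive scrolls, otherwise gives up after `max_scrolls`.
    """
    seen = count_items()
    stalled = 0
    for _ in range(max_scrolls):
        scroll_once()
        sleep(delay_s)  # give the page time to fetch the next batch
        current = count_items()
        if current > seen:
            seen = current
            stalled = 0
        else:
            stalled += 1
            if stalled >= patience:
                break
    return seen


class FakePage:
    """Hypothetical stand-in for a browser page that loads items in batches."""

    def __init__(self, total: int, batch: int) -> None:
        self.total = total
        self.batch = batch
        self.loaded = batch
        self.scrolls = 0

    def scroll(self) -> None:
        self.scrolls += 1
        self.loaded = min(self.total, self.loaded + self.batch)

    def count(self) -> int:
        return self.loaded


if __name__ == "__main__":
    page = FakePage(total=1000, batch=40)
    loaded = scroll_until_loaded(page.scroll, page.count, delay_s=0.0)
    print(loaded, page.scrolls)
```

Stopping when the count stalls, rather than always running a fixed number of iterations, is one possible way to reduce the timing sensitivity; a fixed count with tuned delays, as described in the post, is the simpler alternative.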

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
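One common way to cope with listings whose structure varies is to try several XPath patterns per field and take the first match. The sketch below illustrates that idea with Python's standard library only. The markup, class names, and field patterns are invented for the example and do not reflect YC's actual DOM or the selectors Autonoly uses. `xml.etree.ElementTree` supports only a limited XPath subset and needs well-formed markup, so real pages would likely need a proper HTML parser.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample markup with two slightly different layouts.
SAMPLE = """
<div>
  <a class="company">
    <span class="name">Acme</span>
    <span class="desc">Rockets</span>
    <span class="loc">Austin, TX</span>
  </a>
  <a class="company">
    <h3>Beta</h3>
    <span class="desc">Payments</span>
  </a>
</div>
"""

# For each output field, a list of XPath patterns tried in order (assumed names).
FIELDS = {
    "name": [".//span[@class='name']", ".//h3"],
    "description": [".//span[@class='desc']"],
    "location": [".//span[@class='loc']"],
}


def first_text(node: ET.Element, paths: list[str]) -> str:
    """Return the text of the first pattern that matches, or '' if none do."""
    for path in paths:
        found = node.find(path)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup: str) -> list[dict[str, str]]:
    """Extract one dict per company listing from the markup."""
    root = ET.fromstring(markup.strip())
    return [
        {field: first_text(card, paths) for field, paths in FIELDS.items()}
        for card in root.findall(".//a[@class='company']")
    ]


def to_csv(rows: list[dict[str, str]]) -> str:
    """Serialize the extracted rows as CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(FIELDS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE)))
```

Missing fields come back as empty strings rather than raising, so one oddly structured listing does not abort the whole export.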

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
