frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

All Roads Lead to Om

https://ma.tt/2026/06/om-forever/
1•speckx•1m ago•0 comments

The Demoralization of the White-Collar Worker – No One's Happy

https://nooneshappy.com/article/the-demoralization-of-the-white-collar-worker/
1•diebillionaires•1m ago•0 comments

IngrediCheck

https://www.ingredicheck.app/
1•fungeellc•2m ago•0 comments

Prompts in Manuscripts Exploit AI-Assisted Peer Review

https://cacm.acm.org/opinion/hidden-prompts-in-manuscripts-exploit-ai-assisted-peer-review/
1•adunk•2m ago•0 comments

Government Information Belongs to Everyone: Democracy's Library in 2026

https://blog.archive.org/2026/06/22/government-information-belongs-to-everyone-democracys-library...
1•toomuchtodo•3m ago•0 comments

A glitch in February of the year 0

https://28times.com/blog/2026-06-26-february-of-the-year-0
1•lukasgelbmann•4m ago•0 comments

No-Slop OSS, a checklist of contribution best practices when using AI (or not)

https://github.com/omkar-foss/noslop-oss
1•omkar-foss•5m ago•0 comments

Show HN: uvx ptn and give your agent full access to any system (dangerously)

https://pypi.org/project/ptn/
1•yxl448•6m ago•0 comments

Printable Wrist Rest System – ZSA Voyager – Zsa.io

https://www.zsa.io/voyager/wrist-rest-system?mc_cid=52feb8f1db&mc_eid=c352ca6cba
1•tortilla•7m ago•0 comments

Provisional Voice API Agents with Telnyx

https://github.com/team-telnyx/telnyx-code-examples/tree/main/provisional-telnyx-voice-api-agents...
1•anushathukral•7m ago•0 comments

A month of vibe-coding at 0.01x velocity

https://webesque.agency/blog/2026-06-19-llms.html
2•mhitza•8m ago•0 comments

Michael Milken's Spreadsheets: Computation and Charisma in Finance in the '80s

https://ieeexplore.ieee.org/document/9051785/
1•toomuchtodo•9m ago•1 comments

Benchmarking AI Gateways: GoModel vs. LiteLLM vs. Portkey vs. Bifrost

https://enterpilot.io/blog/benchmarking-ai-gateways-gomodel-litellm-portkey-bifrost-june-2026/
1•santiago-pl•9m ago•1 comments

Does using modulo (%) affect quality of randomness?

https://crypto.stackexchange.com/questions/22767/does-using-modulo-affect-quality-of-randomness
1•tosh•10m ago•0 comments

South Korea to Train All Active-Duty Soldiers to Operate Drones

https://www.wsj.com/world/asia/south-korea-to-train-all-active-duty-soldiers-to-operate-drones-28...
2•bookofjoe•10m ago•1 comments

Insert is a programming language for self-modifying code

https://github.com/uellenberg/Insert
1•trenchgun•10m ago•0 comments

Show HN: Keyway – Control your Mac from the keyboard

https://github.com/Njuhobby/keyway
2•njuhobby•11m ago•1 comments

Show HN: Lettered – a daily phrase puzzle game

https://lettered.io
1•ajhenrydev•12m ago•0 comments

A little bird told her: scientist wins $100k prize for decoding birdsong

https://www.theguardian.com/science/2026/jun/26/human-animal-communication-step-closer-scientist-...
1•Brajeshwar•12m ago•0 comments

Delete Doesn't Mean Deleted. Just Ask OpenAI

https://lindsaygross1.substack.com/p/delete-doesnt-mean-deleted-just-ask
1•herbertl•14m ago•0 comments

Explore+Anna's+Archive

https://annas-archive.is?aa_share=dkzcsggmtnt4rt1ehpy9r3yzkkaa8p8l&utm_source=share&utm_medium=ha...
2•constantanople•15m ago•0 comments

Israeli founder faces backlash on mobile OS launch

https://twitter.com/mil000/status/2070215925576728890
3•grandpajoey•17m ago•2 comments

Supalive, live queries for Postgres and MySQL

https://www.npmjs.com/package/@supalive/core
1•rebaz94•18m ago•0 comments

Show HN: My website as one connected graph – blog, second brain, and book

https://www.ssp.sh/
3•zazuke•21m ago•2 comments

Newsletter market in 2026: $10 per month is default price but not ceiling

https://pressgazette.co.uk/newsletters/newsletters-2026-prices-retention-churn/
1•bookofjoe•21m ago•0 comments

Outbound hold agent that pauses AI runtime while waiting on hold

https://github.com/team-telnyx/telnyx-code-examples/tree/main/outbound-hold-agent-python
1•anushathukral•22m ago•0 comments

What is a Lithium-ion capacitor?

https://www.jtekt.co.jp/e/products/capacitor/capacitor_about.html
2•ksec•22m ago•0 comments

How to Let AI, Workflows, and Custom Systems Manage Your Social Media

https://socialplod.com/blog/how-to-let-ai-workflows-and-custom-systems-manage-your-social-media-t...
1•dexterwura•22m ago•0 comments

More Than Syntax

https://redmonk.com/rstephens/2026/06/12/more-than-syntax/
1•wrxd•22m ago•0 comments

LlamaIndex integration for SynapCores (RAG, GraphRAG, and hybrid retrieval)

https://github.com/SynapCores/synapcores-llamaindex
1•Synapcores•22m ago•1 comments