frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Arch Linux AUR Hit by Another Wave of Now More Sophisticated Malware Attack

https://www.phoronix.com/news/Arch-Linux-AUR-More-Malware
1•ImJamal•3m ago•0 comments

AI enables 1000 people to hold a thoughtful conversation

https://bigthink.com/science-tech/collective-superintelligence/
1•bonkerbits•6m ago•1 comments

How Utahns Took on Mr. Wonderful and a Data Center on the Great Salt Lake

https://www.nytimes.com/2026/06/14/us/elections/kevin-oleary-utah-data-center.html
2•ChrisArchitect•13m ago•1 comments

American capitalism is run by millionaires, not billionaires

https://www.economist.com/business/2026/06/10/american-capitalism-is-run-by-millionaires-not-bill...
2•Anon84•15m ago•0 comments

A live ledger of things people wish existed captured from the BlueSky firehose

https://www.unbuilt.so
2•plural•21m ago•0 comments

Why Software, Not Drones, Will Decide the Next War

https://csis-website-prod.s3.amazonaws.com/s3fs-public/2026-06/260610_Bondar_Defining_Autonomy.pd...
3•tow21•21m ago•1 comments

Ask HN: If 160M Americans are employed, what's the unemployment rate?

1•paganartifact•25m ago•3 comments

Everyone Was Wrong About Maximum Siphon Height [video]

https://www.youtube.com/watch?v=5glksNTKkZI
2•thunderbong•29m ago•0 comments

Why my book can be downloaded for free (2014)

https://blog.plover.com/book/free-hop.html
1•downbad_•30m ago•0 comments

Show HN: Afterburner – Capability-Sandboxed JavaScript/TS Runtime in Rust

https://github.com/afterburner-sh/afterburner
1•vertexclique•31m ago•0 comments

Claude Fable 5 vs. GPT-5.5: better planning, similar execution

https://blog.kilo.ai/p/claude-fable-5-vs-gpt-5-5
2•justiceforsaas•33m ago•0 comments

AI is revolutionising the stock market

https://www.ft.com/content/b31f1e09-5aae-4cad-af15-97adb15dba70
1•thm•34m ago•0 comments

Meta‑Attention Is All You Need

https://medium.com/@vla3728419/meta-attention-is-all-you-need-650a90832d27
1•theorchid•34m ago•0 comments

Qwen 3.6 93B with MTP on 2×RTX 3090 NVLink=187 tokens/SEC,LLM lost bleat-a-thon

https://github.com/Augmented-Reality-Virtual-Reality-AR-VR/P...
3•devilfileprong•34m ago•0 comments

How did Atari apply side art to Arcade Cabinets?

https://arcadeblogger.com/2026/06/14/how-did-atari-apply-side-art-to-arcade-cabinets/
6•msephton•35m ago•1 comments

A text-first social network – No Images, No Videos

https://trendter.com/
1•badz•37m ago•2 comments

Moving Averages (2022)

https://gregorygundersen.com/blog/2022/06/04/moving-averages/
2•tosh•39m ago•0 comments

Shoehorning Flying Toasters into a ESP32-S3

https://taoofmac.com/space/blog/2026/06/14/1400
2•rcarmo•40m ago•0 comments

Brief Notes on Computer Word and Byte Sizes

https://www.cs.columbia.edu/~smb/blog/2023-03/2023-03-07.html
2•jruohonen•40m ago•0 comments

Why Rust does not need OOP

https://belderbos.dev/blog/why-rust-does-not-need-oop/
1•lumpa•41m ago•0 comments

The origins of the AI age /s

https://indiekartik.substack.com/p/the-origins-of-the-ai-age
1•kartik0001•46m ago•0 comments

I built a zero-tab reading workflow for Hacker News

https://gopeek-lovat.vercel.app/blog-hacker-news-workflow.html
6•ofcyes•53m ago•1 comments

The Birth and Death of JavaScript (2014)

https://www.destroyallsoftware.com/talks/the-birth-and-death-of-javascript
41•subset•54m ago•11 comments

China's universities cut 12,000 'obsolete' degrees

https://www.scmp.com/economy/china-economy/article/3356913/chinas-universities-cut-12000-obsolete...
4•the-mitr•54m ago•0 comments

Elon Musk's role was 'instrumental' in the Belfast riots, researchers say

https://www.lemonde.fr/en/international/article/2026/06/13/musk-s-role-was-instrumental-in-the-be...
7•tastyface•56m ago•1 comments

Formal Methods and the Future of Programming

https://blog.janestreet.com/formal-methods-at-jane-street-index/?from_theconsensus=1
2•eatonphil•57m ago•0 comments

The Universe Is Made of Music [video]

https://www.youtube.com/watch?v=j06DGlbwM34
1•rogmash•58m ago•0 comments

Show HN: KBlocker - Linux Productivity Hack

https://github.com/Dan-J-D/kblocker
1•dan-j-d•1h ago•0 comments

Double your Codex / Claude Code productivity and output

https://github.com/tanweai/pua
1•sturza•1h ago•0 comments

Kagi adds Hacker News conversation under links

https://imgur.com/a/S4S0vPX
3•scosman•1h ago•0 comments