Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•11mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory
2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load)
3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing
4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV
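
For readers who want to see the scroll step in code, here is a minimal Python sketch of a progressive scroll loop. It is not Autonoly's implementation. The function name, the `patience` early-stop rule, and the injected `scroll_once` / `count_items` callables are all assumptions made for illustration. In a real browser session those callables would wrap whatever automation driver is in use.

```python
import time
from typing import Callable


def scroll_until_loaded(
    scroll_once: Callable[[], None],
    count_items: Callable[[], int],
    max_scrolls: int = 150,   # the post mentions roughly 150 iterations
    delay_s: float = 1.0,     # hypothetical delay; tune per site
    patience: int = 3,        # hypothetical: stop after this many scrolls with no growth
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Scroll repeatedly and return the number of items loaded at the end."""
    last_count = count_items()
    stale_rounds = 0
    for _ in range(max_scrolls):
        scroll_once()
        sleep(delay_s)  # give lazily loaded entries time to render
        current = count_items()
        if current > last_count:
            last_count = current
            stale_rounds = 0
        else:
            stale_rounds += 1
            if stale_rounds >= patience:
                break  # nothing new appeared for several scrolls in a row
    return last_count
```

Stopping once the item count stops growing avoids spending the full 150 iterations when the list finishes loading early, while `max_scrolls` still caps the run if the page never settles.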

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.
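
One way to cope with listings whose structure varies is to try several XPath patterns per field and take the first that matches. The sketch below shows the idea with Python's standard library only. The sample markup, class names, and field paths are hypothetical and are not YC's actual DOM. `xml.etree.ElementTree` supports only a subset of XPath and needs well-formed input, so a real page would call for an HTML-tolerant parser or the browser's own XPath engine.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical markup: the second entry uses a different tag for the name
# and has no location, to mimic entries whose structure varies.
SAMPLE = """
<div>
  <a class="company"><span class="name">Acme</span><span class="desc">Rockets</span><span class="loc">Austin, TX</span></a>
  <a class="company"><h3 class="name">Beta</h3><span class="desc">Payments</span></a>
</div>
""".strip()

# For each output column, candidate XPath patterns tried in order (all assumed).
FIELD_PATHS = {
    "name": [".//span[@class='name']", ".//h3[@class='name']"],
    "description": [".//span[@class='desc']"],
    "location": [".//span[@class='loc']"],
}


def first_text(node, paths):
    """Return the text of the first matching path, or '' if none match."""
    for path in paths:
        found = node.find(path)
        if found is not None and found.text:
            return found.text.strip()
    return ""


def extract_rows(markup):
    """Build one dict per company entry found in the markup."""
    root = ET.fromstring(markup)
    return [
        {field: first_text(card, paths) for field, paths in FIELD_PATHS.items()}
        for card in root.findall(".//a[@class='company']")
    ]


def to_csv(rows):
    """Serialize the rows to CSV text with a header line."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(FIELD_PATHS), lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_rows(SAMPLE)))
```

Missing fields come back as empty strings, so every row keeps the same columns even when an entry lacks a location.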

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.
