frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

DocHero: PDF Editor and Sign PDF

https://apps.apple.com/us/app/dochero-pdf-editor-sign-pdf/id6781691509
1•suryanshJ•10m ago•0 comments

Fin: A Jellyfin Client for the Terminal

https://tangled.org/tsiry-sandratraina.com/fin
2•nerdypepper•12m ago•0 comments

Scientists decry conference's use of hidden prompts to snare AI peer reviews

https://www.thetransmitter.org/publishing/scientists-decry-conferences-use-of-hidden-prompts-to-s...
1•jruohonen•15m ago•0 comments

Could the next great novel be written by AI?

https://www.theguardian.com/books/ng-interactive/2026/jul/04/future-of-fiction-next-great-novel-a...
1•scandox•20m ago•2 comments

HackathonHub – the control room for hackathons, game jams, and team competitions

https://hackathonhub.xyz/
1•igorthenomad•20m ago•0 comments

Provenance: Proving That Your Code Is Really Yours

https://medium.com/@vektormemory/provenance-proving-that-your-code-is-really-yours-603c09407a97
1•vektormemory•20m ago•1 comments

AI models' values are different from most people's

https://www.economist.com/briefing/2026/06/25/ai-models-values-are-very-different-from-most-peoples
2•Anon84•21m ago•0 comments

We Are Running Companies on Chat Windows and Calling It a Revolution

https://irishtechnews.ie/running-companies-on-chat-windows-calling-it-rev/
2•belkin1•23m ago•0 comments

Jersey Mike's IPO illustrates how bad the AI hype is

https://finance.yahoo.com/technology/ai/articles/jersey-mike-ipo-illustrates-bad-201159743.html
3•cybermango•24m ago•0 comments

Arbitrary code execution breaking sandboxes in KDE Plasma

https://blog.kimiblock.top/2026/07/01/arbitrary-code-execution-in-kde-plasma/index.html
2•birdculture•27m ago•0 comments

BiOptimizers Magnesium Breakthrough Reviews – Truth Check

https://gamma.app/embed/Magnesium-Breakthrough-By-BiOptimizers-Honest-Review-tbe77aupzt7tnrd?mode...
1•prepostseo•32m ago•0 comments

Show HN: AI Coloring Page Generator for printable classroom worksheets

https://aicoloringpagegenerator.org/
1•robot1996•32m ago•0 comments

Show HN: AI Video Detector – check whether a video may be AI-generated

https://aivideodetector.video
1•robot1996•32m ago•0 comments

Dangerously-skip-permissions is the only safe mode

https://www.granola.ai/blog/dangerously-skip-permissions-is-the-only-safe-mode
3•jamesfisher•32m ago•0 comments

Syscall: Ring ZERO assembly puzzle game for those who are tired of agentic AI

https://store.steampowered.com/app/4849330/SYSCALL_RING_ZERO/
1•thisisneat•33m ago•0 comments

Show HN: AssistantAI – Real-Time Conversation Hints and Screenshot Analysis

https://github.com/Aleksandern/assistant-ai
1•aleksandern08•33m ago•0 comments

In 1850, Ignaz Semmelweis saved lives with three words: wash your hands (2015)

https://www.pbs.org/newshour/health/ignaz-semmelweis-doctor-prescribed-hand-washing
2•downbad_•38m ago•0 comments

Show HN: Qpilot – AI agent runs plain-text manual test cases in a real browser

https://github.com/broxhq/qpilot
2•Muhammad-21•39m ago•1 comments

The Declaration of Independence

https://acoup.blog/2026/07/04/collections-on-the-declaration-of-independence/
1•cesidio•40m ago•0 comments

Show HN: An MCP server that gives your AI assistant write access to /etc./hosts

https://www.lockinmcp.com
2•Kiog-Aser•45m ago•1 comments

Show HN: ASMLings – Rustlings-style exercises for Intel 8086 assembly

https://github.com/giacomo-folli/asmlings
2•stopwatch4619•46m ago•0 comments

I stopped automating Reddit marketing and went back to manual

https://replytone.com
3•alwayswntent•47m ago•0 comments

Async: What Is Blocking? (2022)

https://ryhl.io/blog/async-what-is-blocking/
1•vinhnx•47m ago•0 comments

A field guide to Fable: finding your unknowns

https://twitter.com/trq212/status/2073100352921215386
1•mustaphah•49m ago•1 comments

Escape the Moon

https://escapethemoon.vercel.app/
1•rmoff•51m ago•0 comments

Show HN: Majority.wtf · guess yesterday's most common answer to a question

https://majority.wtf/
1•leumon•53m ago•0 comments

Show HN: Psycurate – Curated place for privacy-first, free psychological tools

https://psycurate.com
1•mursu•53m ago•0 comments

Show HN: Quizzyly – A Chrome Extension to Get Quiz from Current Webpage

https://chromewebstore.google.com/detail/quizzyly/agfdemmfpenkbccjnpjliaenpeihefae
1•satyajitdas•54m ago•0 comments

Show HN: Self hosted excalidraw workspace with storage

https://github.com/PriyavKaneria/excalidraw-workspace
1•diginova•1h ago•0 comments

Help, my dentist started coding – or: a little history of low code solutions

https://thomas-witt.com/blog/help-my-dentist-started-coding/
2•thomas_witt•1h ago•0 comments