frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•1y ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Star Citizen tops $1B crowdfunding milestone

https://www.eurogamer.net/star-citizen-tops-1-billion-sells-5000-unflyable-spaceship
2•jllyhill•3m ago•0 comments

Overheated chemical tank in southern California 'will fail', EPA chief says

https://www.theguardian.com/us-news/2026/may/24/chemical-tank-california-epa-lee-zeldin
2•tosh•5m ago•0 comments

Greetings, Class of 2026 Have You Heard About AI? Wait, Why Are You Booing?

https://www.theatlantic.com/newsletters/2026/05/ai-commencement-speech/687236/
3•Michelangelo11•7m ago•0 comments

MongoDB GUI Comparison: Compass vs. Studio 3T vs. VisuaLeaf

https://mongodb-gui-comparison.com/
3•roxana_haidiner•10m ago•0 comments

Open Source and the Iceberg Theory – Queue

https://spawn-queue.acm.org/doi/full/10.1145/3799738
2•rbanffy•13m ago•0 comments

SpaceX's Starship V3–still a work in progress–mostly successful on first flight

https://arstechnica.com/space/2026/05/spacexs-starship-v3-still-a-work-in-progress-mostly-success...
2•rbanffy•16m ago•0 comments

Show HN: Mercury v3 – Convert Python Notebooks to Web Apps

https://github.com/mljar/mercury
1•pplonski86•18m ago•0 comments

Rapira (Рапира) – Soviet programming language interpreter

https://github.com/begoon/rapira
1•begoon•19m ago•0 comments

DeepSeek's 10T USD grand strategy

https://twitter.com/bookwormengr/status/2057909493250539891
2•shscs911•20m ago•0 comments

Show HN: WhatsKept – Searchable,agent-queryable WhatsApp history from iOS backup

https://github.com/alkait/WhatsKept
2•tenthead•20m ago•0 comments

Samsung: It's Time for Floating Data Centers

https://datacenterrichness.substack.com/p/samsung-its-time-for-floating-data
6•rbanffy•26m ago•0 comments

How to Build Institution-Grade Yield Curves and Volatility Surfaces

https://medium.com/@DolphinDB_Inc/the-hidden-foundation-of-pricing-and-risk-how-ficc-curves-and-s...
3•CrazyTomato•27m ago•0 comments

Qwen3.7-Max Ran for 35 Hours on Unknown Hardware and Achieved a 10× Speedup

https://firethering.com/alibaba-qwen3-7-max-autonomous-agent/
2•steveharing1•27m ago•0 comments

Golomb Coding

https://en.wikipedia.org/wiki/Golomb_coding
2•tosh•29m ago•0 comments

Show HN: Geomatic – a command-driven geometry studio enabled with autodiff

https://www.tinyvolt.com/geomatic
3•nivter•32m ago•2 comments

Uvora Growth OS – AI marketing automation and lead generation platform

https://growth.uvora.cloud
2•ghcosmin•33m ago•0 comments

Brain motion is driven by mechanical coupling with the abdomen

https://www.nature.com/articles/s41593-026-02279-z
3•lentoutcry•38m ago•0 comments

Light pollution is washing out the sky. A remote telescope farm helps stargazers

https://www.google.com/url?q=https://www.cbsnews.com/news/starfront-observatories-light-pollution...
3•Michelangelo11•38m ago•0 comments

Show HN: SenseCollect – Web data extraction made simple

https://sensecollect.com
2•chrislxy•40m ago•0 comments

Nous: Offline Duolingo Style app for maths, science and humanities

https://play.google.com/store/apps/details?id=com.mathvoyager.app&hl=en_US
3•flirp•40m ago•1 comments

IMAX Is for a Sale

https://www.cnbc.com/2026/05/22/imax-sale-talks-potential-buyers-wall-street-analysts.html
3•mgh2•41m ago•0 comments

Bringer Tech

https://bringer.tech
2•amirwayne•42m ago•0 comments

Finding Security Bugs in OSS with LLMs on a Budget

https://www.etive-mor.com/blog/carlini-style-vulnerability-hunting-on-a-budget/
2•liamlaverty•43m ago•1 comments

Ask HN: Bitwarden Rejecting Master Password?

2•esquivalience•47m ago•0 comments

Wellness Peptide Craze

https://www.bbc.com/news/articles/cdr268m5pxro
3•andsoitis•48m ago•0 comments

Using design patterns to encode expert judgement for LLM workflows

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6723919
2•Jackal08•51m ago•0 comments

LLMs Locally With a CPU? I Tested 8 Models on Linux

https://itsfoss.com/testing-local-llms-without-gpu/
3•hochmartinez•51m ago•0 comments

Almost always look on the bright side of life

https://economist.com/business/2026/05/21/why-you-should-almost-always-look-on-the-bright-side-of...
3•andsoitis•1h ago•1 comments

Stop Doing Easy Things

https://xendo.bearblog.dev/stop-doing-easy-things/
3•xendo•1h ago•0 comments

The Essential Cloud for AI: Why Purpose-Built Defines the Future of Intelligence

https://www.coreweave.com/blog/the-essential-cloud-for-ai-why-purpose-built-defines-the-future-of...
2•janandonly•1h ago•0 comments