frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•11mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Three reasons why DeepSeek’s new model matters

https://www.technologyreview.com/2026/04/24/1136422/why-deepseeks-v4-matters/
1•thunderbong•3m ago•0 comments

Show HN: Cask.news – discover and track new homebrew Mac apps

https://cask.news/
1•to•3m ago•0 comments

I built a benchmark for testing LLMs playing Gomoku

https://github.com/homerquan/GomokuBench
1•homerquan•7m ago•0 comments

British cyclist takes KOM on San Francisco's steepest street with 41% gradient

https://www.bikeradar.com/news/harry-macfarlane-san-francisco-kom
1•littlexsparkee•7m ago•0 comments

Cheapest GPUs in the World

https://timlig.com/posts/cheapest-gpus-in-the-world/
1•anujsharmax•8m ago•0 comments

Meta Is Preparing to Have to Undo Its Manus Acquisition After China Ban

https://www.wsj.com/tech/ai/meta-is-preparing-to-have-to-undo-its-manus-acquisition-after-china-b...
2•thm•10m ago•0 comments

The AI Rug Pull

https://www.warman.life/blog/2026-04-27-the-apprenticeship/
3•shaunistyping•18m ago•0 comments

How to Start Journaling

https://www.theguardian.com/wellness/2026/apr/27/how-to-start-journaling
4•devonnull•21m ago•0 comments

ATS Resume Forge – an ATS-focused resume builder for job seekers

https://www.atsresumeforge.com/
1•kinrell•23m ago•1 comments

Why your 'Private Google Access enabled' subnet still bills Cloud NAT

https://github.com/FootprintAI/Containarium
1•hsin003•24m ago•1 comments

LingBot-Map: Streaming 3D reconstruction with geometric context transformer

https://technology.robbyant.com/lingbot-map
2•nateb2022•29m ago•0 comments

Florida AG probes ChatGPT's role in USF student killings

https://www.axios.com/local/tampa-bay/2026/04/27/florida-ag-openai-chatgpt-usf-murders-ai-account...
1•1vuio0pswjnm7•29m ago•1 comments

Claire's closes all 154 stores in UK and Ireland with loss of 1,300 jobs

https://www.bbc.com/news/articles/cg4047qnpk2o
6•stevekemp•33m ago•0 comments

Why Spotify has no button to filter out AI music

https://www.bbc.co.uk/news/articles/cd7jpg4w181o
2•dijksterhuis•36m ago•1 comments

The Hold

https://www.subbu.org/essays/2026/the-hold/
1•freediver•37m ago•0 comments

Cisco Introduces Universal Quantum Switch

https://newsroom.cisco.com/c/r/newsroom/en/us/a/y2026/m04/cisco-introduces-universal-quantum-swit...
1•kousthub•37m ago•1 comments

Conus Electrical Resistivity at 35km

https://www.usgs.gov/media/images/conus-electrical-resistivity-35km
2•testingonetwo34•37m ago•1 comments

Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error

https://www.philosophicalhacker.com/post/anthropic-error/
2•kmdupree•39m ago•0 comments

Devin for Terminal

https://devin.ai/terminal
1•qainsights•43m ago•1 comments

Stripe: Radar Technical Guide

https://stripe.com/in/guides/primer-on-machine-learning-for-fraud-protection
1•jonnonz•43m ago•0 comments

OpenJDK 21 April 2026 CVEs Explain

https://tux.re/forum/viewtopic.php?t=175
2•Neteam•44m ago•0 comments

The Conspiracy Against High Temperature Sampling

https://gist.github.com/Hellisotherpeople/71ba712f9f899adcb08b94bce20d5397
2•Der_Einzige•46m ago•0 comments

TeamPCP Supply Chain Campaign: Update 008

https://isc.sans.edu/diary/32926
1•jruohonen•48m ago•0 comments

Grocyy – AI receipt scanner that tracks grocery spending by item, not just total

https://grocyy.com/
1•Devanship1•51m ago•0 comments

Video Upscaler with Temporal Smoothing

https://github.com/freeaigit/video-upscaler
1•nadermx•57m ago•0 comments

Try Contra Dancing

https://www.benkuhn.net/contra/
2•jefftk•59m ago•0 comments

Consequences of passing too few register parameters to a C function

https://devblogs.microsoft.com/oldnewthing/20260427-00/?p=112271
1•aragonite•1h ago•0 comments

China's push to commercialize research: match 680k innovators with companies

https://www.nature.com/articles/d41586-026-01202-7
2•manvel_hn•1h ago•0 comments

Show HN: See your computer's audio output on a real-time piano

https://github.com/ecstrema/overchords
1•ecstrema•1h ago•0 comments

Show HN: PrePrompt – rewrites vague prompts before they reach the LLM

https://preprompt.org/
2•yashdeeptehlan•1h ago•1 comments