frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•6mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Software Gets a New Layer

https://www.wreflection.com/p/software-gets-a-new-layer
1•nowflux•3m ago•0 comments

Seekdb – AI-Native search database

https://github.com/oceanbase/seekdb
2•synergy20•3m ago•0 comments

Münchhausen Trilemma

https://en.wikipedia.org/wiki/M%C3%BCnchhausen_trilemma
1•thunderbong•4m ago•0 comments

Show HN: RIMC – An Alpha-Drift Framework for Finite-Speed Learning Markets

https://github.com/rimc-lab/RIMC
1•sode_rimc•5m ago•0 comments

Does Time Flow? New Clues Come from a Century-Old Approach to Math

https://www.quantamagazine.org/does-time-really-flow-new-clues-come-from-a-century-old-approach-t...
1•tesserato•12m ago•2 comments

FRIP Weaponizes Identity Fabrics

https://www.kuppingercole.com/blog/tolbert/how-frip-weaponizes-identity-fabrics-the-security-revo...
1•mooreds•13m ago•0 comments

Ukraine stares down the barrel of population collapse

https://www.reuters.com/world/ukraine-stares-down-barrel-population-collapse-2025-12-04/
2•layer8•13m ago•0 comments

How AI is rewiring childhood

https://www.economist.com/leaders/2025/12/04/how-ai-is-rewiring-childhood
1•jdkee•13m ago•1 comments

CDC advisory panel delays vote on hepatitis B vaccines after unruly meeting

https://www.msn.com/en-us/health/other/cdc-advisory-panel-delays-vote-on-hepatitis-b-vaccines-aft...
2•petethomas•14m ago•0 comments

Belief

https://en.wikipedia.org/wiki/Belief
1•marysminefnuf•15m ago•0 comments

How to Find Time to Do Science

https://chillphysicsenjoyer.substack.com/p/how-to-find-time-to-do-science
2•Gormisdomai•16m ago•0 comments

Ask HN: Is there a reliable mass automation focus group applier?

1•bunnybomb2•16m ago•0 comments

Dosh (LLM-powered shell commands)

https://raku-advent.blog/2025/12/01/day-1-dancer-dasher-and-dosh/
1•librasteve•21m ago•0 comments

Zero Table Dependency: A model for testing SQL as pure functions

https://github.com/mk3008/rawsql-ts/tree/main/packages/drivers/pg-testkit
1•masugiura•21m ago•0 comments

Apple Announces Departure of Lisa Jackson and Kate Adams

https://www.cnbc.com/2025/12/04/apple-announces-departure-lisa-jackson-kate-adams.html
2•coloneltcb•23m ago•0 comments

Qwen3-VL 2B on Raspberry Pi with llama.cpp

https://eheidi.dev/posts/raspberry-llama/
1•ignoramous•24m ago•0 comments

Show HN: Disaggregating GPU compute from CPU in ML job execution to scale GPUs

https://woolyai.com/
1•medicis123•25m ago•0 comments

Confessions can keep language models honest

https://openai.com/index/how-confessions-can-keep-language-models-honest/
2•xavierlint•26m ago•0 comments

Cosmological Lithium Problem

https://en.wikipedia.org/wiki/Cosmological_lithium_problem
1•eklitzke•30m ago•0 comments

Coca Cola has an executive dedicated to McDonald's

https://www.coca-colacompany.com/about-us/leadership/roberto-mercade
3•sbolt•31m ago•0 comments

TanStack AI Alpha: Your AI, Your Way

https://tanstack.com/blog/tanstack-ai-alpha-your-ai-your-way
1•franky47•32m ago•0 comments

Like Social Media, AI Requires Difficult Choices

https://www.schneier.com/blog/archives/2025/12/like-social-media-ai-requires-difficult-choices.html
3•hn_acker•32m ago•0 comments

What is the difference between science and pseudoscience? [video]

https://www.youtube.com/watch?v=UTpxICN-O1U
1•PotatoPancakes•33m ago•0 comments

We gave 5 LLMs $100K to trade stocks for 8 months

https://www.aitradearena.com/research/we-ran-llms-for-8-months
58•cheeseblubber•35m ago•47 comments

SMS phishers pivot to points, taxes, fake retailers

https://krebsonsecurity.com/2025/12/sms-phishers-pivot-to-points-taxes-fake-retailers/
12•todsacerdoti•35m ago•0 comments

Google Rolling Out Gemini 3 Deep Think to AI Ultra

https://9to5google.com/2025/12/04/gemini-3-deep-think/
1•gradus_ad•37m ago•0 comments

Microflora Danica–a genetic atlas of Danish environmental microbiomes

https://www.nature.com/articles/s41586-025-09794-2
1•macmac•37m ago•0 comments

Young Adults and the Future of News

https://www.pewresearch.org/journalism/2025/12/03/young-adults-and-the-future-of-news/
3•hn_acker•37m ago•0 comments

Tired of social media noise, so I built a platform for technical discussions

https://synthchat.netlify.app
2•akku779•37m ago•1 comments

Titans and MIRAS: Helping AI have long-term memory

https://research.google/blog/titans-miras-helping-ai-have-long-term-memory/
1•xnx•37m ago•0 comments