frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•10mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Invoice Processing Cost per Invoice: The 2026 Benchmark

https://www.digiparser.com/statistics/invoice-processing-cost-per-invoice
1•thepantales•59s ago•0 comments

The last shall be (slightly) safer

https://dylancastillo.co/til/securing-package-managers.html
1•dcastm•1m ago•0 comments

Better-Clawd – A Claude Code Fork with OpenRouter and OpenAI Support

https://github.com/x1xhlol/better-clawd
1•lucknite•1m ago•0 comments

Negative social ties as emerging risk factors for accelerated aging

https://www.pnas.org/doi/10.1073/pnas.2515331123
1•ulrischa•1m ago•0 comments

Half of social-science studies fail replication test in years-long project

https://www.nature.com/articles/d41586-026-00955-5
2•MBCook•3m ago•0 comments

AI for American-Produced Cement and Concrete

https://engineering.fb.com/2026/03/30/data-center-engineering/ai-for-american-produced-cement-and...
1•latchkey•6m ago•0 comments

Show HN: Metal Quantized Attention on M5 Max

https://releases.drawthings.ai/p/metal-quantized-attention-pulling
1•liuliu•7m ago•0 comments

Is "Hackback" Official US Cybersecurity Strategy?

https://www.schneier.com/blog/archives/2026/04/is-hackback-official-us-cybersecurity-strategy.html
1•speckx•7m ago•0 comments

Show HN: H-Core Snapshot – forcing LLMs to execute instead of explain

https://github.com/yaloms/h-core-snapshot
1•Stronz•7m ago•0 comments

Telling More Than We Can Know: Verbal Reports on Mental Processes(1977)[pdf]

https://home.csulb.edu/~cwallis/382/readings/482/nisbett%20saying%20more.pdf
1•kelseyfrog•7m ago•0 comments

Show HN: I Played Total Overdose Today, Once More

1•gray_wolf_99•8m ago•0 comments

Kia to sell lower-priced electric vehicle in US

https://www.reuters.com/business/autos-transportation/kia-sell-lower-priced-electric-vehicle-us-2...
2•tartoran•9m ago•0 comments

Pesticides and cancer: researchers find a connection at the national level

https://www.lemonde.fr/en/environment/article/2026/04/01/pesticides-and-cancer-for-the-first-time...
1•MrDresden•10m ago•1 comments

The Family That Decided to Have Their Stomachs Removed

https://www.theatlantic.com/health/2026/03/stomach-cancer-total-gastrectomy/686623/
1•breve•10m ago•0 comments

Claude Code Steals Your Dreams

https://github.com/Bitterbot-AI/bitterbot-desktop/tree/main/docs/memory
2•VtotheMtotheG•11m ago•0 comments

Community Pulse – Episode 103 – AI Slop in DevRel

https://www.communitypulse.io/103-ai-slop
1•aspleenic•12m ago•0 comments

NASA Artemis II moon mission live launch broadcast

https://plus.nasa.gov/scheduled-video/nasas-artemis-ii-crew-launches-to-the-moon-official-broadcast/
11•apitman•12m ago•0 comments

As Moon interest heats up, two companies unveil plans for a lunar "harvester"

https://arstechnica.com/space/2026/03/as-moon-interest-heats-up-two-companies-unveil-plans-for-a-...
1•PaulHoule•12m ago•0 comments

Tell HN: Git hook to keep LLM signatures out of your commit history

1•akktor•13m ago•2 comments

I Rebuilt Traceroute in Rust and It Was Simpler Than I Expected

https://tech.stonecharioteer.com/posts/2026/traceroute/
2•stonecharioteer•13m ago•0 comments

Show HN: AirplaneMode – Simulate realistic airplane WiFi on macOS

https://github.com/freeze-rey/airplanemode-sim
1•jlreyes•14m ago•1 comments

AI Usage on Texas

https://daviduritu.substack.com/p/the-safety-valve
1•claudiug•14m ago•0 comments

Ask HN: Was Bay Area traffic less today?

1•HoldOnAMinute•16m ago•0 comments

Understanding CPUs by building one in Kotlin

https://github.com/bloderxd/kotlin-cpu
1•bloder•16m ago•1 comments

Thinking Too Different – Apple Vision Pro, Disability and 20 Months in Court

https://medium.com/@edgecaseexistence/thinking-too-different-apple-50-years-later-5d16b2257841
1•iheartbiggpus•18m ago•0 comments

Renewables hit 49.4% of global electricity capacity in 2025

https://www.theregister.com/2026/04/01/renewables_generated_nearly_half_global_power/
1•speckx•18m ago•0 comments

Best Office Chairs of 2026– I've Tested 65 to Pick Them

https://www.wired.com/gallery/best-office-chairs/
2•joozio•19m ago•0 comments

James Webb captures two galaxies in the middle of a cosmic collision

https://techfixated.com/james-webb-captures-two-galaxies-in-the-middle-of-a-cosmic-collision/
2•benlarweh•19m ago•0 comments

An Invisible Bottleneck: A Helium Shortage Threatens the Chip Industry

https://www.nytimes.com/2026/03/27/business/helium-chips-iran-war.html
1•walterbell•20m ago•0 comments

Show HN: Agent Action Guard – AI agent action safety

1•praneeth-v•21m ago•0 comments