frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a YC data scraper in under 5 minutes

https://www.autonoly.com/blog/682212a2b65a68f26d0c10a4/how-to-scrape-complete-y-combinator-startup-data-in-3-minutes-without-writing-a-single-line-of-code
2•dpacman•7mo ago
Hi HN,

I'm an indie hacker who built Autonoly solo over the past 3.5 months. I essentially vibe coded the entire platform based on automation needs I encountered in my own work. I wanted to share a practical example of what it can do - creating a Y Combinator data scraper in just a few minutes without writing any traditional code.

The technical approach is straightforward but effective:

1. Browser automation navigates to YC's company directory 2. For YC's infinite scroll pagination, I implemented a progressive scroll function that iterates about 150 times with calibrated delays (ensuring all ~1000+ companies load) 3. Data extraction uses XPath selectors to identify and capture the structural pattern of each company listing 4. The system then extracts specific data points (company name, description, location, etc.) into a structured CSV

The trickiest parts were getting the XPath patterns right (the DOM structure varies slightly between different company entries) and fine-tuning the scroll timing to ensure complete loading without timeout issues.

What makes this approach effective is that it works with the site's intended user experience. The browser automation renders JavaScript properly, handles dynamic loading, and interacts with elements in a natural way.

While this YC scraper example is specific, I built Autonoly to automate virtually any digital task - data processing, content creation, file management, business workflows, and more. As an indie developer, I kept encountering processes that were tedious to do manually but didn't justify hiring someone or spending weeks on custom code.

I'd love to hear feedback from the HN community, especially from those who've built similar systems or have different approaches to workflow automation. Happy to answer any technical questions about the implementation or discuss the challenges of building automation tools as a solo founder.

Show HN: Habit Tracking as an RPG in Google Sheets

https://befitting-iodine-673.notion.site/Gamified-Habit-Tracker-Turn-Your-Daily-Habits-into-an-RP...
1•digital_tempo•1m ago•0 comments

Show HN: Omnvert Network Toolkits Ping/MTR,DNS,Headers,TLS,RDAP,PCAFlowsP→JSON

https://omnvert.com/en/category/network
1•kaant•1m ago•0 comments

Shipping at Inference-Speed

https://steipete.me/posts/2025/shipping-at-inference-speed
1•jpalomaki•3m ago•0 comments

Show HN: I built a standalone WASM playbook builder for sports coaches

https://playmaker.click/playbook
1•paulgrimes1•4m ago•0 comments

Snowflake Gen2 vs. Gen1: performance and cost analysis

https://seemoredata.io/blog/gen1-vs-gen2-snowflake-warehouses/
2•yanivleven•6m ago•1 comments

I built justRead because existing reading apps ignore what readers want

https://justread.app/en/blog_post_development_of_justread_part_one
2•jahaman•14m ago•2 comments

Netflix: Open Content

https://opencontent.netflix.com/
27•tosh•15m ago•0 comments

Show HN: LLMRouter – Stop using GPT-4/o1 for everything (16 routing strategies)

https://github.com/ulab-uiuc/LLMRouter
2•tao2024•16m ago•1 comments

Researchers make "neuromorphic" artificial skin for robots

https://arstechnica.com/science/2025/12/researchers-make-neuromorphic-artificial-skin-for-robots/
3•smurda•17m ago•0 comments

The Coordination Tax

https://codegood.co/writing/the-coordination-tax
2•iolloyd•20m ago•1 comments

Now we are done with these AI data centers

https://www.wired.com/story/expired-tired-wired-data-centers/
3•octave12•20m ago•1 comments

Marks and Spencer launches 'nutrient dense' range for people on weight-loss jabs

https://www.theguardian.com/business/2025/dec/30/marks-spencer-nutrient-dense-range-weight-loss-jabs
2•chrisjj•29m ago•1 comments

Show HN: Tetris Time

https://tetris-time.koenvangilst.nl/?mode=countdown&to=2026-01-01T00:00:00.000Z&speed=3
3•vnglst•29m ago•0 comments

HSBC blocks its app due to F-Droid-installed Bitwarden

https://mastodon.neilzone.co.uk/@neil/115807834298031971
59•_____k•30m ago•29 comments

Harvard Youth Poll (51st Edition – Fall 2025)

https://iop.harvard.edu/youth-poll/51st-edition-fall-2025
4•futurecat•30m ago•0 comments

An attempt on defining the nature of consciousness [pdf]

https://philpapers.org/go.pl?id=HUGOTN&proxyId=&u=https%3A%2F%2Fphilpapers.org%2Farchive%2FHUGOTN...
2•Trenthug•32m ago•1 comments

ffmpeg.wasm

https://ffmpegwasm.netlify.app/
2•saikatsg•32m ago•0 comments

Meta Buys Manus

https://manus.im/updates
3•nrsapt•33m ago•0 comments

Louis Gerstner, former IBM CEO who revitalized 'Big Blue,' dies at 83

https://www.reuters.com/business/louis-gerstner-former-ibm-ceo-who-revitalized-big-blue-dies-83-2...
4•bhouston•34m ago•0 comments

China mandates 50% domestic equipment rule for chipmakers sources say

https://www.reuters.com/world/china/china-mandates-50-domestic-equipment-rule-chipmakers-sources-...
2•_____k•34m ago•0 comments

Japanese Black Coffee Woodcut

https://wildclothing.store/limited-edition-copy-329?productId=&color=GILDAN-BLACK
2•keepamovin•35m ago•0 comments

AB-AV1-GUI: A Simple Python GUI for AB-AV1 Conversion of Video Files to AV1

https://github.com/Loufe/AB-AV1-GUI
1•tosh•37m ago•0 comments

Toit: Program your microcontrollers in a fast and robust high-level language

https://github.com/toitlang/toit
3•tosh•44m ago•0 comments

More on Shuffles

https://shreevatsa.net/post/more-on-shuffles/
1•fanf2•45m ago•0 comments

Show HN: FlowStateOS Companion – experiment, modeling human life forces with GPT

https://chatgpt.com/g/g-69291c93fdc481918b3f13e60d5dcf2e-flowstateos-companion
1•Yusufshunan•45m ago•0 comments

Show HN: Lazy-image – Node.js image library with static binaries (Rust/NAPI)

https://github.com/albert-einshutoin/lazy-image
1•einshutoin•46m ago•0 comments

Undisciplined? Entitled? Lazy? Gen Z faces familiar flood of workplace criticism

https://www.theguardian.com/money/ng-interactive/2025/nov/17/gen-z-workplace-criticism
2•abdelhousni•50m ago•0 comments

Show HN: DT Gatekeeper – a GPT that maps decision closures before they lock in

https://chatgpt.com/g/g-69538d028b148191850423a69f88bd63-dt-gatekeeper
1•Yusufshunan•57m ago•0 comments

Show HN: Reko – Local-First YouTube-to-Markdown LLM Summarizer

https://github.com/riccardoruspoli/reko
1•riccardoruspoli•57m ago•0 comments

Looking for Feedback: Tips to Improve My Website's Design and User Experience

https://idealremit.com
1•bk-mira•57m ago•2 comments