I built a web tool for accessing just the content (usually text) of sites. It handles news sites best, thanks to Mozilla's fantastic Readability library. It also supports sites that use client-side rendering, via Puppeteer. Stripped pages are cached in Redis.
My main use case for this tool is extremely bandwidth-constrained networks, e.g. Meshtastic (with an internet gateway).
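For anyone curious about the strip-then-cache flow, here is a rough, self-contained sketch of the idea. It is not the tool's actual code: `stripToText` is a naive regex stand-in for Readability, `StrippedPageCache` is an in-memory stand-in for Redis, and every name and the TTL value are hypothetical. It is also kept synchronous for brevity, whereas a real fetch or Puppeteer render would be async.

```typescript
// Hypothetical names throughout; a sketch of the pattern, not the real tool.

type CacheEntry = { text: string; expiresAt: number };

// Naive stand-in for Readability: drops script/style blocks and all tags,
// then collapses whitespace. Real article extraction is far more involved.
function stripToText(html: string): string {
  return html
    .replace(/<(script|style)\b[^>]*>[\s\S]*?<\/\1>/gi, " ")
    .replace(/<[^>]+>/g, " ")
    .replace(/\s+/g, " ")
    .trim();
}

// In-memory stand-in for Redis, with a time-to-live per entry.
class StrippedPageCache {
  private entries = new Map<string, CacheEntry>();
  private ttlMs: number;
  private now: () => number;

  constructor(ttlMs: number, now: () => number = Date.now) {
    this.ttlMs = ttlMs;
    this.now = now;
  }

  // Returns the cached text for a URL, or loads, strips, and stores it.
  getOrStrip(url: string, loadHtml: (url: string) => string): string {
    const hit = this.entries.get(url);
    if (hit !== undefined && hit.expiresAt > this.now()) {
      return hit.text; // still fresh: skip the expensive load
    }
    const text = stripToText(loadHtml(url));
    this.entries.set(url, { text, expiresAt: this.now() + this.ttlMs });
    return text;
  }
}
```

The point of caching the stripped text rather than the raw HTML is that repeat requests skip both the fetch and the extraction step, which matters most when pages need a headless browser to render.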
Comments
japaco•8m ago
Nice. Have you considered going a step further and formatting the stripped-down page content for delivery via an AI audio reader? Might be an interesting twist.