frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Turn any website into a live, structured data feed

https://www.meter.sh/
15•chadwebscraper•3h ago

Comments

chadwebscraper•3h ago
Here’s how it works:

1. Paste a URL in, describe what you want

2. Define an interval to monitor

3. Get real time webhooks of any changes in JSON

Lots of customers are using this across different domains to get consistent, repeatable JSON out of sites and monitor changes.

Supports API + HTML extraction, never write a scraper again!

codingdave•1h ago
Writing a scraper isn't the hard part, that is actually fairly trivial at this point in time. Pulling content into JSON from your scrape is also fairly trivial - libraries exist that handle it well.

The harder parts are things like playing nicely so your bot doesn't get banned by sysadmins, detecting changes downstream from your URL, handling dynamically loading content, and keeping that JSON structure consistent even as your sites change their content, their designs, etc. Also, scalability. One customer I'm talking to could use a product like this, but they have 100K URLs to track, and that is more than I currently want to deal with.

I absolutely can see the use case for consistent change data from a URL, I'm just not seeing enough content in your marketing to know whether you really have something here, or if you vibe coded a scraper and are throwing it against the wall to see if it sticks.

chadwebscraper•1h ago
I appreciate the response! I also agree - happy to add some clarity to this stuff.

Bot protection - this is handled in a few ways, the basic form bypasses most bot protections and that’s what you can use on the site today. For tougher sites, it solves the bot protections (think datadome, Akamai, incapsula).

The consistency part is ongoing, but it’s possible to check the diffs and content extractions and notice if something has changed and “reindex” the site.

100k URLs is a lot! It could support that, but the initial indexing would be heavy. It’s fairly resource efficient (no browsers). For scale, it’s doing about 40k/scrapes a day right now.

Appreciate the comments, happy to dive deeper into the implantation and I agree with everything you’ve said. Still iterating and trying to improve it.

tmaly•54m ago
this must wreck their google analytics stats
chadwebscraper•53m ago
lol it probably does unless their filtering is great
arm32•1h ago
Residential proxies are sketchy at best. How can you guarantee that your service's infrastructure isn't hinging on an illicit botnet?
chadwebscraper•56m ago
This is a good callout - I’ve tried my best thus far to limit the use of proxies unless absolutely necessary and then focus on reputable providers (even though these are a bit more pricey).

Definitely going to give this more thought though, thank you for the comment

dewey•3m ago
There's a lot of variety in the residential proxy market. Some are sourced from bandwidth sharing SDKs for games with user consent, some are "mislabeled" IPs from ISPs that offer that as a product and then there's a long tail of "hacked" devices. Labeling them generally as sketchy seems wrong.
arjunchint•51m ago
So what happens when the website layout updates, does the monitoring job fail silently?

Voxtral Transcribe 2

https://mistral.ai/news/voxtral-transcribe-2
592•meetpateltech•7h ago•150 comments

Claude Code: connect to a local model when your quota runs out

https://boxc.net/blog/2026/claude-code-connecting-to-local-models-when-your-quota-runs-out/
100•fugu2•3d ago•37 comments

Claude Code for Infrastructure

https://www.fluid.sh/
80•aspectrr•4h ago•74 comments

Building a 24-bit arcade CRT display adapter from scratch

https://www.scd31.com/posts/building-an-arcade-display-adapter
89•evakhoury•5h ago•23 comments

A real-world benchmark for AI code review

https://www.qodo.ai/blog/how-we-built-a-real-world-benchmark-for-ai-code-review/
18•benocodes•1h ago•5 comments

Spotlighting the World Factbook as We Bid a Fond Farewell

https://www.cia.gov/stories/story/spotlighting-the-world-factbook-as-we-bid-a-fond-farewell/
26•mxfh•1h ago•9 comments

The Singularity Is Always Near (2006)

https://kk.org/thetechnium/the-singularity/
25•rmason•1d ago•20 comments

AI is killing B2B SaaS

https://nmn.gl/blog/ai-killing-b2b-saas
139•namanyayg•5h ago•228 comments

Remarkable Pro Colors

https://www.thregr.org/wavexx/rnd/20260201-remarkable_pro_colors/
14•ffaser5gxlsll•3d ago•3 comments

Tractor

https://incoherency.co.uk/blog/stories/tractor.html
114•surprisetalk•1d ago•36 comments

Attention at Constant Cost per Token via Symmetry-Aware Taylor Approximation

https://arxiv.org/abs/2602.00294
136•fheinsen•8h ago•70 comments

Show HN: Morph – Videos of AI testing your PR, embedded in GitHub

https://morphllm.com/products/glance
5•bhaktatejas922•1h ago•1 comments

A sane but bull case on Clawdbot / OpenClaw

https://brandon.wang/2026/clawdbot
224•brdd•1d ago•358 comments

RS-SDK: Drive RuneScape with Claude Code

https://github.com/MaxBittker/rs-sdk
75•evakhoury•5h ago•28 comments

Microsoft's Copilot chatbot is running into problems

https://www.wsj.com/tech/ai/microsofts-pivotal-ai-product-is-running-into-big-problems-ce235b28
56•fortran77•6h ago•62 comments

Arcan-A12: Weaving a Different Web

https://www.divergent-desktop.org/blog/2026/01/26/a12web/
38•ingenieroariel•6h ago•13 comments

Converge (YC S23) Is Hiring Product Engineers (NYC, In-Person)

https://www.runconverge.com/careers/product-engineer
1•thomashlvt•5h ago

Tell HN: Another round of Zendesk email spam

38•Philpax•3h ago•13 comments

Turn any website into a live, structured data feed

https://www.meter.sh/
15•chadwebscraper•3h ago•9 comments

The Codex app illustrates the shift left of IDEs and coding GUIs

https://www.benshoemaker.us/writing/codex-app-launch/
37•straydusk•2h ago•61 comments

Coding Agent VMs on NixOS with Microvm.nix

https://michael.stapelberg.ch/posts/2026-02-01-coding-agent-microvm-nix/
73•secure•3d ago•35 comments

Writing an optimizing tensor compiler from scratch

https://michaelmoroz.github.io/WritingAnOptimizingTensorCompilerFromScratch/
3•t-3•4d ago•0 comments

Technocracy 2.0

https://brooklynrail.org/2026/02/field-notes/technocracy-2-0/
46•antonomon•2h ago•24 comments

Show HN: Interactive California Budget (By Claude Code)

https://california-budget.com
19•sberens•2h ago•9 comments

Data Poems

https://dr.eamer.dev/datavis/poems/
6•putzdown•3d ago•0 comments

Study: emotional support from social media found to reduce anxiety

https://news.uark.edu/articles/80669/emotional-support-from-social-media-found-to-reduce-anxiety
62•giuliomagnifico•5h ago•67 comments

Claude is a space to think

https://www.anthropic.com/news/claude-is-a-space-to-think
299•meetpateltech•10h ago•155 comments

Show HN: Ghidra MCP Server – 110 tools for AI-assisted reverse engineering

https://github.com/bethington/ghidra-mcp
257•xerzes•15h ago•63 comments

No More Hidden Changes: How MySQL 9.6 Transforms Foreign Key Management

https://blogs.oracle.com/mysql/no-more-hidden-changes-how-mysql-9-6-transforms-foreign-key-manage...
17•ksec•4d ago•6 comments

A case study in PDF forensics: The Epstein PDFs

https://pdfa.org/a-case-study-in-pdf-forensics-the-epstein-pdfs/
217•DuffJohnson•7h ago•119 comments