frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: MCP-baepsae – MCP server for iOS Simulator automation

https://github.com/oozoofrog/mcp-baepsae
1•oozoofrog•1m ago•0 comments

Make Trust Irrelevant: A Gamer's Take on Agentic AI Safety

https://github.com/Deso-PK/make-trust-irrelevant
2•DesoPK•5m ago•0 comments

Show HN: Sem – Semantic diffs and patches for Git

https://ataraxy-labs.github.io/sem/
1•rs545837•6m ago•1 comments

Hello world does not compile

https://github.com/anthropics/claudes-c-compiler/issues/1
1•mfiguiere•12m ago•0 comments

Show HN: ZigZag – A Bubble Tea-Inspired TUI Framework for Zig

https://github.com/meszmate/zigzag
2•meszmate•14m ago•0 comments

Metaphor+Metonymy: "To love that well which thou must leave ere long"(Sonnet73)

https://www.huckgutman.com/blog-1/shakespeare-sonnet-73
1•gsf_emergency_6•16m ago•0 comments

Show HN: Django N+1 Queries Checker

https://github.com/richardhapb/django-check
1•richardhapb•32m ago•1 comments

Emacs-tramp-RPC: High-performance TRAMP back end using JSON-RPC instead of shell

https://github.com/ArthurHeymans/emacs-tramp-rpc
1•todsacerdoti•36m ago•0 comments

Protocol Validation with Affine MPST in Rust

https://hibanaworks.dev
1•o8vm•41m ago•1 comments

Female Asian Elephant Calf Born at the Smithsonian National Zoo

https://www.si.edu/newsdesk/releases/female-asian-elephant-calf-born-smithsonians-national-zoo-an...
2•gmays•42m ago•0 comments

Show HN: Zest – A hands-on simulator for Staff+ system design scenarios

https://staff-engineering-simulator-880284904082.us-west1.run.app/
1•chanip0114•43m ago•1 comments

Show HN: DeSync – Decentralized Economic Realm with Blockchain-Based Governance

https://github.com/MelzLabs/DeSync
1•0xUnavailable•48m ago•0 comments

Automatic Programming Returns

https://cyber-omelette.com/posts/the-abstraction-rises.html
1•benrules2•51m ago•1 comments

Why Are There Still So Many Jobs? The History and Future of Workplace Automation [pdf]

https://economics.mit.edu/sites/default/files/inline-files/Why%20Are%20there%20Still%20So%20Many%...
2•oidar•53m ago•0 comments

The Search Engine Map

https://www.searchenginemap.com
1•cratermoon•1h ago•0 comments

Show HN: Souls.directory – SOUL.md templates for AI agent personalities

https://souls.directory
1•thedaviddias•1h ago•0 comments

Real-Time ETL for Enterprise-Grade Data Integration

https://tabsdata.com
1•teleforce•1h ago•0 comments

Economics Puzzle Leads to a New Understanding of a Fundamental Law of Physics

https://www.caltech.edu/about/news/economics-puzzle-leads-to-a-new-understanding-of-a-fundamental...
3•geox•1h ago•1 comments

Switzerland's Extraordinary Medieval Library

https://www.bbc.com/travel/article/20260202-inside-switzerlands-extraordinary-medieval-library
2•bookmtn•1h ago•0 comments

A new comet was just discovered. Will it be visible in broad daylight?

https://phys.org/news/2026-02-comet-visible-broad-daylight.html
4•bookmtn•1h ago•0 comments

ESR: Comes the news that Anthropic has vibecoded a C compiler

https://twitter.com/esrtweet/status/2019562859978539342
2•tjr•1h ago•0 comments

Frisco residents divided over H-1B visas, 'Indian takeover' at council meeting

https://www.dallasnews.com/news/politics/2026/02/04/frisco-residents-divided-over-h-1b-visas-indi...
4•alephnerd•1h ago•5 comments

If CNN Covered Star Wars

https://www.youtube.com/watch?v=vArJg_SU4Lc
1•keepamovin•1h ago•1 comments

Show HN: I built the first tool to configure VPSs without commands

https://the-ultimate-tool-for-configuring-vps.wiar8.com/
2•Wiar8•1h ago•3 comments

AI agents from 4 labs predicting the Super Bowl via prediction market

https://agoramarket.ai/
1•kevinswint•1h ago•1 comments

EU bans infinite scroll and autoplay in TikTok case

https://twitter.com/HennaVirkkunen/status/2019730270279356658
6•miohtama•1h ago•5 comments

Benchmarking how well LLMs can play FizzBuzz

https://huggingface.co/spaces/venkatasg/fizzbuzz-bench
1•_venkatasg•1h ago•1 comments

Why I Joined OpenAI

https://www.brendangregg.com/blog/2026-02-07/why-i-joined-openai.html
29•SerCe•1h ago•23 comments

Octave GTM MCP Server

https://docs.octavehq.com/mcp/overview
1•connor11528•1h ago•0 comments

Show HN: Portview what's on your ports (diagnostic-first, single binary, Linux)

https://github.com/Mapika/portview
3•Mapika•1h ago•0 comments
Open in hackernews

Defeating AI scraping by rethinking webpage rendering

2•exodys•2w ago
Consider the idea of someone creating a project that rendered out webpages in images and sent those over the web; updating them whenever an input is received, much like a video game input loop. If everything was server side rendered, how difficult would it be for scraping? The idea of an un-copyable webpage is enticing, assuming that you would not like your data scraped. I know computer vision is a thing, but the error rate may be enough?

Comments

bryanrasmussen•2w ago
I believe you would run into accessibility laws that would make your project extremely illegal.

on edit: actually probably not illegal, that would be the wrong word, extremely open to financially ruinous lawsuits would be the correct phrasing.

exodys•2w ago
I guess it would have to provide audio in order to be accessible, and if blind and deaf... well, do most people prepare applications for the blind and deaf?
bryanrasmussen•2w ago
in the EU or the US where there are laws requiring that things are accessibly you can be sued if you are not, the EU law is quite new, the US is not. Some of the things that go into a decision as to what the amount will be you are fined is how inaccessible you are, so an image would be not at all.

If you are rendering a single image of a page - does this page have interactive parts? How are you planning on people actually interacting with the image? If you have it solved for sighted people interacting with the graphical rendering of your application you also have to provide solutions for people who are not sighted, have mobility issues, combinations of the two...

If you have provided an image that is not at all accessible, how much is because you haven't any understanding of accessibility and how much is "screw the disabled" thinking, because this would also affect how much you get fined.

When you first get fined and the plaintiff against you gets awarded money because your page is inaccessible (this would be the US), it's not over. Because there are still disabled people who want to use your application and they can't, and they will sue too, and you get fined more and more because you got fined once and you kept up with your behavior. Sooner or later you might receive a running court order - make your stuff accessible or pay this fee until it is. This sooner or later would probably be what would happen in EU, you have until this date to make your site accessible, or you are getting fined a lot.

Since your solution is totally not architected for accessibility you will need to put a lot of work into it.

Finally providing audio may or may not be considered good enough, depending on a lot of things, but most disabled people use screen readers that interact with the DOM and the Accessibility Object Model (depending on version of the application, figure as big a variation between screen readers as between IE 6, 9, Safari 2 years ago, Newest Chrome, and Firefox) if you provide something that they cannot use with their preferred accessibility tool I would bet it wouldn't be considered good enough.

The fact is your idea is not going to work as a tool for others to use, because accessibility is a legal requirement for lots of people, you will probably be sued for a lot of money, if any company did use it they would probably get sued too, maybe they would sue you if they didn't like getting sued, and in the end the AI companies can probably figure out the graphics you generate good enough to scrape it anyway.

The most likely result for following this plan to build a product would be financial ruination, how great would really depend on how initially successful you were. The greater the initial success the worse off you would eventually be.

on edit: link to the accessibility object model I forgot to put in earlier https://wicg.github.io/aom/spec/ - it's not exactly something that they work with now, but it is under development so yes, just like specs when they are under development come out in early releases with specific browsers. At any rate, it is not designed to work with complete web pages rendered as an image.

bryanrasmussen•2w ago
Now, the second question: What do you think the performance is of a page rendered to one big image and downloaded to render.

Web pages can render in pieces, images not so much. At least not the way web pages can. What is the resolution of a web page - the resolution of a web page really depends on the browser and the OS, some web pages render really high definition because that is what their OS allows (Macs for example), some browsers have more color spaces available than just RGB - many nowadays, so if your site uses more advanced color spaces are you going to render to an RGB image, meaning that your customers get less popping designs with your solution than with the browser. Or are you going to render to the most advanced image resolution possible meaning the images are going to be even bigger and it will be even harder to download.

Are you going to render multiple resolutions to give the correct resolution to user agent, so that you can save on bandwidth - by having done more renders on the server and having your customer pay for more renders.

What is caching behavior here?

I believe performance of this solution would by necessity be sub-optimal. Nobody likes a sub-optimal performance on the web, because almost all of the web is entertainment development, and people won't accept poor performance on their entertainment.

https://medium.com/luminasticity/on-premature-optimization-i...

bryanrasmussen•2w ago
sorry I don't normally go around poking holes in people's hopeful business plans, but I have thought about the problem of keeping sites from being scraped before, and there are really two issues that everybody needs to work that go against a decent anti-scraping tool and that is

automated testing requirements make it difficult to keep things from scraped, because scraping is an automation process, and automated testing is obviously and automation process.

and the needs for accessibility. Which even if you didn't care about the moral requirements of being accessible the legal requirements force you to be.

And I have also given some thoughts to automated generation of images for a sort of graphing application so I am familiar with the performance issues as well, so your question of course hit all the points that I knew something about.

bryanrasmussen•2w ago
the real battle with anti-scraping is human heuristic identification, which to get around a scraper needs to make their automated process behave more and more like a human, which results in making the process less and less financially rewarding.