frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models [pdf]

https://www.arxiv.org/pdf/2508.06471
88•SerCe•2h ago•10 comments

Wikipedia loses challenge against Online Safety Act

https://www.bbc.com/news/articles/cjr11qqvvwlo
649•phlummox•11h ago•498 comments

I tried every todo app and ended up with a .txt file

https://www.al3rez.com/todo-txt-journey
859•al3rez•13h ago•538 comments

Show HN: I built an offline, open‑source desktop Pixel Art Editor in Python

https://github.com/danterolle/tilf
69•danterolle•5h ago•9 comments

GitHub is no longer independent at Microsoft after CEO resignation

https://www.theverge.com/news/757461/microsoft-github-thomas-dohmke-resignation-coreai-team-transition
1001•Handy-Man•12h ago•738 comments

FreeBSD Scheduling on Hybrid CPUs

https://wiki.freebsd.org/Scheduler/Hybrid
26•fntlnz•3d ago•7 comments

Claude Code is all you need

https://dwyer.co.za/static/claude-code-is-all-you-need.html
519•sixhobbits•13h ago•292 comments

Chris Simpkins, creator of Hack font, has died

https://typo.social/@Hilary/114845913381245488
17•laqq3•57m ago•1 comments

Show HN: Play Pokémon to unlock your Wayland session

https://github.com/AdoPi/wlgblock
78•anajimi•1d ago•34 comments

OpenSSH Post-Quantum Cryptography

https://www.openssh.com/pq.html
363•throw0101d•15h ago•96 comments

Neki – sharded Postgres by the team behind Vitess

https://planetscale.com/blog/announcing-neki
162•thdxr•9h ago•22 comments

Launch HN: Halluminate (YC S25) – Simulating the internet to train computer use

52•wujerry2000•12h ago•38 comments

How to teach your kids to play poker: Start with one card

https://www.bloomberg.com/news/articles/2025-08-08/how-to-teach-your-kids-poker-with-one-card-at-age-four
47•ioblomov•3d ago•62 comments

Japan's largest paper, Yomiuri Shimbun, sues Perplexity for copyright violations

https://www.niemanlab.org/2025/08/japans-largest-newspaper-yomiuri-shimbun-sues-perplexity-for-copyright-violations/
70•aspenmayer•3h ago•18 comments

What does it mean to be thirsty?

https://www.quantamagazine.org/what-does-it-mean-to-be-thirsty-20250811/
22•pseudolus•4h ago•5 comments

Why tail-recursive functions are loops

https://kmicinski.com/functional-programming/2025/08/01/loops/
72•speckx•3d ago•86 comments

Ollama and gguf

https://github.com/ollama/ollama/issues/11714
102•indigodaddy•9h ago•45 comments

Supreme Court formally asked to overturn same-sex marriage ruling

https://abcnews.go.com/Politics/supreme-court-formally-asked-overturn-landmark-same-sex/story?id=124465302
41•1659447091•50m ago•2 comments

The value of institutional memory

https://timharford.com/2025/05/the-value-of-institutional-memory/
119•leoc•10h ago•74 comments

U.S. preparing IPO for Fannie Mae and Freddie Mac later this year

https://www.wsj.com/finance/regulation/trump-aiming-to-ipo-fannie-mae-and-freddie-mac-later-this-year-13b138cf
23•JumpCrisscross•3d ago•3 comments

The History of Windows XP

https://www.abortretry.fail/p/the-history-of-windows-xp
35•achairapart•1d ago•17 comments

36B solar mass black hole at centre of the Cosmic Horseshoe gravitational lens

https://academic.oup.com/mnras/article/541/4/2853/8213862?login=false
125•bookofjoe•13h ago•87 comments

The Joy of Mixing Custom Elements, Web Components, and Markdown

https://deanebarker.net/tech/blog/custom-elements-markdown/
88•deanebarker•11h ago•32 comments

Show HN: Keeps – Mail a postcard that plays your voice

https://www.sendkeeps.com/
9•dinnison•4h ago•10 comments

Byte Buddy is a code generation and manipulation library for Java

https://bytebuddy.net/
74•mooreds•3d ago•24 comments

Trellis (YC W24) Is Hiring: Automate Prior Auth in Healthcare

https://www.ycombinator.com/companies/trellis/jobs/Cv3ZwXh-forward-deployed-engineers-all-levels-august-2025
1•jackylin•10h ago

AOL to discontinue dial-up internet

https://www.nytimes.com/2025/08/11/business/aol-dial-up-internet.html
153•situationista•20h ago•177 comments

How Boom uses software to accelerate hardware development

https://bscholl.substack.com/p/move-fast-and-dont-break-safety-critical
75•flabber•1d ago•59 comments

Optimizing my sleep around Claude usage limits

https://mattwie.se/no-sleep-till-agi
162•mattwiese•1d ago•113 comments

Pricing Pages – A Curated Gallery of Pricing Page Designs

https://pricingpages.design/
197•finniansturdy•15h ago•59 comments
Open in hackernews

Japan's largest paper, Yomiuri Shimbun, sues Perplexity for copyright violations

https://www.niemanlab.org/2025/08/japans-largest-newspaper-yomiuri-shimbun-sues-perplexity-for-copyright-violations/
69•aspenmayer•3h ago

Comments

aspenmayer•3h ago
Original title edited for length:

> Japan’s largest newspaper, Yomiuri Shimbun, sues AI startup Perplexity for copyright violations

ujkhsjkdhf234•2h ago
Before someone mentions Japan effectively making all data fair use for AI training, Japan specifically forbids direct recreation which is what this lawsuit is about.
ants_everywhere•2h ago
If they are copying and pasting news articles on their site, that's a pretty straightforward copyright case I would think.

In the US at least this should be pretty well covered by the case law on news aggregators.

ronsor•2h ago
I don't know why Perplexity in particular gets everyone in a nit. It's not even particularly special: a user inputs a query, an AI model does a web search and fetches some pages on the user's behalf, and then it serves the result to the user.

Putting aside that other products, such as OpenAI's ChatGPT and modern Google Search have the same "AI-powered web search" functionality, I can't see how this is meaningfully different from a user doing a web search and pasting a bunch of webpages into an LLM chat box.

> But what about ad revenue?

The user could be using an ad blocker. If they're using Perplexity at all, they probably already are. There's no requirement for a user agent to render ads.

> But robots.txt!!!11

`robots.txt` is for recursive, fully automated requests. If a request is made on behalf of a user, through direct user interaction, then it may not be followed and IMO shouldn't be followed. If you really want to block a user agent, it's up to you to figure out how to serve a 403.

> It's breaking copyright by reproducing my content!

Yes, so does the user's browser. The purpose of a user agent is to fetch and display content how the user wants. The manner in which that is done is irrelevant.

jaredwiener•1h ago
There's a difference between what is technically feasible and what is allowed, legally or even morally.

Just because it is possible -- or even easy -- to essentially steal from newspapers/other media outlets, doesn't make it right, or legal. The people behind it put in labor, financial resources, and time to create a product that, like almost every other service, has terms attached -- and those usually come with some form of monetization. Maybe it is a paywall, maybe it is advertisements -- but it is there.

Using an adblocker, or finding some loophole around a paywall, etc, are all very easy to do technically, as any reader of this site knows. That said, the media outlet doesn't have to allow it. And when it is violated on an industrial scale, like Perplexity, then they can be understandably upset and take legal action. And that includes any AI (or other technology, for that matter) that is a wrapper around plagiarism.

Sites opted in to Google originally because it fed them traffic. They most likely did not opt in to an AI rewriter that takes their work and republishes it without any compensation.

Alex4386•41m ago
Well, some bots even spoof User-Agents, requesting tons of requests without proper rate-limiting (looking at you, ByteSpider)

No fair plays done by people, even before the LLMs, so we get the PoW challenge on everywhere.

And what is that conclusion? since Adblockers are used by anywhere, it is OK to corporates not to license them directly and just yank them and put it into curation service? especially without ads? that's a licensing issue. the author allowed you to view the article if you provide them monetary support (i.e. ads), they didn't allow you to reproduce and republish the work by default.

also calling browser itself as reproducing? Yes, the data might be copied in memory (but I wouldn't call it as reproducing material, more like transfer from the server to another), but redistribution is the main point here.

It's like saying well, "the part of the variable is replicated to register from the L2 cache, so whole file on DRAM can be authorized to reproduce", Your point of calling "it's reproducing and should not be reproduced in first place" can't be prevented unless you bring non-turing computers that doesn't use active memory.

kazinator•4m ago
[delayed]
daedrdev•2h ago
Japan has extremely favorable copyright laws to the holders. My understanding is that without explicit permission, there is no fair use and so any reproduction or modified work is only allowed as long as they don't request a takedown.
beepbooptheory•1h ago
From tfa:

> Japan’s copyright law allows AI developers to train models on copyrighted material without permission. This leeway is a direct result of a 2018 amendment to Japan’s Copyright Act, meant to encourage AI development in the country’s tech sector. The law does not, however, allow for wholesale reproduction of those works, or for AI developers to distribute copies in a way that will “unreasonably prejudice the interests of the copyright owner.”

Alex4386•44m ago
tl;dr: If you are not directly affecting the "sales" of the product, you are good to go. But It seems perplexity did, and (as they might call it) directly trying to compete as a news source

Personally, About their news service, Their news summarization is kinda misleading with AI hallucination in some places.

kazinator•9m ago
[delayed]
totetsu•2h ago
The Japan Newspaper Publishers & Editors Association is very active lobbying about this area https://www.pressnet.or.jp/english/
charcircuit•2h ago
It's best not to crawl Japanese newspapers. Japan does not have the same kind of fair use. Even reproducing facts from a newspaper can be infringing.
SilverElfin•1h ago
I don’t understand why corporations can violate copyright laws at hyper scale but individuals are banned from small scale piracy through authoritarian internet governance.
mlinhares•39m ago
The law only exists for those without enough money and influence to control the enforcers.
ranyume•25m ago
It's because people are allowing corporations, the elite, governments, to do as they please. In this hedonistic, shallow, era nobody wants to sacrifice themselves for a cause. Except some rare cases like luigi.
mattigames•22m ago
I wish there was a open fund anyone could donate with the exclusive aim of suing Perplexity, OpenAI and others for copyright violations, where a team of lawyers would help the cases with the most likelihood to win, that would try to highlight that the way such systems are "learning" have little similitude to the intent of the law when it was written to give layaway for other artists/authors to create similar creations.
miohtama•19m ago
I wish there would be an open fund that allows me to do opposite and the fund would countersue copyright holders for holding development back and demanding excessive mafia payments
CamperBob2•4m ago
Amazing how many copyright maximalists there are on a site called "Hacker News."

Seems to be a fairly recent trend. Wonder what changed.

jongjong•14m ago
IMO the legal system is in disarray due to extreme asymmetries in how the law is selectively applied.

First of all, the way certain platforms get sued for certain activities while others are left alone is unfair and creates significant market distortions.

Then there is the fact that wealthy individuals have much better legal representation than non-wealthy individuals.

Then there are tax loopholes which create market asymmetries above that.