frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

DeepSeek-v3.1-Terminus

https://api-docs.deepseek.com/news/news250922
66•meetpateltech•3h ago

Comments

sbinnee•2h ago
> What’s improved? Language consistency: fewer CN/EN mix-ups & no more random chars.

It's good that they made this improvement. But is there any advantages at this point using DeepSeek over Qwen?

IgorPartola•2h ago
I wish there was some easy resource to keep up with the latest models. The best I have come up with so far is asking one model to research the others. Realistically I want to know latest versions, best use case, performance (in terms of speed) relative to some baseline, and hardware requirements to run it.
exe34•2h ago
> asking one model to research the others.

that's basically choosing are random with extra steps!

throwup238•1h ago
Research not spit out the answer based on weights. Just ask Gemini/Claude to do deep research on /r/LocalLLama and HN posts.
Jgoauh•1h ago
have you tried https://artificialanalysis.ai/
JimDugan•22m ago
Dumb collation of benchmarks that the big labs are essentially training on. Livebench.ai is the industry standard - non contaminated, new questions every few months.
IgorPartola•4m ago
Thanks! Are the scores in some way linear here? As in, if model A is rated at 25 and model B at 50, does that mean I will have half the mistakes with model B? Get answers that are 2x more accurate? Or is it subjective?
comrade1234•2h ago
MIT license that lets you run it on your own hardware and make money off of it.
coder543•1h ago
Qwen3 models (including their 235B and 480B models) use the Apache-2.0 license, so it’s not like that’s a big difference here.
coder543•1h ago
They seem fairly competitive with each other. You would have to benchmark them for your specific use case.
yu3zhou4•2h ago
I see no article in the link, just "news250922" header with some layout
meetpateltech•2h ago
It’s up again, check it.

Twitter/X post link: https://twitter.com/deepseek_ai/status/1970117808035074215

Also Hugging Face model link: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Terminus

bratao•2h ago
The link is off. This link works https://api-docs.deepseek.com/updates#deepseek-v31-terminus
esafak•1h ago
Notable performance improvement in agentic tool use: https://huggingface.co/deepseek-ai/DeepSeek-V3.1-Terminus

The Deepseek provider may train on your prompts: https://openrouter.ai/deepseek/deepseek-v3.1-terminus

storus•1h ago
I tried V3.1 but it was driving me crazy by ignoring parts of user input, which R1 never did. I had many such instances when e.g. asking about running DeepSeek 671B it instead picked DeepSeek 67B because 671B is too large to exist so I must have made a mistake etc. I concluded that despite being better in benchmarks than R1, it was essentially useless due to this characteristics and I instead started using R1 at OpenRouter. Not sure why deepseek.com removed R1 and left only V3.1 without any ability to switch back, I guess it's cheaper to run.

Dear GitHub: no YAML anchors, please

https://blog.yossarian.net/2025/09/22/dear-github-no-yaml-anchors
87•woodruffw•1h ago•53 comments

Cloudflare: A New Internet Business Model

https://blog.cloudflare.com/cloudflare-2025-annual-founders-letter/
15•mmaia•22m ago•6 comments

A Simple Way to Measure Knots Has Come Unraveled

https://www.quantamagazine.org/a-simple-way-to-measure-knots-has-come-unraveled-20250922/
20•baruchel•47m ago•3 comments

Cloudflare is sponsoring Ladybird and Omarchy

https://blog.cloudflare.com/supporting-the-future-of-the-open-web/
159•jgrahamc•2h ago•93 comments

Easy Forth

https://skilldrick.github.io/easyforth/
112•pkilgore•3h ago•46 comments

CompileBench: Can AI Compile 22-year-old Code?

https://quesma.com/blog/introducing-compilebench/
73•jakozaur•2h ago•15 comments

PlanetScale announces PlanetScale for Postgres is GA

https://planetscale.com/blog/planetscale-for-postgres-is-generally-available
25•munns•25m ago•4 comments

What is algebraic about algebraic effects?

https://interjectedfuture.com/what-is-algebraic-about-algebraic-effects/
21•iamwil•1h ago•3 comments

Cap'n Web: a new RPC system for browsers and web servers

https://blog.cloudflare.com/capnweb-javascript-rpc-library/
42•jgrahamc•2h ago•6 comments

Kmart's use of facial recognition to tackle refund fraud unlawful

https://www.oaic.gov.au/news/media-centre/18-kmarts-use-of-facial-recognition-to-tackle-refund-fr...
171•Improvement•5h ago•128 comments

SGI demos from long ago in the browser via WASM

https://github.com/sgi-demos
162•yankcrime•7h ago•36 comments

How I, a beginner developer, read the tutorial you, a developer, wrote for me

https://anniemueller.com/posts/how-i-a-non-developer-read-the-tutorial-you-a-developer-wrote-for-...
651•wonger_•14h ago•313 comments

Beyond the Front Page: A Personal Guide to Hacker News

https://hsu.cy/2025/09/how-to-read-hn/
75•firexcy•5h ago•33 comments

Anti-*: The Things We Do but Not All the Way

https://blog.jim-nielsen.com/2025/my-antis/
4•gregwolanski•32m ago•0 comments

A Beautiful Maths Game

https://sinerider.com/
49•waonderer•2d ago•15 comments

What if we treated Postgres like SQLite?

https://www.maragu.dev/blog/what-if-we-treated-postgres-like-sqlite
11•markusw•2h ago•4 comments

You did this with an AI and you do not understand what you're doing here

https://hackerone.com/reports/3340109
735•redbell•7h ago•352 comments

M4.6 Earthquake – 2 km ESE of Berkeley, CA

https://earthquake.usgs.gov/earthquakes/eventpage/ew1758534970/executive
132•brian-armstrong•5h ago•75 comments

Biconnected components

https://emi-h.com/articles/bcc.html
33•emih•16h ago•7 comments

Privacy and Security Risks in the eSIM Ecosystem [pdf]

https://www.usenix.org/system/files/usenixsecurity25-motallebighomi.pdf
215•walterbell•11h ago•114 comments

Show HN: Software Freelancers Contract Template

https://sopimusgeneraattori.ohjelmistofriikit.fi/?lang=en
100•baobabKoodaa•8h ago•38 comments

The Counterclockwise Experiment

https://domofutu.substack.com/p/the-counterclockwise-experiment
49•domofutu•2d ago•16 comments

DeepSeek-v3.1-Terminus

https://api-docs.deepseek.com/news/news250922
66•meetpateltech•3h ago•15 comments

Why Local-First Apps Haven't Become Popular?

https://marcobambini.substack.com/p/why-local-first-apps-havent-become
107•marcobambini•2h ago•136 comments

The death rays that guard life

https://worksinprogress.co/issue/the-death-rays-that-guard-life/
33•ortegaygasset•4d ago•18 comments

We Politely Insist: Your LLM Must Learn the Persian Art of Taarof

https://arxiv.org/abs/2509.01035
119•chosenbeard•15h ago•69 comments

Why is Venus hell and Earth an Eden?

https://www.quantamagazine.org/why-is-venus-hell-and-earth-an-eden-20250915/
168•pseudolus•16h ago•282 comments

What if AMD FX had "real" cores? [video]

https://www.youtube.com/watch?v=Lb4FDtAwnqU
21•zdw•3d ago•16 comments

How can I influence others without manipulating them?

https://andiroberts.com/leadership-questions/how-to-influence-others-without-manipulating
185•kiyanwang•17h ago•181 comments

Simulating a Machine from the 80s

https://rmazur.io/blog/fahivets.html
64•roman-mazur•3d ago•10 comments