frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

DeepSeek-v3.1 Release

https://api-docs.deepseek.com/news/news250821
116•wertyk•1h ago

Comments

hodgehog11•48m ago
For reference, here is the terminal-bench leaderboard:

https://www.tbench.ai/leaderboard

Looks like it doesn't get close to GPT-5, Claude 4, or GLM-4.5, but still does reasonably well compared to other open weight models. Benchmarks are rarely the full story though, so time will tell how good it is in practice.

seunosewa•41m ago
The DeepSeek R1 in that list is the old model that's been replaced.
yorwba•16m ago
Yes, and 31.3% is given in the announcement as the performance of the new v3.1, which would put it in sixteenth place.
seunosewa•3m ago
So it's not that great for shell terminals.
coliveira•22m ago
My personal experience is that it produces high quality results.
amrrs•17m ago
Any example or prompt you use to make this statment?
YetAnotherNick•13m ago
Depends on the agent. Rank 5 and 15 are claude 4 sonnet, and this stands close to 15th.
seunosewa•44m ago
It's a hybrid reasoning model. It's good with tool calls and doesn't think too much about everything, but it regularly uses outdated tool formats randomly instead of the standard JSON format. I guess the V3 training set has a lot of those.
esafak•38m ago
It seems behind Qwen3 235B 2507 Reasoning (which I like) and gpt-oss-120B: https://artificialanalysis.ai/models/deepseek-v3-1-reasoning

Pricing: https://openrouter.ai/deepseek/deepseek-chat-v3.1

bigyabai•31m ago
Those Qwen3 2507 models are the local creme-de-la-creme right now. If you've got any sort of GPU and ~32gb of RAM to play with, the A3B one is great for pair-programming tasks.
pdimitar•17m ago
Do you happen to know if it can be run via an eGPU enclosure with f.ex. RTX 5090 inside, under Linux?

I'm considering buying a Linux workstation lately and I want it full AMD. But if I can just plug an NVIDIA card via an eGPU card for self-hosting LLMs then that would be amazing.

gunalx•2m ago
You would still need drivers and all the stuff difficult with nvidia in linux with a egpu. (Its not nessecarily terrible just suboptimal) Rather just add the second GPU in the Workstation, or just run the llm in your AMD GPU.

2030: Privacy's Dead. What happens next? [video]

https://www.youtube.com/watch?v=_kBlH-DQsEg
1•TheFreim•4m ago•0 comments

Ask HN: Should the police be banned from saying whether a shooting is targeted?

1•amichail•4m ago•0 comments

xbyak: A JIT assembler for x86/x64 architectures

https://github.com/herumi/xbyak
1•d_tr•5m ago•0 comments

Exotic Programming Ideas

https://www.stephendiehl.com/posts/exotic_01/
2•ibobev•5m ago•0 comments

AI Isn't Coming for Hollywood. It's Already Arrived

https://www.wired.com/story/artificial-intelligence-hollywood-stability/
2•TMWNN•5m ago•0 comments

Ask HN: Do you think programming as a job will end soon and if so, how soon?

1•akkad33•6m ago•0 comments

Rolling the Dice with CSS Random()

https://webkit.org/blog/17285/rolling-the-dice-with-css-random/
1•zdw•6m ago•0 comments

Fundamental Research Labs

https://fundamentalresearchlabs.com/
1•fzliu•9m ago•0 comments

The Stochastic Code Monkey Theorem

https://www.stephendiehl.com/posts/ai_for_coding/
2•ibobev•9m ago•0 comments

Typechecker Zoo

https://www.stephendiehl.com/posts/typechecker_zoo/
2•ibobev•10m ago•0 comments

Kanye Quest 3030 and the Ascensionist Cult

https://en.wikipedia.org/wiki/Kanye_Quest_3030
1•sans_souse•10m ago•0 comments

A Novel Idea: Read More Fiction

https://www.wsj.com/opinion/heres-a-novel-idea-read-more-fiction-9bdb783d
2•Bostonian•12m ago•1 comments

Which Is Better: Build 100 Startups or 100x One Startup?

https://www.subbuddy.io/
1•Alecocluc•12m ago•1 comments

Reproducing prospect theory with 'differentiable decision theories'

https://www.science.org/doi/full/10.1126/science.abe2629
1•timshell•13m ago•0 comments

Ask HN: What's the best resource that helped you improve your focus?

1•Shane325•14m ago•0 comments

The Epochalypse: It's Y2K, but 38 Years Later

https://hackaday.com/2025/07/22/the-epochalypse-y2k-but-38-years-later/
1•austinallegro•15m ago•0 comments

New Schizophrenia Genes Discovered

https://www.cardiff.ac.uk/news/view/2930618-scientists-discover-eight-new-schizophrenia-genes
2•gmays•15m ago•0 comments

Tim Cook Reveals Apple's Vision for Movies and TV

https://variety.com/2025/film/news/f1-apple-movie-strategy-tim-cook-lewis-hamilton-1236424270/
1•mgh2•15m ago•0 comments

Using Derive_more for Errors in Rust

https://quamserena.com/2025-08-02/using-derive-more-for-errors-in-rust
1•quamserena•16m ago•0 comments

Change the default temp file prefix to be "SQLite" spelled backwards (2006)

https://www.sqlite.org/src/info/a19ce5c1c4e86d14
1•susam•16m ago•0 comments

AI giants race to secure user data

https://restofworld.org/2025/ai-data-collection-global-deals/
1•colinprince•16m ago•0 comments

Satoshi Nakamoto

https://en.wikipedia.org/wiki/Satoshi_Nakamoto
1•sans_souse•17m ago•0 comments

How AI Is Driving Us Back to Human Connection at Work

https://people-work.io/blog/how-ai-driving-human-connection-work/
2•mooreds•18m ago•0 comments

A neuroscientist's guide to banishing stress, self-doubt and loneliness

https://www.telegraph.co.uk/health-fitness/wellbeing/mental-health/neuroscientist-happy-brain/
1•domofutu•19m ago•0 comments

Rust vs. C++ with Steve Klabnik and Herb Sutter [video]

https://www.youtube.com/watch?v=XdHBSxDsVlM
3•nicoburns•20m ago•0 comments

How to create an engineering team that hits their deadlines

https://blog.yasuflores.me/posts/hitting-deadlines/
1•carlosyasu91•20m ago•1 comments

Uv format: Code Formatting Comes to uv (experimentally)

https://pydevtools.com/blog/uv-format-code-formatting-comes-to-uv-experimentally/
3•tanelpoder•23m ago•0 comments

Show HN: Convosphere – Real-life proximity chat

https://convosphere.app/
2•jothetaha•25m ago•1 comments

Closed Beta for Populus,Synthetic Population Modeling

https://epistemix.com/populus
1•AmruthaK•25m ago•1 comments

Internet-in-a-Box

https://mdwiki.org/wiki/WikiProjectMed:Internet-in-a-Box
1•colinprince•26m ago•0 comments