frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

LaTeX Coffee Stains [pdf]

https://ctan.math.illinois.edu/graphics/pgf/contrib/coffeestains/coffeestains-en.pdf
52•zahrevsky•55m ago•13 comments

Sugar industry influenced researchers and blamed fat for CVD

https://www.ucsf.edu/news/2016/09/404081/sugar-papers-reveal-industry-role-shifting-national-hear...
47•aldarion•1h ago•6 comments

“Stop Designing Languages. Write Libraries Instead” (2016)

https://lbstanza.org/purpose_of_programming_languages.html
147•teleforce•3h ago•95 comments

A4 Paper Stories

https://susam.net/a4-paper-stories.html
108•blenderob•2h ago•53 comments

Meditation as Wakeful Relaxation: Unclenching Smooth Muscle

https://psychotechnology.substack.com/p/meditation-as-wakeful-relaxation
19•surprisetalk•38m ago•1 comments

US Job Openings Decline to Lowest Level in More Than a Year

https://www.bloomberg.com/news/articles/2026-01-07/us-job-openings-decline-to-lowest-level-in-mor...
19•toomuchtodo•8m ago•1 comments

Shipmap.org

https://www.shipmap.org/
4•surprisetalk•37m ago•0 comments

The Eric and Wendy Schmidt Observatory System

https://www.schmidtsciences.org/schmidt-observatory-system/
46•pppone•3h ago•32 comments

Show HN: KeelTest – AI-driven VS Code unit test generator with bug discovery

https://keelcode.dev/keeltest
16•bulba4aur•2h ago•4 comments

Formal methods only solve half my problems

https://brooker.co.za/blog/2022/06/02/formal.html
52•signa11•4d ago•18 comments

Sergey Brin's Unretirement

https://www.inc.com/jessica-stillman/google-co-founder-sergey-brins-unretirement-is-a-lesson-for-...
282•iancmceachern•6d ago•348 comments

We might have been slower to abandon StackOverflow if it wasn't a toxic hellhole

https://www.pcloadletter.dev/blog/abandoning-stackoverflow/
10•ronbenton•14m ago•3 comments

Optery (YC W22) Hiring a CISO and Web Scraping Engineers (Node) (US and Latam)

https://www.optery.com/careers/
1•beyondd•3h ago

Vector graphics on GPU

https://gasiulis.name/vector-graphics-on-gpu/
115•gsf_emergency_6•4d ago•21 comments

Stop Doom Scrolling, Start Doom Coding: Build via the terminal from your phone

https://github.com/rberg27/doom-coding
514•rbergamini27•20h ago•356 comments

Opus 4.5 is not the normal AI agent experience that I have had thus far

https://burkeholland.github.io/posts/opus-4-5-change-everything/
696•tbassetto•21h ago•1007 comments

Show HN: Comet MCP – Give Claude Code a browser that can click

https://github.com/hanzili/comet-mcp
14•hanzili•3d ago•11 comments

Electronic nose for indoor mold detection and identification

https://advanced.onlinelibrary.wiley.com/doi/10.1002/adsr.202500124
162•PaulHoule•15h ago•91 comments

Quake Brutalist Jam III

https://www.slipseer.com/index.php?resources/quake-brutalist-jam-iii.549/
44•Venn1•2d ago•4 comments

Show HN: RepoReaper – AST-aware, JIT-loading code audit agent (Python/AsyncIO)

https://github.com/tzzp1224/RepoReaper
4•realdexter•1h ago•0 comments

A 30B Qwen model walks into a Raspberry Pi and runs in real time

https://byteshape.com/blogs/Qwen3-30B-A3B-Instruct-2507/
298•dataminer•18h ago•103 comments

Show HN: SMTP Tunnel – A SOCKS5 proxy disguised as email traffic to bypass DPI

https://github.com/x011/smtp-tunnel-proxy
106•lobito25•15h ago•35 comments

The creator of Claude Code's Claude setup

https://twitter.com/bcherny/status/2007179832300581177
493•KothuRoti•4d ago•330 comments

Vietnam bans unskippable ads

https://saigoneer.com/vietnam-news/28652-vienam-bans-unskippable-ads,-requires-skip-button-to-app...
1487•hoherd•22h ago•760 comments

The first new compass since 1936

https://www.youtube.com/watch?v=eiDhbZ8-BZI
57•1970-01-01•5d ago•38 comments

On the slow death of scaling

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5877662
99•sethbannon•11h ago•20 comments

I wanted a camera that doesn't exist, so I built it

https://medium.com/@cristi.baluta/i-wanted-a-camera-that-doesnt-exist-so-i-built-it-5f9864533eb7
424•cyrc•4d ago•132 comments

Everyone hates OneDrive, Microsofts cloud app that steals and deletes files

https://boingboing.net/2026/01/05/everyone-hates-onedrive-microsofts-cloud-app-that-steals-then-d...
105•mikecarlton•1h ago•99 comments

Oral microbiome sequencing after taking probiotics

https://blog.booleanbiotech.com/oral-microbiome-biogaia
170•sethbannon•18h ago•75 comments

We recreated Steve Jobs's 1975 Atari horoscope program

https://blog.adafruit.com/2026/01/06/we-recreated-steve-jobss-1975-atari-horoscope-program-and-yo...
91•ptorrone•14h ago•41 comments
Open in hackernews

Mistral Agents API

https://mistral.ai/news/agents-api
152•pember•7mo ago

Comments

orliesaurus•7mo ago
Whoever made those embedded videos, here some feedback if you want it take it, it's free:

1) It's really hard to follow some of the videos since you're just copy pasting the prompts fr your agents into the chat because the output generation comes out and hides the prompts. Instead put the prompt text as an overlay/subtitle-like so we know what you're doing

2) The clicking sound of you copy pasting and typing is not ASMR, please just mute it next time

3) Please zoom into the text more, not everyone has 20/20 super vision 4K style

ianhawes•7mo ago
4) Use a clean browser profile so you don't show unrelated autocomplete
threeducks•7mo ago
To add to 3): YouTube embedded videos default to 360p for me even if I maximize the embedded video on my 4k screen, which is completely unreadable. This is probably an attempt by YouTube to get viewers to click through to the YouTube website. It is probably not in Mistral's best interest to funnel viewers to YouTube, so they should use a different video host.

But even at maximum 1080p resolution, the image quality is not that great. And while we are at it, the wine-red (#833048) on dark-brown (#23231F) syntax highlighting for keyword arguments has very poor contrast ratio of around 1.8 to 1: https://webaim.org/resources/contrastchecker/ which earns a rating of "Fail" across the categories normal text, large text and UI elements.

moralestapia•7mo ago
I came here to see if anyone else noticed.

Very sloppy job, imo.

It costs next to nothing to come up with a little story and have someone on Fiverr narrate it (or an AI, after all that's what they sell).

bbor•7mo ago
Ok I’m behind the times in terms of MCP implementation, so would appreciate a check: the appeal of this feature is that you can pass off the “when to call which MCP endpoint and with what” logic to Mistral, rather than implementing it yourself? If so I’m not sure I completely understand why I’d want a model-specific, remote solution for this rather than a single local library, since theoretically this logic should be the same for any given LLM/MCP toolset pairing. Just simpler?

It certainly looks easy to implement, I will say that! Docs halfway down the page: https://docs.mistral.ai/agents/mcp/

potatolicious•7mo ago
It seems like the main pitch here is auto-inclusion and auto-exclusion of various tools via an orchestration agent (which may or may not be the main model itself? Unclear from their post)

Mostly this seems like an end-run around tool calling scalability limits. Model performance degrades heavily if the field of possible tools gets too large, so you insert a component into the system that figures out what tools should be in-scope, and make only those available, to get reliability higher.

In terms of "why outsource this" it seems like the idea is that their orchestration agent would be better than a cruder task state machine that you would implement yourself. Time will tell if this assertion is true!

ed•7mo ago
> auto-inclusion and auto-exclusion of various tools via an orchestration agent

Where do you see that? That would be neat, I'm under the impression orchestration is manual though – you define an agent and give it the ability to hand off tasks to sub-agents.

potatolicious•7mo ago
Sorry, maybe I could've phrased it better: it basically forces the devs to divide their tools into buckets of fewer tools manually. (The Travel Agent has N tools, the Research Agent has M tools, etc. all specified by the dev)

The pitch is that if you do this bucketization, the overall orchestrator can intelligently pick the bucket to use, but the idea is that at any moment the LLM is only exposed to a limited set of tools.

As opposed to the more pie-in-the-sky idea that given N tools (where N is very very large) the LLM can still accurately tool-select without any developer intervention. This seems pretty far off at this point.

htrp•7mo ago
is mistral a model company, an agent company, or a enterprise software company now?
nomsters•7mo ago
yes
greenavocado•7mo ago
Mistral is trying to be everything at once and it shows. To make ends meet they pivoted to selling enterprise software through Le Chat and cozying up to Microsoft. Now they're throwing around terms like "agentic AI" to stay trendy, even as competitors like DeepSeek outperform them in key areas. Their identity crisis is obvious. Are they a model company? A software vendor? A research lab? At this point, they seem more like a startup chasing hype and funding than a company with a clear direction. The 6 billion Euro valuation looks impressive, but with so many shifts in strategy, you have to wonder if they're building something lasting or just riding the AI wave until it crashes.
eigenspace•7mo ago
Their strategy doesn't make sense to you because you're looking for a technical feature that differentiates them. But technical features aren't their key differentiator, geography is their key differentiator. They'll get a lot of contracts in Europe simply because they're European. Everyone is keenly aware of how dependant European tech stacks are on increasingly unfriendly foreign powers.

If there's a local European option that does most of what an American or Chinese company does, that's simply a safer choice.

From this point of view, them trying to do everything at once makes a lot of sense. They don't actually need to be the absolute best or even the cheapest at any one thing. They need to just exist in Europe, be stable, and offer good services that people want. Casting a wide net is a better strategy for them.

Raed667•7mo ago
Do they need to pick one? Their offering doesn't seem incoherent to me
brandall10•7mo ago
Couldn't the same questions be asked of OpenAI and Anthropic?

Ultimately these are product/service companies, levering their research and innovations as differentiators.

If you're "only a model" company you likely have no moat.

FailMore•7mo ago
Is this basically a LLM that has tools automatically configured so I don’t have to handle that myself? Or am I not understanding it correctly? As in do I just make standard requests , but the LLM does more work than normal before sending me a response? Or I get the response to every step?
spmurrayzzz•7mo ago
The aspirational goal is that the model knows what tools to call and when, without human intervention. In practice, you'll see varying efficacy with that depending on the tools you need. Some of the tool usage is in-distribution / well represented in training set, but if you have some custom exotic MCP server you created yourself (or pulled off of some random github) you may see mixed results. Sometimes that can be fixed by simply augmenting your prompt with contrastive examples of how to use or not use the tool.

As an aside, my experience with devstral (both via API and locally w/ open weights) has been very underwhelming to this effect. So I'm curious how this new agent infra performs given that observation.

koakuma-chan•7mo ago
It's a software framework for orchestrating agents. Each agent can have its own system prompt, its own tools, and it can delegate ("hand off") to a different agent. When a hand off occurs, the LLM runs again but as a different agent.
manmal•7mo ago
Like Gemini Gems, but agentic?
koakuma-chan•7mo ago
Gemini Gems seems to be a ChatGPT “GPTs” equivalent, and I never figured out what those actually are. Mistral Agents API is like OpenAI Agents SDK.
LeoPanthera•7mo ago
Gems and GPTs are just a way to customize the system prompt from the web UI.
qwertox•7mo ago
The "My MCPs" button looks very promising.

I was looking around at Le Chat, a thing I haven't done in months, and I thought that they've really worked on interesting stuff in interesting ways.

The ability to enrich either a chat or generally an agent with one or more libraries has been solved in a very friendly way. I don't think OpenAI nor Anthropic have solved it so well.