frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Accelerating Gemma 4: faster inference with multi-token prediction drafters

https://blog.google/innovation-and-ai/technology/developers-tools/multi-token-prediction-gemma-4/
245•amrrs•3h ago•96 comments

Three Inverse Laws of AI

https://susam.net/inverse-laws-of-robotics.html
258•blenderob•4h ago•169 comments

Computer Use is 45x more expensive than structured APIs

https://reflex.dev/blog/computer-use-is-45x-more-expensive-than-structured-apis/
148•palashawas•3h ago•91 comments

EEVblog: The 555 Timer is 55 years old [video]

https://www.youtube.com/watch?v=6JhK8iCQuqI
137•brudgers•4h ago•35 comments

Google Chrome silently installs a 4 GB AI model on your device without consent

https://www.thatprivacyguy.com/blog/chrome-silent-nano-install/
923•john-doe•12h ago•631 comments

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

https://arxiv.org/abs/2604.26752
57•gmays•2h ago•12 comments

Agents for financial services and insurance

https://www.anthropic.com/news/finance-agents
135•louiereederson•4h ago•107 comments

IBM didn't want Microsoft to use the Tab key to move between dialog fields

https://devblogs.microsoft.com/oldnewthing/20260505-00/?p=112298
187•SeenNotHeard•2h ago•105 comments

Show HN: Explore color palettes inspired by 3000 master painter artworks

https://paletteinspiration.com/
25•ouli•1h ago•10 comments

California farmers to destroy 420k peach trees following Del Monte bankruptcy

https://www.sfgate.com/centralcoast/article/usda-aid-california-farmers-22240694.php
90•littlexsparkee•1h ago•87 comments

I'm scared about biological computing

https://kuber.studio/blog/Reflections/I%27m-Scared-About-Biological-Computing
90•kuberwastaken•3h ago•62 comments

Proliferate (YC S25) Is Hiring- 200k for junior engineers

https://www.ycombinator.com/companies/proliferate/jobs/L3copvK-founding-engineer
1•pablo24602•2h ago

Show HN: Airbyte Agents – context for agents across multiple data sources

60•mtricot•4h ago•7 comments

When everyone has AI and the company still learns nothing

https://www.robert-glaser.de/when-everyone-has-ai-and-the-company-still-learns-nothing/
241•youngbrioche•10h ago•172 comments

Should I Run Plain Docker Compose in Production in 2026?

https://distr.sh/blog/running-docker-in-production/
293•pmig•5d ago•218 comments

Async Rust never left the MVP state

https://tweedegolf.nl/en/blog/237/async-rust-never-left-the-mvp-state
394•pjmlp•12h ago•215 comments

Quantum Key Distribution (QKD) and Quantum Cryptography (QC)

https://www.nsa.gov/Cybersecurity/Quantum-Key-Distribution-QKD-and-Quantum-Cryptography-QC/
26•mooreds•2h ago•6 comments

Researchers print structural colour with an inkjet printer

https://physicsworld.com/a/researchers-print-structural-colour-with-an-inkjet-printer/
9•zeristor•2d ago•1 comments

Collaborative Editing in CodeMirror (2020)

https://marijnhaverbeke.nl/blog/collaborative-editing-cm.html
39•luu•2d ago•3 comments

iOS 27 is adding a 'Create a Pass' button to Apple Wallet

https://walletwallet.alen.ro/blog/ios-27-wallet-create-pass/
324•alentodorov•7h ago•260 comments

Docker 29 has changed its default image store for new installs

https://docs.docker.com/engine/storage/containerd
101•neitsab•3d ago•51 comments

Comparing the Z80 and 6502 to Their Relatives

https://bumbershootsoft.wordpress.com/2026/05/02/comparing-the-z80-and-6502-to-their-relatives/
87•ibobev•2d ago•9 comments

UK: Two millionth electric car registered as market rebounds strongly

https://www.smmt.co.uk/two-millionth-electric-car-registered-as-market-rebounds-strongly-from-tax...
142•kieranmaine•3h ago•194 comments

Empty Screenings – Finds AMC movie screenings with few or no tickets sold

https://walzr.com/empty-screenings
290•MrBuddyCasino•15h ago•245 comments

Clarification on the Notepad++ Trademark Issue

https://notepad-plus-plus.org/news/clarify-npp-trademark-infringement/
111•minimaxir•1h ago•47 comments

Simple Meta-Harness on Islo.dev

https://zozo123.github.io/meta-harness-on-islo-page/
39•zozo123-IB•5h ago•17 comments

Today I've made the difficult decision to reduce the size of Coinbase by ~14%

https://twitter.com/brian_armstrong/status/2051616759145185723
91•adrianmsmith•7h ago•104 comments

The first photo published in a newspaper, in 1848 (2023)

https://phsne.org/the-first-photograph-published-in-a-newspaper-1848/
44•geuis•2d ago•18 comments

Lessons for Agentic Coding: What should we do when code is cheap?

https://www.dbreunig.com/2026/05/04/10-lessons-for-agentic-coding.html
203•ingve•12h ago•200 comments

Hand Drawn QR Codes (2025)

https://sethmlarson.dev/hand-drawn-qr-codes
191•jollyjerry•15h ago•42 comments
Open in hackernews

New tools and features in the Responses API

https://openai.com/index/new-tools-and-features-in-the-responses-api
74•meetpateltech•11mo ago

Comments

skeptrune•11mo ago
Wow background mode looks awesome. I'm excited to work that into our UX for people. Live Q&A is such a dead interface at this point.

Reasoning summaries also look great. Anything that provides extra explainability is a win in my book.

pizzuh•11mo ago
It's great to see more and more adoption for MCP. I'm not sure it's the most bulletproof protocol, but it feels like it's in a strong lead, especially with OpenAI support.

I've been using Codex for the last 24 hours, and background mode boosts your output. You can have Codex work on n+ features async. I had it building a database model alongside frontend authentication, and did both pretty well.

tedtimbrell•11mo ago
Im quite surprised they’re actually going with hosted mcp versus just implementing the mcp server locally and interacting with the api
nknj•11mo ago
you can use local mcp servers with the agents sdk: https://openai.github.io/openai-agents-python/mcp/

responses api is a hosted thing and so it made most sense for it to directly connect to other hosted services (like remote mcp servers).

jasongill•11mo ago
I wish OpenAI would provide more clarity about the Assistants API deprecation, which has been announced as being sunset in spring of 2026 and replaced by the Responses API, but still no other updates on the timeline or migration plan.

Prior to the release of the Responses API, the Assistants API was the best way (for our use cases, at least) to interact with OpenAI's API, so hopefully some clarity on the plan for it is released soon (now that Responses API has some of the things that it was previously missing)

nknj•11mo ago
I hear you and really appreciate the patience here.

We're almost ready to share a migration guide. Today, we closed the gap between Assistants and Responses by launching Code Interpreter and support for multiple vector stores in File Search.

We still need to add support for Assistants and Threads objects to Responses before we can give devs a simple migration path. Working on this actively and hope to have all of this out in the coming weeks.

alasano•11mo ago
Interesting that you're migrating assistants and threads to the responses API, I presumed you were killing them off.

I started my MVP product with assistants and migrated to responses pretty easily. I handle a few more things myself but other than that it's not really been difficult.

beklein•11mo ago
On the announcement page they are saying that "...introducing updates to the file search tool that allow developers to perform searches across multiple vector stores...". On the docs, I still find this limitation: "At the moment, you can search in only one vector store at a time, so you can include only one vector store ID when calling the file search tool."

Anybody knows how searching multiple vector stores is implemented? The obvious plan would be to allow something like:

  "vector_store_ids": ["<vector_store_id1>", "<vector_store_id2>", ...]
nknj•11mo ago
sorry about the error in the docs. we're removing that call out.

`"vector_store_ids": ["<vector_store_id1>", "<vector_store_id2>"]` is exactly right. only 2 vector stores are supported at the moment.

akgfab•11mo ago
2 feels quite arbitrary and honestly not that much of an improvement. Any plans to up that limit?
mritchie712•11mo ago
we were using it for our agent in https://www.definite.app/ and I've been expecting it to die for almost a year considering the lack of updates.

We switched over to https://ai.pydantic.dev/ which I really like. LLM agnostic and the team is very receptive to feedback.

andrewrn•11mo ago
It was never really clear what the difference between the chat and responses APIs were. Anyone know the difference?
brittlewis12•11mo ago
chat completions is stateless — you must provide the entire conversation history with each new message; openai stores nothing (at least nothing that the downstream product _can use_) beyond the life of the request.

responses api, by contrast, is stateful — only send the latest message, and openai stores the conversation history, while keeping track of other details on behalf of the calling app, like parallel tool call states.

but i would say that since chat completions has become an informal industry standard, the responses api feels like an attempt by openai to break away from that shared interface, because it is so easy to swap out providers with nothing more than a base url and a model id, to a paradigm which requires data migration as well as replacement infrastructure (containers for code execution, for example).

nknj•11mo ago
one additional difference between chat and responses is the number model turns a single api call can make. chat completions is a single turn api primitive -- which means it can talk to the model just once. responses is capable of making multiple model turns and tool calls in a single api call.

for example, you can give the responses api access to 3 tools: a vector store with some user memories (file_search), the shopify mcp server, and code_interpreter. you can then ask it to look up some user memories, find relevant items in the shopify mcp store, and then download them into a csv file. all of this can be done in a single api call that involves multiple model turns and tool calls.

p.s. - you can also use responses statelessly by setting store=false.

OutOfHere•11mo ago
What are my choices for using a custom tool? Does it come down to: function calling (single turn), MCP (multi-turn via Responses)? What else are my choices?

Why would anyone want to use Responses statelessly? Just trying to understand.

swyx•11mo ago
i think the original intent of responses api was also to unify the realtime experiences into responses - is that accurate?
nknj•11mo ago
we expect responses and realtime to be our 2 core api primitives long term — responses for turn by turn interactions and realtime for models requiring low latency bidirectional streams to/from the apps/models.
swyx•11mo ago
thank you for the correction!
andrewrn•11mo ago
This is very enlightening. You're right then, it does seem to partially be a strategic moat-building move by OpenAI
rafram•11mo ago
> Encrypted reasoning items: Customers eligible for Zero Data Retention (ZDR) can now reuse reasoning items across API requests

So, so weird that they still don't want you to see their models' reasoning process, to the point that even highly trusted organizations with ZDR contracts only get them in a black-box encrypted form. Gemini has no issue showing its work. Why can't OpenAI?

vessenes•11mo ago
Is this true? I can click open o3’s dialogue and see a running monologue. I guess it might be a summary of the actual reasoning though.
hhh•11mo ago
It is a summary
mediaman•11mo ago
Correct, you are not seeing the reasoning chains.
rafram•11mo ago
I may be giving Gemini too much credit, actually - seems like its "reasoning" may be a summary as well.
Doohickey-d•11mo ago
They changed it yesterday or so: it used to show the actual reasoning, now it no longer does. And the reasoning was quite useful to see if it was going down the wrong track, the summary is much less so.
epiccoleman•11mo ago
That's disappointing. I was getting a lot of utility from reading through the thoughts returned by Gemini when I used it in Cursor - occasionally even learning something new from its stream of "consciousness". Obfuscating the information because it can be used to train competitors seems misguided, if understandable.
vessenes•11mo ago
Agreed. Right now deepseek’s R1 has uncensored stream of consciousness in open weights. I think it’s interesting that teams feel the streams should be proprietary. They must be doing something a little different than R1, or it wouldn’t be worth the extra engineering work.
fermisea•11mo ago
Not only that. I have an agent product and I’m currently blocked from using their reasoning models on Azure for having asked for a chain of thought, which apparently is against the ToS.

The customer service itself was surreal enough that it was easier just to migrate to Anthropic

NitpickLawyer•11mo ago
> So, so weird that they still don't want you to see their models' reasoning process

It's not weird at all. R1-distills have shown that you can get pretty close to the real thing with post-training on enough completions. I believe gemini has also stopped showing the thinking steps (apparently the GLM series of open access models were heavily trained on gemini data).

ToS violations can't be enforced in any effective way, and certainly not cross-borders. Their only way to maintain whatever moat thinking models give them is to simply not show the thinking parts.

zvitiate•11mo ago
Google actually switched to an OpenAI system for 2.5 Pro's Chain-of-Thought yesterday on the Gemini app and AI Studio ("I did this; I did that. etc"). Apparently it still shows via API, but no clear how long. Also, in my experience, if you select the "Canvas" output, you still get the old style CoT.

And yes, the above is true even if you are ULTRA.

You can still view your old thinking traces from prior turns and conversations.

zoogeny•11mo ago
My heart just broke to hear this. Although I honestly don't read the thinking output very often. But I had been cheekily copy-n-pasting the info for my own records.
knowsuchagency•11mo ago
I agree, but there's always Deepseek. They're publishing and open-sourcing more than anyone these days.
orasis•11mo ago
Reasoning models can now call tools during the reasoning process.
joshwarwick15•11mo ago
List of remote MCP servers to use here: https://github.com/jaw9c/awesome-remote-mcp-servers