frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

The case for e-ink displays in a high-dopamine world

1•jerr12939•2m ago•0 comments

OpenAI site errors are now generated by AI

https://openai.com/careers/software-engineer-codex-runtime-san-francisco/
2•behnamoh•3m ago•0 comments

Ask HN: Who can invite me to lobster.rs?

1•DenisDolya•4m ago•0 comments

Chemical weapons used on protestors in Iran

https://www.iranintl.com/en/202601235991
1•ukblewis•4m ago•1 comments

Shamir's Secret Sharing Explained (For Normal People)

https://www.deadhandprotocol.com/blog/shamirs-secret-sharing-explained
1•maxcomperatore•6m ago•0 comments

The First Eighteen Lines of the Waste Land (1989)

https://yalereview.org/article/hecht-eliot-waste-land
1•benbreen•10m ago•0 comments

Trendshift Stats – GitHub language trends across 20k+ curated repositories

https://trendshift.io/stats
1•Hylasca•12m ago•1 comments

ROS2 Robotics 2026: Jetson Nano or Raspberry Pi 5 Kit?

https://www.hackster.io/HiwonderRobot/ros2-robotics-2026-jetson-nano-or-raspberry-pi-5-kit-ba5299
1•chfritz•14m ago•0 comments

HN: ReguAction – Turn regulatory URLs into actionable compliance plans

https://apify.com/brazen_vanguard/reguaction-ai-compliance-regulation-analyst
1•founder_mode•15m ago•1 comments

Show HN: Animated Anthropic OS Take Home Explainer

https://twitter.com/ashebytes/status/2014767748291952776
1•ashe_mag•16m ago•0 comments

It's Time for Primary Prevention in Medicine

https://erictopol.substack.com/p/dawn-of-a-new-era-of-primary-prevention
1•brandonb•16m ago•0 comments

Tariffs took an emotional and financial toll on founders in 2025

https://www.modernretail.co/operations/quitting-wasnt-an-option-how-tariffs-took-an-emotional-and...
1•sberens•18m ago•0 comments

Show HN: Codex Self-Reflect Skill and CLI to run subagents on past Codex convos

https://github.com/olliepro/Codex-Reflect-Skill
2•olliepro•19m ago•2 comments

Show HN: My mouse was sending two clicks instead of one – I built a tester

https://oopsclick.pages.dev/
1•nirvanist•21m ago•0 comments

Ask HN: B2B or B2C?

1•eyalhadad•23m ago•0 comments

The Slow Death of Productivity: Why Time.sleep() Is Killing Your Automation

https://pypi.org/project/nano-wait/
1•LuizSeabra•24m ago•1 comments

Show HN: Collecting AI Developer Tools

https://aihunt.dev/
1•fullofdev•25m ago•0 comments

Proposal: Generic Methods for Go

https://github.com/golang/go/issues/77273
1•pansa2•25m ago•0 comments

'Organized syndicates' fraudulently access health records, lawsuit says

https://www.washingtonpost.com/health/2026/01/22/electronic-health-record-fraud-lawsuit/
1•bookofjoe•26m ago•1 comments

JWT authentication bypass in HarbourJwt via "unknown alg"

https://pentesterlab.com/blog/cve-2026-23993-harbourjwt-unknown-alg-jwt-bypass
1•mooreds•29m ago•0 comments

Justice Department Opens Criminal Probe into Silicon Valley Spy Allegations

https://www.wsj.com/us-news/law/justice-department-opens-criminal-probe-into-silicon-valley-spy-a...
2•nradov•30m ago•0 comments

Results from the 2025 Go Developer Survey

https://go.dev/blog/survey2025
3•mooreds•30m ago•0 comments

Show HN: Skill Generator – Turn terminal workflows into AI agent skills

1•ezulabs•31m ago•0 comments

A multi-entry CFG design conundrum

https://bernsteinbear.com/blog/multiple-entry/
1•ibobev•31m ago•0 comments

Infinite Random Rectangles – The Poisson Rect Process

https://www.boristhebrave.com/2026/01/22/infinite-random-rectangles-the-poisson-rect-process/
1•ibobev•32m ago•0 comments

Mana LLM OS

https://www.mana.space/
1•behzadhaghgoo•34m ago•0 comments

EV battery leader CATL set to launch first sodium-ion batteries in vehicles

https://electrek.co/2026/01/23/ev-battery-leader-plans-first-sodium-ion-batteries-passenger-cars/
3•breve•34m ago•1 comments

The Time Oxygen Almost Killed Everything [video]

https://www.youtube.com/watch?v=qERdL8uHSgI
2•mooreds•34m ago•0 comments

Zulip AI use policy and guidelines

https://zulip.readthedocs.io/en/latest/contributing/contributing.html#ai-use-policy-and-guidelines
1•Philpax•36m ago•0 comments

Ask HN: Why don't winter gloves have mechanical fingers?

2•amichail•37m ago•3 comments
Open in hackernews

Unrolling the Codex agent loop

https://openai.com/index/unrolling-the-codex-agent-loop/
66•tosh•1h ago

Comments

mkw5053•1h ago
I guess nothing super surprising or new but still valuable read. I wish it was easier/native to reflect on the loop and/or histories while using agentic coding CLIs. I've found some success with an MCP that let's me query my chat histories, but I have to be very explicit about it's use. Also, like many things, continuous learning would probably solve this.
MultifokalHirn•1h ago
thx :)
jumploops•1h ago
One thing that surprised me when diving into the Codex internals was that the reasoning tokens persist during the agent tool call loop, but are discarded after every user turn.

This helps preserve context over many turns, but it can also mean some context is lost between two related user turns.

A strategy that's helped me here, is having the model write progress updates (along with general plans/specs/debug/etc.) to markdown files, acting as a sort of "snapshot" that works across many context windows.

crorella•49m ago
Same here! I think it would be good if this could be made by default by the tooling. I've seen others using SQL for the same and even the proposal for a succinct way of representing this handoff data in the most compact way.
vmg12•48m ago
I think this explains why I'm not getting the most out of codex, I like to interrupt and respond to things i see in reasoning tokens.
behnamoh•12m ago
that's the main gripe I have with codex; I want better observability into what the AI is doing to stop it if I see it going down the wrong path. in CC I can see it easily and stop and steer the model. in codex, the model spends 20m only for it to do something I didn't agree on. it burns OpenAI tokens too; they could save money by supporting this feature!
zeroxfe•5m ago
You're in luck -- /experimetal -> enable steering.
behnamoh•4m ago
I first need to see real time AI thoughts before I can steer it tho! Codex hides most of them
CjHuber•3m ago
They've added steering as an experimental feature in the latest versions
sdwr•46m ago
That could explain the "churn" when it gets stuck. Do you think it needs to maintain an internal state over time to keep track of longer threads, or are written notes enough to bridge the gap?
behnamoh•16m ago
but that's why I like Codex CLI, it's so bare bone and lightweight that I can build lots tools on top of it. persistent thinking tokens? let me have that using a separate file the AI writes to. the reasoning tokens we see aren't the actual tokens anyway; the model does a lot more behind the scenes but the API keeps them hidden (all providers do that).
CjHuber•5m ago
It depends on the API path. Chat completions does what you describe, however isn't it legacy?

I've only used codex with the responses v1 API and there it's the complete opposite. Already generated reasoning tokens even persist when you send another message (without rolling back) after cancelling turns before they have finished the thought process

Also with responses v1 xhigh mode eats through the context window multiples faster than the other modes, which does check out with this.

dfajgljsldkjag•52m ago
The best part about this is how the program acts like a human who is learning by doing. It is not trying to be perfect on the first try, it is just trying to make progress by looking at the results. I think this method is going to make computers much more helpful because they can now handle the messy parts of solving a problem.
written-beyond•35m ago
Has anyone seriously used codex cli? I was using LLMs for code gen usually through the vscode codex extension, Gemini cli and Claude Code cli. The performance of all 3 of them is utter dog shit, Gemini cli just randomly breaks and starts spamming content trying to reorient itself after a while.

However, I decided to try codex cli after hearing they rebuilt it from the ground up and used rust(instead of JS, not implying Rust==better). It's performance is quite literally insane, its UX is completely seamless. They even added small nice to haves like ctrl+left/right to skip your cursor to word boundaries.

If you haven't I genuinely think you should give it a try you'll be very surprised. Saw Theo(yc ping labs) talk about how open ai shouldn't have wasted their time optimizing the cli and made a better model or something. I highly disagree after using it.

procinct•26m ago
Same goes for Claude Code. Literally has vim bindings for editing prompts if you want them.
behnamoh•5m ago
CC is the clunkiest PoS software I've ever used in terminal; feels like it was vibe coded and anthroshit doesn't give a shit
ewoodrich•24m ago
OpenCode also has an extremely fast and reliable UI compared to the other CLIs. I’ve been using Codex more lately since I’m cancelling my Claude Pro plan and it’s solid but haven’t spent nearly as much time compared to Claude Code or Gemini CLI yet.

But tbh OpenAI openly supporting OpenCode is the bigger draw for me on the plan but do want to spend more time with native Codex as a base of comparison against OpenCode when using the same model.

I’m just happy to have so many competitive options, for now at least.

behnamoh•14m ago
Seconded. I find codex lacks only two things:

- hooks (this is a big one)

- better UI to show me what changes are going to be made.

the second one makes a huge diff and it's the main reason I stopped using opencode (lots of other reasons too). in CC, I am shown a nice diff that I can approve/reject. in codex, the AI makes lots of changes but doesn't pin point what changes it's doing or going to make.

written-beyond•9m ago
Yeah it's really weird with automatically making changes. I read in it's chain of thought that it's going to request approval for something from the user, the next message was approval granted doing it. Very weird...
williamstein•20m ago
I strongly agree. The memory and cpu usage of codex-cli is also extremely good. That codex-cli is open source is also valuable because you can easily get definitive answers to any questions about its behavior.

I also was annoyed by Theo saying that.

georgeven•13m ago
I found codex cli to be significantly better than claude code. It follows instructions and executes the exact change I want without going off on an "adventure" like Claude code. Also the 20 dollars per month sub tier gives very generous limits of the most powerful model option (5.2 codex high).

I work on SSL bio acoustic models as context.

behnamoh•11m ago
code the model (not the cli) is the big thing here. I've used it in CC and w/ my claude setup, it can handle things Opus could never. it's really a secret weapon not a lot of people talk about. I'm not even using xhigh most of the time.
copperx•4m ago
[delayed]
CuriouslyC•10m ago
The problem with codex right now is it doesn't have hook support. It's hard to understate how big of a deal hooks are, the Ralph loop that the newer folks are losing their shit over is like the level 0, most rudimentary use of hooks.

I have a tool that reduces agent token consumption by 30%, and it's only viable because I can hook the harness and catch agents being stupid, then prompt them to be smarter on the fly. More at https://sibylline.dev/articles/2026-01-22-scribe-swebench-be...

ppeetteerr•16m ago
I asked Claude to summarize the article and it was blocked haha. Fortunately, I have the Claude plugin in chrome installed and it used the plugin to read the contents of the page.
sdwvit•15m ago
Great achievement. What did you learn?
ppeetteerr•13m ago
Nothing particularly insightful other than avoiding messing with previous messages so as not to mess with the cache.
rvnx•6m ago
Summary by Claude:

    Codex works by repeatedly sending a growing prompt to the model, executing any tool calls it requests, appending the results, and repeating until the model returns a text response
rvnx•11m ago

    while true; do echo 'prompt' | codex ; sleep 1 ; done
https://i.ytimg.com/vi/74U04h9hQ_s/maxresdefault.jpg

meh.

asey•3m ago
The article isn't about outer loops but rather what's happening inside codex while handling a task