Context Engineering for Agents

https://rlancemartin.github.io/2025/06/23/context_engineering/

45•0x79de•2d ago

Comments

ares623•5h ago

Another article handwaving or underselling the effects of hallucination. I can't help but draw parallels to layer 2 attempts from crypto.

FiniteIntegral•3h ago

Apple released a paper showing the diminishing returns of "deep learning" specifically when it comes to math. For example, it has a hard time solving the Tower of Hanoi problem past 6-7 discs, and that's not even giving it the restriction of optimal solutions. The agents they tested would hallucinate steps and couldn't follow simple instructions.

On top of that -- rebranding "prompt engineering" as "context engineering" and pretending it's anything different is ignorant at best and destructively dumb at worst.

hnlmorg•2h ago

Context engineering isn’t a rebranding. It’s a widening of scope.

Like how all squares are rectangles, but not all rectangles are squares; prompt engineering is context engineering but context engineering also includes other optimisations that are not prompt engineering.

That all said, I don’t disagree with your overall point regarding the state of AI these days. The industry is full of so much smoke and mirrors these days that it’s really hard to separate the actual novel uses of “AI” vs the bullshit.

senko•2h ago

That's one reading of that paper.

The other is that they intentionally forced LLMs to do the things we know are bad at (following algorithms, tasks that require more context that available, etc) without allowing them to solve it in a way they're optimized to do (write a code that implements the algorithm).

A cynical read is that the paper is the only AI achievement Apple has managed to do in the past few years.

(There is another: they managed not to lose MLX people to Meta)

OJFord•48m ago

Let's just call all aspects of LLM usage 'x-engineering' to professionalise it, even while we're barely starting to figure it out.

azaras•4h ago

To provide context, I utilize the memory-bank pattern with GitHub Copilot Agent, but I believe I'm wasting a significant number of tokens.

truth_seeker•4h ago

Nah ! I am not convinced that context engineering is better (in the long trem) than prompt engineering. Context engineering is still complex and needs maintainance. Its much lower level than human level language.

Given that domain expertise of the problem statment, we can apply the same tactics in context engineering on higher level in prompt engineering.

hnlmorg•2h ago

This whole industry is complex and needs constant maintenance. APIs break all the time -- and that's assuming they were even correct to begin with. New models are constantly released, each with their own new quirks. People are still figuring out how to build this tech -- and as quickly as they figure one thing out, the goal posts move again.

This entire field is basically being built on quicksand. And it will stay like this until the bubble bursts.

jes5199•4h ago

good survey of what people are already implementing, but I’ve convinced we barely understand the possibility space here. There may be much more elaborate structures that we will put context into that haven’t been discovered yet

Introducing tmux-rs

Killer whales groom each other with pieces of kelp

Flounder Mode – Kevin Kelly on a different way to do great work

LooksMapping

How AI on Microcontrollers Works: Operators and Kernels

Launch HN: K-Scale Labs (YC W24) – Open-Source Humanoid Robots

As a Labrador swam by me out to sea his owner said I hope he doesn't meet a seal

AV1@Scale: Film Grain Synthesis, The Awakening

Wind Knitting Factory

One Billion Cells – Another Multiplayer Demo with Clojure

Raphael discovery emerges from Vatican museum restoration

The Rise of Whatever

My open source project was relicensed by a YC company [license updated]

Peasant Railgun

Alternative Blanket Implementations for a Single Rust Trait

Major reversal in ocean circulation detected in the Southern Ocean

Man goes viral after working for four startups at the same time

Context Engineering for Agents

Poor Man's Back End-as-a-Service (BaaS), Similar to Firebase/Supabase/Pocketbase

WASM Agents: AI agents running in the browser

Developing with GitHub Copilot Agent Mode and MCP

Ubuntu 25.10 Raises RISC-V Profile Requirements

Where is my von Braun wheel?

Manipulating trapped air bubbles in ice for message storage in cold regions

High-Fidelity Simultaneous Speech-to-Speech Translation

Caching is an abstraction, not an optimization

Converge (YC S23) well-capitalized New York startup seeks product developers

Opening up ‘Zero-Knowledge Proof’ technology

Zig breaking change – initial Writergate

Fei-Fei Li: Spatial intelligence is the next frontier in AI [video]

Context Engineering for Agents

Comments

Introducing tmux-rs

Killer whales groom each other with pieces of kelp

Flounder Mode – Kevin Kelly on a different way to do great work

LooksMapping

How AI on Microcontrollers Works: Operators and Kernels

Launch HN: K-Scale Labs (YC W24) – Open-Source Humanoid Robots

As a Labrador swam by me out to sea his owner said I hope he doesn't meet a seal

AV1@Scale: Film Grain Synthesis, The Awakening

Wind Knitting Factory

One Billion Cells – Another Multiplayer Demo with Clojure

Raphael discovery emerges from Vatican museum restoration

The Rise of Whatever

My open source project was relicensed by a YC company [license updated]

Peasant Railgun

Alternative Blanket Implementations for a Single Rust Trait

Major reversal in ocean circulation detected in the Southern Ocean

Man goes viral after working for four startups at the same time

Context Engineering for Agents

Poor Man's Back End-as-a-Service (BaaS), Similar to Firebase/Supabase/Pocketbase

WASM Agents: AI agents running in the browser

Developing with GitHub Copilot Agent Mode and MCP

Ubuntu 25.10 Raises RISC-V Profile Requirements

Where is my von Braun wheel?

Manipulating trapped air bubbles in ice for message storage in cold regions

High-Fidelity Simultaneous Speech-to-Speech Translation

Caching is an abstraction, not an optimization

Converge (YC S23) well-capitalized New York startup seeks product developers

Opening up ‘Zero-Knowledge Proof’ technology

Zig breaking change – initial Writergate

Fei-Fei Li: Spatial intelligence is the next frontier in AI [video]