frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: For those of you building AI agents, how have you made them faster?

2•arkmm•5h ago
Because of the coordination across multiple systems + chaining LLM calls, a lot of agents today can feel really slow. I would love to know how others are tackling this:

- How are you all identifying performance bottlenecks in agents?

- What types of changes have gotten you the biggest speedups?

For us we vibe-coded a profiler to identify slow LLM calls - sometimes we could then switch out a faster model for that step or we'd realize we could shrink the input tokens by eliminating unnecessary context. For steps requiring external access (browser usage, API calls), we've moved to fast start external containers + thread pools for parallelization. We've also experimented some with UI changes to mask some of the latency.

What other performance enhancing techniques are people using?

Comments

codingdave•2h ago
To start with, I'm not coordinating across multiple systems or chaining LLM calls. I write all data to a central data store, run a state machine on it, and each LLM call then operates independently.

At that point, you can measure and optimize each call as needed and it is fairly straightforward.

Ask HN: How can we solve the loneliness epidemic?

330•publicdebates•7h ago•613 comments

Ask HN: One IP, multiple unrealistic locations worldwide hitting my website

28•nacho-daddy•5h ago•14 comments

Ask HN: Share your personal website

830•susam•1d ago•2207 comments

Ask HN: How are you doing RAG locally?

347•tmaly•1d ago•139 comments

Ask HN: What did you find out or explore today?

195•blahaj•1d ago•360 comments

Ask HN: What are your best purchases under $100?

26•krishadi•4h ago•101 comments

Ask HN: Is Codex login down for all workspace (non-personal) users?

2•amluto•1h ago•0 comments

Ask HN: Estimating % of dev using coding assistants

5•japoneris•3h ago•4 comments

Ask HN: Why do AI code editors suck at closing tags?

8•cryptography•17h ago•3 comments

Ask HN: What is the best way to provide continuous context to models?

66•nemath•23h ago•37 comments

Ask HN: How to make spamming us uncomfortable for LinkedIn and friends?

10•zx8080•12h ago•6 comments

Ask HN: What to teach my kid if AI does math and CS?

7•devShark•8h ago•13 comments

Ask HN: How do you safely give LLMs SSH/DB access?

77•nico•1d ago•104 comments

Ask HN: Anyone else finding it impossible to land a job?

12•Arch485•8h ago•17 comments

Ask HN: A pattern we noticed in how website leads are handled

2•lucascorrei4•4h ago•1 comments

Ask HN: Distributed SQL engine for ultra-wide tables

22•synsqlbythesea•1d ago•18 comments

Ask HN: Any real prompt injections in the wild?

6•singularity2001•15h ago•2 comments

Ask HN: Iran's 120h internet shutdown, phones back. How to stay resilient?

112•us321•2d ago•95 comments

Ask HN: Are the layoffs at Tailwind a trend that can be extrapolated?

2•qcardona•1h ago•1 comments

Ask HN: For those of you building AI agents, how have you made them faster?

2•arkmm•5h ago•1 comments

Ask HN: Audio analysis models, how to train to learn sound patters?

4•thedangler•9h ago•1 comments

Where does data help in real estate – and where does it fail?

2•D___R___•6h ago•0 comments

Ask HN: How to overcome the limit of roles in LLM's

2•weli•7h ago•0 comments

Ask HN: Why does Google still provide an open redirect for phishers?

21•throwaway89201•1d ago•9 comments

GitHub Is Down

18•dfajgljsldkjag•7h ago•16 comments

Ask HN: What are you working on? (January 2026)

256•david927•4d ago•867 comments

The $LANG Programming Language

261•dang•2d ago•69 comments

Architecture+cost drivers for a deterministic rule/metric engine 1,200metrics

2•Trackdiver•9h ago•0 comments

Tell HN: 1B Jobs on GitHub Actions

2•dorianmariecom•8h ago•1 comments

Turning weeks of medical device documentation into minutes

2•feargalosull•9h ago•0 comments