Over the last 18 months, agentic AI has shifted from prompt-driven chatbots to system-level engineering. This post maps the open-source tools teams are actually using to build, run, and evaluate agents in production, and where they sit in the agent stack.
bgrief•1h ago
good stuff :)
marianebekker•1h ago
Thank you!
alexfeinstein77•1h ago
Cool article!
marianebekker•1h ago
Thank you!
phlik•1h ago
Great Read.
marianebekker•1h ago
Thank you!
marianebekker•1h ago
One thing that surprised us while putting this together was how uneven the stack still is. Planning and execution tooling feels fairly mature, but evaluation and long-term reliability lag far behind. Curious how people here are testing and validating agents in production today.
kenhoppe•1h ago
The agentic AI landscape is rapidly maturing, shifting from isolated experiments to end-to-end systems that can think, act, and adapt autonomously. Thanks for sharing how various frameworks are shaping the future of intelligent, real-world automation."
marianebekker•1h ago
Agreed on the shift — what stood out to us is how quickly teams hit system-level constraints once agents leave the demo phase. Things like evaluation drift, tool reliability, and recovery behavior start to matter more than model choice. Curious which of those has been the biggest bottleneck for you in practice?
marianebekker•2h ago