frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

HaluMem: Evaluating Hallucinations in Memory Systems of Agents

https://arxiv.org/abs/2511.03506
2•timini•1h ago

Comments

timini•1h ago
HaluMem introduces the first benchmark for evaluating hallucinations in agent memory systems at the operation level. Through three evaluation tasks (memory extraction, updating, and question answering), it reveals that existing memory systems generate and accumulate hallucinations during early stages, which then propagate errors downstream. The benchmark uses two datasets spanning different context scales to systematically reveal these failure modes.

New MongoDB CEO violated former company hiring policy

https://www.stripes.com/theaters/us/2024-08-02/army-investigation-contracts-14711647.html
1•opisthenar84•1m ago•0 comments

Zohran Mamdani Just Inherited the NYPD Surveillance State

https://www.wired.com/story/welcome-to-mamdanis-surveillance-state/
1•geox•1m ago•0 comments

GTA 6 delayed to 26 November 2026 for "additional polish"

https://www.gamesindustry.biz/gta-6-delayed-to-26-november-2026-for-additional-polish
1•Philpax•5m ago•0 comments

We built a 270M local model to detect phishing URLs

https://charlemagnelabs.ai/blog/building-agent-charley-270m-parameter-local-ai
2•rwhaling•6m ago•0 comments

ICE plans to open call center to help law enforcement locate unaccompanied minor

https://www.theguardian.com/us-news/2025/nov/06/ice-call-center-unaccompanied-migrant-children
1•onemoresoop•8m ago•0 comments

Why Manufacturing's Last Boom Will Be Hard to Repeat

https://www.wsj.com/economy/us-manufacturing-onshoring-history-988375c3
1•clcaev•9m ago•0 comments

Deep Space Exploitation Is Out Now

https://juhrjuhr.itch.io/deep-space-exploitation/devlog/1104898/deep-space-exploitation-is-out-now
2•ibobev•10m ago•0 comments

Whole World Holonomy

https://galileo-unbound.blog/2025/11/05/whole-world-holonomy/
1•ibobev•10m ago•0 comments

Ninjas Lets Chat

https://x.com/8secguru/
1•realityredninja•11m ago•0 comments

Show HN: Rankly – The only AEO platform to track AI visibility and conversions

https://rankly.ai
1•satj•11m ago•0 comments

Microsoft forms superintelligence team to serve humanity

https://www.cnbc.com/2025/11/06/microsoft-forms-superintelligence-team-under-ai-head-mustafa-sule...
1•leopoldj•13m ago•1 comments

Occurrence, Dominance, and Combined Use of Antibiotics in Aquaculture Ponds

https://www.mdpi.com/2305-6304/13/10/892
1•PaulHoule•14m ago•0 comments

Show HN: How We Escaped the Purple Prison of AI Front Ends

https://x.com/YouWareAI/article/1986491218461933767
1•marv1nnnnn•14m ago•0 comments

Synthient Credential Stuffing Threat Data Breach

https://haveibeenpwned.com/Breach/SynthientCredentialStuffingThreatData
1•manlymuppet•15m ago•1 comments

An Undefeated Pull Request Template (2024)

https://ashleemboyer.com/blog/pull-request-template/
1•mooreds•16m ago•0 comments

1,500 PRs Later: Spotify's Journey with Our Background Coding Agent (Part 1)

https://engineering.atspotify.com/2025/11/spotifys-background-coding-agent-part-1
1•janpio•17m ago•0 comments

Create an ultra quality HEVC video pipeline with hardware cost of less than $500

https://github.com/AirenSoft/OvenMediaEngine/discussions/1585
1•thisislife2•19m ago•0 comments

Arc Prize Verified

https://arcprize.org/blog/arc-prize-verified-program
2•gok•19m ago•0 comments

Tuta Introduces Key Verification

https://tuta.com/blog/key-verification
2•dotcoma•19m ago•0 comments

Show HN: I Vibe-Coded a TUI for AWS Logs Insights in Rust

https://github.com/alexgit/awslogs
1•ast42•22m ago•0 comments

Giant Spider Colony Discovered in Unique Chemoautotrophic Cave

https://newatlas.com/biology/sulfur-cave-largest-spiderweb/
4•thunderbong•23m ago•1 comments

Rethink How You Build AI-Driven Systems

https://www.eventsourcing.ai/
1•goloroden•23m ago•0 comments

Show HN: Turn docs into tailored self-serve playgrounds to create aha-moments

https://visr.dev/
1•sourishkrout•25m ago•0 comments

Unix v4 Tape Found

https://discuss.systems/@ricci/115504720054699983
6•greatquux•25m ago•0 comments

Be Bold

https://en.wikipedia.org/wiki/Wikipedia:Be_bold
1•MrJagil•26m ago•0 comments

Show HN: Completely free Claude Sonnet 4.5, supported by contextual ads

2•namanyayg•28m ago•0 comments

Tududi – Self-hosted task management

https://github.com/chrisvel/tududi
1•celsoazevedo•29m ago•0 comments

Chaos and lies: Why Sam Altman was booted from OpenAI according to new testimony

https://www.theverge.com/ai-artificial-intelligence/814876/ilya-sutskever-deposition-openai-sam-a...
1•littlexsparkee•31m ago•1 comments

How to fix subsystem request failed on channel 0

https://blog.x-way.org/Linux/2025/11/06/How-to-fix-subsystem-request-failed-on-channel-0.html
1•speckx•31m ago•0 comments

Space junk may have struck a Chinese crew ship in low-Earth orbit

https://arstechnica.com/space/2025/11/landing-postponed-for-chinese-astronauts-after-suspected-sp...
2•rbanffy•31m ago•0 comments