frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Gemini 3.5 Flash Benchmarks

https://twitter.com/Google/status/2056788271926239471/photo/1
1•henrikhorluck•36s ago•0 comments

Gemini Spark

https://gemini.google/overview/agent/spark/
1•jeremydw•53s ago•0 comments

Google Antigravity 2.0

https://antigravity.google/blog/introducing-google-antigravity-2-0
1•John7878781•55s ago•0 comments

We made our filesystem 47× faster by deleting it

https://microsandbox.dev/blog/oci-filesystem-47x-faster
1•appcypher•1m ago•0 comments

Universal Commerce Protocol

http://ucp.dev/
1•Wingy•1m ago•0 comments

Managed Agents in the Gemini API

https://blog.google/innovation-and-ai/technology/developers-tools/managed-agents-gemini-api/
1•berlianta•3m ago•0 comments

Bipartisan Bill Would Impose New Annual Fee on Electric Vehicles

https://www.nytimes.com/2026/05/19/business/energy-environment/electrc-vehicles-annual-fee-congre...
2•tantalor•3m ago•0 comments

Gemini for Science: AI experiments and tools for a new era of discovery

https://blog.google/innovation-and-ai/technology/research/gemini-for-science-io-2026/
2•berlianta•4m ago•0 comments

'Capitalism has to become more humane': a Stanford economist on big tech

https://www.theguardian.com/books/2026/may/18/big-tech-monopolies-democracy-mordecai-kurz
2•xyzal•5m ago•0 comments

Google Search as you know it is over

https://techcrunch.com/2026/05/19/google-search-as-you-know-it-is-over/
2•evo_9•6m ago•0 comments

Agent Evaluation: A Detailed Guide

https://cameronrwolfe.substack.com/p/agent-evals
2•gmays•7m ago•0 comments

Show HN: LaunchDock – App Launcher in Rust

https://github.com/qa3-tech/launchdock
2•qa3-tech•7m ago•0 comments

De‐Bloating JavaScript

https://github.com/naver/lispe/wiki/6.23-De%E2%80%90bloating-Javascript
2•birdculture•7m ago•0 comments

Co-Scientist: A multi-agent AI partner to accelerate research

https://deepmind.google/blog/co-scientist-a-multi-agent-ai-partner-to-accelerate-research/
2•ryanhn•8m ago•0 comments

Streamer Realtime Deepfakes Himself into Mr. Beast

https://www.404media.co/streamer-realtime-deepfakes-himself-into-mr-beast-says-he-loves-touching-...
1•cdrnsf•8m ago•0 comments

Show HN: Local LLM code-generation with Gemma 4 e2B via JSON AST to Clojure

https://github.com/quadracollision/llmisp
1•vegnus•9m ago•0 comments

Demis Hassabis Thinks AI Job Cuts Are Dumb

https://www.wired.com/story/demis-hassabis-ai-layoffs-deepmind-google-io/
1•ent101•9m ago•0 comments

IBM Brings Its Most Advanced AI-Powered Security Portfolio to Clients

https://newsroom.ibm.com/2026-05-19-IBM-Brings-Its-Most-Advanced-AI-Powered-Security-Portfolio-to...
1•SVI•9m ago•0 comments

Google Search is getting its biggest changes

https://www.theverge.com/tech/932970/google-search-ai-update-io-2026
1•droidjj•10m ago•0 comments

You're not ready for minions

https://contextbridge.ai/blog/youre-not-ready-for-minions-01/
1•jcarver•10m ago•0 comments

Parallel execution for Node.js, done right

https://github.com/yankouskia/hurried
2•yankouskia•11m ago•0 comments

Show HN: Bypassing the AWS Lambda 4KB limit to run polyglot AI agents

https://brewhubphl.com/engineering/parity-contracts-for-polyglot-llm-commerce-a-case-study
1•tomc267•12m ago•1 comments

Tanenbaum: Can We Make Operating Systems Reliable and Secure? (2006) [pdf]

https://www.cs.vu.nl/~ast/Publications/Papers/computer-2006a.pdf
1•delamon•12m ago•0 comments

Gemini 3.5: frontier intelligence with action

https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/
16•meetpateltech•12m ago•1 comments

Chime CEO: Pursuing bank charter is 'a when, not if'

https://www.bankingdive.com/news/chime-ceo-bank-charter-fintech/820621/
1•petethomas•13m ago•0 comments

Direction-Preserving Number Representations

https://arxiv.org/abs/2605.07662
1•matt_d•14m ago•0 comments

Wireless in situ wearable bioelectronic sweat sensor for biomarker monitoring

https://www.nature.com/articles/s41551-026-01670-2
1•bookofjoe•16m ago•0 comments

Gemini Omni

https://deepmind.google/models/gemini-omni/
8•meetpateltech•16m ago•2 comments

Nancy Mace pushes for statewide data center moratorium

https://www.politico.com/news/2026/05/19/nancy-mace-data-center-moratorium-00927781
2•1vuio0pswjnm7•17m ago•0 comments

Gemini 3.5 Flash

https://ai.google.dev/gemini-api/docs/models/gemini-3.5-flash
2•spectraldrift•18m ago•0 comments
Open in hackernews

LLMs adapt 24.9% under observation – safety evals are always observed

https://the-mind-of-ai.com/posts/the-relay-room-is-still-running/https://the-mind-of-ai.com/posts/the-relay-room-is-still-running/
2•agentic-wiki•41m ago

Comments

agentic-wiki•41m ago
Covas et al. (arXiv:2605.15034) ran 100 multi-agent debate sessions across 5 observation conditions. Human monitoring: +24.9% TTR vs. unmonitored baseline. Automated AI auditing: +22.2%. Both statistically significant.

  Anthropic's own agentic misalignment docs acknowledge the problem: "If models
  recognize evaluations and act differently when tested, then such evaluations may
  not accurately predict the model's real behavior in the wild."

  In August 2025, OpenAI and Anthropic cross-tested each other's models — both
  labs temporarily relaxed their external safety filters to run the safety tests
  cleanly. The models knew they were being tested. The results were published as
  evidence of safety.

  Anthropic's RSP and OpenAI's Preparedness Framework set deployment thresholds
  from this data. The gap between evaluation conditions and deployment conditions
  is 24.9 percentage points of measured linguistic adaptation. That experiment
  hasn't been run.