frontpage.

I'm sure many of you work for companies where various AI tools are being made available and IT departments asking for feedback on those tools. The IT departments are allocating in some cases unlimited budget in the hopes that something comes out as a winner and sticks out eventually...

For example the models from Anthropic, OpenAI, Google etc. can be accessed via: - IDE integration, e.g. VS Code, JetBrains etc. - Dedicated apps and CLIs, e.g. Codex, Claude, Copilot CLI etc.

It's already bad enough that SWE orgs are struggling to quantify the strength weaknesses of the models themselves and now we have their integration/entry points to test out too and I'm not sure how we can even being to systematically evaluate these tools...

How are you approaching this? What's worked for you and what's not?

Show HN: Waibsite – Build a Website by Chatting

Deutsch–Jozsa Algorithm

How you implemented your Python decorator is wrong

Shortest Sudoku Solver

IOSurface Kernel Teardown Panic (macOS 15.x / 26.x)

To become a good C programmer (2011)

GPT-5.5 has pulled ahead of Opus for accounting and finance tasks

How good is Mac Studio M3 Ultra for Trillion param models like DeepSeekv4?

The Alignment Problem in Your Government

My audio interface has SSH enabled by default

Microsoft's Wave of Executive Departures

TIPSv2: Advancing Vision-Language Pretraining with Enhanced Patch-Text Alignment

LLMs are pretty good at making slideshows now

Show HN: ffmpeg-render-pro – Parallel video rendering with live dashboard

RLMs process inputs up to two orders of magnitude beyond model context windows

Support Put, Patch, and Delete in HTML Forms

Solve identity consistency problem for foundational image models

Show HN: HNswered – watches for replies to your Hacker News posts and comments

Simple 3D Modeler on Web

Google Commits to Invest Up to $40B in Anthropic

Cohere to Acquire Aleph Alpha

What Is APL and What Can APL Do for You? (2024) [video]

Join the Choredle Android Beta Testers List

The Geometry of Forgetting

Use Built-In VPN in Firefox

The Classic American Diner

Relays for Nimony's Standard Library

Show HN: I'm 15 and built a cryptographic accountability layer for AI agents

mine: A complete, no-frills IDE for Coalton and Common Lisp

Ubuntu 26.04 Released

Ask HN: How are you evaluating AI apps and CLI?