After the first test, they have already learned something, and that memory will interfere with further tests.
AI could be very useful here: imitate human behavior (can multimodal LLMs "read" an interface screenshot and pretend to interact with it? Are there tools that can interpret what the LLM answers, e.g. "I'll try to click 'Details'", and then feed it the next screenshot?), but then immediately forget everything when a different version of the interface is presented.
Bonus points if you can add "personas" to the LLM (e.g. "you are a hurried user who barely reads the text", "you are a patient beginner who studies the screen carefully before trying anything", etc.).
Maybe all of this is already available with agents and currently in use?
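To make the idea concrete, here is a rough Python sketch of what such a loop might look like. Everything named in it is an assumption for illustration: call_llm stands in for whatever multimodal chat API you use, take_screenshot and click would be backed by a browser automation tool such as Playwright, and the "ACTION:" reply format is just an invented convention so the response can be parsed. The "forgetting" part comes for free, because every session starts from an empty message list.

```python
import base64
import re

# Hypothetical stubs -- wire these up to your own stack:
# call_llm() to any multimodal chat API, take_screenshot()/click()
# to a browser automation tool (e.g. Playwright).
def call_llm(messages: list) -> str: ...
def take_screenshot() -> bytes: ...
def click(label: str) -> None: ...

PERSONAS = {
    "hurried": "You are a hurried user who barely reads the text on screen.",
    "patient_beginner": "You are a patient beginner who studies the screen "
                        "carefully before trying anything.",
}

def screenshot_part() -> dict:
    """Package the current screenshot as an image message part
    (the exact shape of this dict depends on the provider)."""
    return {"type": "image", "data": base64.b64encode(take_screenshot()).decode()}

def run_session(persona: str, max_steps: int = 10) -> list[str]:
    """One simulated usability session. A fresh message list means the
    'user' has no memory of any previous run or interface version."""
    messages = [{
        "role": "system",
        "content": PERSONAS[persona]
        + " You are trying an unfamiliar interface. After each screenshot, "
          "answer with ACTION: click '<label>' or ACTION: done.",
    }]
    transcript = []
    for _ in range(max_steps):
        messages.append({"role": "user", "content": [
            screenshot_part(),
            {"type": "text", "text": "What do you do next, and why?"},
        ]})
        reply = call_llm(messages)
        messages.append({"role": "assistant", "content": reply})
        transcript.append(reply)
        m = re.search(r"ACTION:\s*click\s*'([^']+)'", reply)
        if not m:
            break              # model says it's done (or got confused)
        click(m.group(1))      # apply the action; next turn sees the new screenshot
    return transcript

# Each call starts from scratch, so every run is a "first-time user":
# transcripts = [run_session("hurried") for _ in range(20)]
```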