frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
2•AlexeyBrin•1m ago•0 comments

What the longevity experts don't tell you

https://machielreyneke.com/blog/longevity-lessons/
1•machielrey•2m ago•0 comments

Monzo wrongly denied refunds to fraud and scam victims

https://www.theguardian.com/money/2026/feb/07/monzo-natwest-hsbc-refunds-fraud-scam-fos-ombudsman
2•tablets•7m ago•0 comments

They were drawn to Korea with dreams of K-pop stardom – but then let down

https://www.bbc.com/news/articles/cvgnq9rwyqno
2•breve•9m ago•0 comments

Show HN: AI-Powered Merchant Intelligence

https://nodee.co
1•jjkirsch•11m ago•0 comments

Bash parallel tasks and error handling

https://github.com/themattrix/bash-concurrent
2•pastage•11m ago•0 comments

Let's compile Quake like it's 1997

https://fabiensanglard.net/compile_like_1997/index.html
1•billiob•12m ago•0 comments

Reverse Engineering Medium.com's Editor: How Copy, Paste, and Images Work

https://app.writtte.com/read/gP0H6W5
2•birdculture•18m ago•0 comments

Go 1.22, SQLite, and Next.js: The "Boring" Back End

https://mohammedeabdelaziz.github.io/articles/go-next-pt-2
1•mohammede•23m ago•0 comments

Laibach the Whistleblowers [video]

https://www.youtube.com/watch?v=c6Mx2mxpaCY
1•KnuthIsGod•25m ago•1 comments

Slop News - HN front page right now hallucinated as 100% AI SLOP

https://slop-news.pages.dev/slop-news
1•keepamovin•29m ago•1 comments

Economists vs. Technologists on AI

https://ideasindevelopment.substack.com/p/economists-vs-technologists-on-ai
1•econlmics•31m ago•0 comments

Life at the Edge

https://asadk.com/p/edge
3•tosh•37m ago•0 comments

RISC-V Vector Primer

https://github.com/simplex-micro/riscv-vector-primer/blob/main/index.md
4•oxxoxoxooo•41m ago•1 comments

Show HN: Invoxo – Invoicing with automatic EU VAT for cross-border services

2•InvoxoEU•41m ago•0 comments

A Tale of Two Standards, POSIX and Win32 (2005)

https://www.samba.org/samba/news/articles/low_point/tale_two_stds_os2.html
3•goranmoomin•45m ago•0 comments

Ask HN: Is the Downfall of SaaS Started?

3•throwaw12•46m ago•0 comments

Flirt: The Native Backend

https://blog.buenzli.dev/flirt-native-backend/
2•senekor•48m ago•0 comments

OpenAI's Latest Platform Targets Enterprise Customers

https://aibusiness.com/agentic-ai/openai-s-latest-platform-targets-enterprise-customers
1•myk-e•50m ago•0 comments

Goldman Sachs taps Anthropic's Claude to automate accounting, compliance roles

https://www.cnbc.com/2026/02/06/anthropic-goldman-sachs-ai-model-accounting.html
3•myk-e•53m ago•5 comments

Ai.com bought by Crypto.com founder for $70M in biggest-ever website name deal

https://www.ft.com/content/83488628-8dfd-4060-a7b0-71b1bb012785
1•1vuio0pswjnm7•54m ago•1 comments

Big Tech's AI Push Is Costing More Than the Moon Landing

https://www.wsj.com/tech/ai/ai-spending-tech-companies-compared-02b90046
4•1vuio0pswjnm7•56m ago•0 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
2•1vuio0pswjnm7•58m ago•0 comments

Suno, AI Music, and the Bad Future [video]

https://www.youtube.com/watch?v=U8dcFhF0Dlk
1•askl•59m ago•2 comments

Ask HN: How are researchers using AlphaFold in 2026?

1•jocho12•1h ago•0 comments

Running the "Reflections on Trusting Trust" Compiler

https://spawn-queue.acm.org/doi/10.1145/3786614
1•devooops•1h ago•0 comments

Watermark API – $0.01/image, 10x cheaper than Cloudinary

https://api-production-caa8.up.railway.app/docs
1•lembergs•1h ago•1 comments

Now send your marketing campaigns directly from ChatGPT

https://www.mail-o-mail.com/
1•avallark•1h ago•1 comments

Queueing Theory v2: DORA metrics, queue-of-queues, chi-alpha-beta-sigma notation

https://github.com/joelparkerhenderson/queueing-theory
1•jph•1h ago•0 comments

Show HN: Hibana – choreography-first protocol safety for Rust

https://hibanaworks.dev/
5•o8vm•1h ago•1 comments
Open in hackernews

Ask HN: Why aren't AIs being used as app beta testers yet?

15•amichail•7mo ago
For example, why don't beta testing services such as TestFlight have ChatGPT as a possible beta tester along with the human testers?

Comments

duxup•7mo ago
I'm going to throw out my own ignorant theory.

AIs that I find useful are still just LLMs and LLMs power comes from having a massive amount of text to work with to string together word math and come up with something ok. That's a lot of data that comes together to get things ... kinda right... sometimes.

I don't think there's that data set for "use an app" yet.

We've seen from "AI plays games" efforts that there have been some pretty spectacular failures. It seems like "use app" is a different problem.

cheevly•7mo ago
LLMs have literally won Pokemon. Im pretty sure that using an app is 10x simpler.
Vilian•7mo ago
A lot simpler to run pokemon than test an app, the game play by itself sometimes
diggan•7mo ago
> A lot simpler to run pokemon than test an app

Yes and no, I'd say. On one hand, apps tend to be used by even the most dumb person (they are "users" after all ;) ) and I'm sure there are more people out there who can use most apps enough than people who can beat Pokemon, even if one might generally be easier than the other.

It's kind of hard to judge though I guess, I played Pokemon (Red) last time when it launched in my country and I was like 8 or something, maybe I underestimate the average person but I feel like I overestimate people generally.

v5v3•7mo ago
Are llm testers doing anything traditional scripts with for loops can't?
postalrat•7mo ago
llm testers have for loops so they can do everything traditional scripts with for loops can plus more.
afrederico•7mo ago
They should totally be able to. If there's "vibe coding" there should be "vibe testing." We're working on just such a product (https://actory.ai); right now it only does websites but just imagine when we turn it on mobile/apps, etc. How cool would that be?
aristofun•7mo ago
Because for meaningful tests of an app (assuming b2c or b2b for end users) you are supposed to be or imitate a human being.

Current AI is not even designed to do that. It is just a very sophisticated auto-complete.

It is sophisticated enough to fool some VCs that you can chop your round peg into square hole. But there is no ground to expect a scalable solution.

gametorch•7mo ago
Eh, I disagree. Lots of valuable open source code purely written by AI has already been shipped.
aristofun•7mo ago
Give me 1 decent example of code "purely" written by AI
gametorch•7mo ago
https://github.com/gametorch/image_to_pixel_art_wasm

Thousands of users. 40+ GitHub stars. Original draft took 30 minutes. Added numerous feature requests and each took like 5 minutes a pop.

I never wrote a single line of that code.

Furthermore, my startup, https://gametorch.app/ has 110 sign ups, paying users, millions of impressions. Never wrote any of that code either. Typing it out at ~100 wpm is far too slow.

drakonka•7mo ago
They are; we're working on agents for web application testing over at qa.tech.
HeyLaughingBoy•7mo ago
Anecdotally, I know someone who tried to have ChatGPT generate unit tests and it was an abject failure.
cheevly•7mo ago
I know someone that generated unit tests successfully.
whoknowsidont•7mo ago
And I know exactly which one of these is an enterprise B2B app/platform.
haiku2077•7mo ago
I generate tests with Claude almost every day.
owebmaster•7mo ago
I have generated unit tests successfully, how did the someone you know failed?
gametorch•7mo ago
I generated tons of valuable code with a bunch of GitHub stars, paying users, hundreds of signups, millions of impressions. Just chipping in my anecdote.
danbrooks•7mo ago
I worked with a team that did this for the Facebook app.

https://engineering.fb.com/2018/05/02/developer-tools/sapien...

bravesoul2•7mo ago
That's a good idea. You are on to something
shibatanaoto•7mo ago
I think it depends on the situation. Unit test can be done by Claude code. I use it everyday. For E2E testing, browser-based tools are already pretty convenient. AI could definitely help by suggesting UX improvements, but setting up a smooth workflow is still tricky. You’d need to figure out things like where to put the AI’s feedback, when it should kick off testing, and who’s going to sort through all the suggestions it generates. But technically it can be useful and quality is good enough.
Nerd_Nest•7mo ago
I’m still torn on this. On one hand, memory could make ChatGPT more useful, especially for people using it regularly for work or coding. But on the other hand, the idea that it “remembers” me just feels a little uncomfortable.

I’d want more control over what’s remembered and when. Curious if anyone here has used this yet — is it actually helpful in practice?

logic_node•7mo ago
I’ve been trying it out recently, mostly for writing and summarizing research. The memory feels subtle so far — it doesn’t jump in unless you really build on past prompts.

That said, I totally agree about control. I wish there was a more obvious way to “pause” or “reset” memory mid-session instead of diving into settings. It’s useful, but still a little opaque.

diggan•7mo ago
> I’d want more control over what’s remembered and when. Curious if anyone here has used this yet

I use the "memory" feature of ChatGPT, and taking a look right now, it seems to have about ~30 items saved from me, some of them are like "Is using egui for a UI task, particularly related to configuring smooth automatic scrolling in a scrollarea." which is useful for maybe the ~3 chats I had about it, and also other things like "Prefers more accuracy in terminology and is looking to represent LLMs in a detailed and structured way." that are more broadly applicable.

Then you can obviously remove any of them, and also manually add by telling it explicitly you want something added.

I'm not sure of its usefulness, I guess it's nice that it correctly "knows" I'm mostly on Arch Linux most of the time but have my servers with NixOS, so if I ask it to create new unix commands I usually get something that works on both, or two versions. But sometimes it also incorrectly infers something because I didn't specify otherwise in the prompt and didn't think of it, but it could see something from the memories.

muzani•7mo ago
https://docs.maestro.dev/

It works without AI, but there's a MCP and stuff, so you should be able to connect Claude etc with your emulator/device now.

rajkumarsekar•7mo ago
One big reason AI isn’t doing much beta testing yet is that it doesn’t use apps the way humans do. It doesn’t get confused, frustrated, or delighted by a clever UI. Most bugs that matter, like a button that’s in the wrong place, a flow that doesn’t make sense, or something that feels off, are things a human notices because they’re actually trying to do something with the app.

Also, training an AI to navigate and test every type of app takes serious setup. Every app works differently, and AI needs context: what’s the goal? what’s a normal result? what counts as broken? Without that, it just pokes around randomly or follows a script, not much better than traditional automation.

That said, we’re getting close. Some teams already use LLMs to write test cases or spot UI issues in screenshots. Give it a couple years, and you might actually see TestFlight bots pointing out bugs before users ever get there.