frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: UI testing using multimodal LLMs

https://kodefreeze.com
1•kodefreeze•8h ago
Hi HN,

I built this tool to solve the "flakiness" problem in UI testing. Existing AI agents often struggle with precise interactions, while traditional frameworks (Selenium/Playwright) break whenever the DOM changes.

The Approach: Instead of relying on hard-coded selectors or pure computer vision, I’m using a multi-agent system powered by multimodal LLMs. We pass both the screenshot (pixels) and the browser context (network requests, console logs, etc) to the model. This allows the agent to:

"See" the UI like a user and accurately map semantic intent ("Click the Signup button") to precise coordinates even if the layout shifts.

The goal is to mimic natural user behavior rather than following a predefined script. It handles exploratory testing and finds visual bugs that code-based assertions miss.

I’d love feedback on the implementation or to discuss the challenges of using LLMs for deterministic testing.

India proposes forcing smartphone makers to give source code

https://www.reuters.com/world/china/india-proposes-forcing-smartphone-makers-give-source-code-sec...
1•alabhyajindal•32s ago•0 comments

Iran Shuts Down Starlink Internet for First Time

https://www.forbes.com/sites/zakdoffman/2026/01/11/kill-switch-iran-shuts-down-starlink-internet-...
2•neom•2m ago•0 comments

Ask HN: Manus.im (Meta) left me hanging for 7 days – is this normal?

1•ebikebilly•8m ago•0 comments

Show HN: Weekly code audits for vibe coders

https://pyscn.ludo-tech.org/
1•d-yoda•8m ago•1 comments

Recreate Pluribus Intro from Scratch

https://medium.com/@skewcy/recreate-pluribus-intro-from-scratch-8f64c7ff50a3
2•skewcy•13m ago•0 comments

Google: Don't make "bite-sized" content for LLMs

https://arstechnica.com/google/2026/01/google-dont-make-bite-sized-content-for-llms-if-you-care-a...
2•cebert•17m ago•0 comments

NPM-agentskills – Bundle AI agent documentation with NPM packages

https://github.com/onmax/npm-agentskills
1•onmax•21m ago•1 comments

Show HN: A tiny free job application tracker so you stop forgetting follow-ups

https://applytrack.netlify.app/
2•p-stanchev•22m ago•1 comments

A Python Integration of Asset Allocation Based on Modern Portfolio Theory

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5915004
2•7777777phil•22m ago•0 comments

Recommended sources to read up on new tech and thinking

2•wnscooke•24m ago•0 comments

Show HN: UebGuard – Email Protection to Stop Phishing Before Users Click

https://www.uebguard.com/
2•arlindb•25m ago•0 comments

Djjkk

1•jzksbvyskb•26m ago•0 comments

A coder considers the waning days of the craft (2023)

https://www.newyorker.com/magazine/2023/11/20/a-coder-considers-the-waning-days-of-the-craft
1•jsomers•28m ago•0 comments

Designing a Design Contract for AI

https://askcodi.substack.com/p/designing-a-design-contract-for-ai
1•himalayansailor•29m ago•0 comments

Is the Iranian Regime About to Collapse?

https://www.theatlantic.com/international/2026/01/iran-revolution-protests-collapse/685578/
4•mpweiher•34m ago•1 comments

Why Object of Arrays beat interleaved arrays: a JavaScript performance issue

https://www.royalbhati.com/posts/js-array-vs-typedarray
1•howToTestFE•34m ago•0 comments

Tiny Coder – AI coding agent in ~300 LOC writing itself

https://github.com/xrip/tinycode
1•xrip•35m ago•0 comments

Will LLMs Help or Hurt New Programming Languages?

https://blog.flix.dev/blog/will-llms-help-or-hurt-new-programming-languages/
3•appliku•35m ago•0 comments

BasiliskII Macintosh 68k Emulator Ported to ESP32-P4 / M5Stack Tab5

https://github.com/amcchord/M5Tab-Macintosh
1•rcarmo•37m ago•0 comments

Show HN: Meshii – Open-source AI tool to generate 3D meshes for game development

https://github.com/sciences44/meshii
2•sciences44•40m ago•1 comments

The Ralph Wiggum Loop from first principles (by the creator of Ralph)

https://www.youtube.com/watch?v=4Nna09dG_c0
1•ghuntley•41m ago•0 comments

Matrix.envs.net Is Shutting Down

https://envs.net/
1•Sami_Lehtinen•43m ago•0 comments

Lava Lamps Protect Your Data [video]

https://www.youtube.com/shorts/oW6YwSUyfzw
2•doener•44m ago•0 comments

Matrix.envs.net Is Shutting Down

https://matrix.to/#/!dDZYx7e4nzZjqR2tnC6v1pDbZX52HJVfQRuuBpinG9U/$QUY4XtMR2WS56N-VN9na768Fd37_N7Y...
1•Sami_Lehtinen•45m ago•2 comments

The Permanent Emergency

https://www.astralcodexten.com/p/the-permanent-emergency
1•ipnon•46m ago•0 comments

Ask HN: Are we overthinking maintainability of LLM written code?

1•grainier•47m ago•1 comments

Show HN: Ultralight iOS apps (~1 MB), no tracking, on-device only

https://mindbebop.com/
1•kentaroyamauchi•50m ago•0 comments

YouTube Playlist Length Calculator

https://ytplaylistlength.one/
1•wangxin199•50m ago•0 comments

MCP Server with X402

https://twitter.com/fveiras_/status/2010083092502069348
1•fveiras•52m ago•0 comments

Why Selling WhatsApp to Facebook Would Be the Biggest Mistake (2012)

https://www.forbes.com/sites/ericjackson/2012/12/03/why-selling-whatsapp-to-facebook-would-be-the...
4•chistev•53m ago•3 comments