Skilled AI agents for embedded and IoT systems development

https://github.com/iot-agent/iot-skillsbench
2•tingjunchen•1h ago

Comments

tingjunchen•1h ago
LLMs and agentic AI systems show immense promise for automated software development. However, applying them to hardware-in-the-loop (HIL) embedded and IoT systems is notoriously difficult due to the tight coupling between software logic, timing constraints, and physical hardware behavior. Code that compiles successfully often fails on real devices.

To bridge this gap, we introduce an open-source, skills-based agentic AI framework for embedded and IoT systems development, together with a benchmark, IoT-SkillsBench. Key highlights:

1. A skills-based agentic framework: a principled approach for injecting structured, domain-specific knowledge into LLM-based agents for reliable embedded and IoT systems development.

2. IoT-SkillsBench: a comprehensive benchmark designed to evaluate AI agents in real-world embedded programming settings, spanning 3 platforms, 23 peripherals, and 42 tasks across 3 difficulty levels.

3. 378 hardware-in-the-loop (HIL) experiments: each task is evaluated under three agent configurations (no-skills, LLM-generated skills, and human-expert skills) and validated on real, physical hardware. The results show that structured human-expert skills achieve near-perfect success rates without relying on retrieval or long-context reasoning.

Ask HN: Disable/Destroy Kharg Island Preferable to Occupation?

1•giardini•3m ago•0 comments

I built an MCP server so your agent stops picking the wrong cloud services

https://github.com/Tlalvarez/Auxiliar-ai
1•thiagolalvarez•5m ago•0 comments

Show HN: CSV Analyzer – natural-language analysis, chat and preview, dashboards

https://csv-analyzer.up.railway.app/
1•bchhabra2490•5m ago•0 comments

Outdoor lighting remote control systems

https://miboxer.com/industry-news/embrace-the-freedom-the-advantages-of-wireless-remote-control-s...
2•miboxer•10m ago•0 comments

A Readable Specification of TLS 1.3

https://www.davidwong.fr/tls13/
1•subset•15m ago•0 comments

The Effect of War-Inflicted Environmental Damage on Free-Ranging Domestic Dogs

https://onlinelibrary.wiley.com/doi/10.1111/eva.70182
1•gdevillers•15m ago•0 comments

Building Useful Agents over Email

https://haulos.com/blog/building-agents-over-email/
2•s4i•17m ago•0 comments

LipoVive (Urgent Report) the Science Behind the Gelatin Trick for Metabolic

https://www.morningstar.com/news/accesswire/1138075msn/lipovive-reviews-shocking-2026-report-what...
1•tafynahu•18m ago•0 comments

macbonk – interactive macOS security hardening CLI

https://github.com/0xhsn/macbonk
2•7asan•23m ago•0 comments

The network is the database: data management for highly distributed systems (2021)

https://dl.acm.org/doi/abs/10.1145/375663.375737
2•teleforce•26m ago•0 comments

The Curious Case of Retro Demo Scene Graphics

https://www.datagubbe.se/aipixels/
3•zdw•29m ago•0 comments

Stanford study reveals AI vision models invent images they never see

https://arxiv.org/abs/2603.21687
2•LionTurtle13•32m ago•0 comments

A history of styling choices leading to native CSS

https://cassidoo.co/post/css-todometer/
1•cassidoo•34m ago•0 comments

Show HN: DrawX - Excalidraw with Back End

https://drawx.ossy.dev
2•postatic•39m ago•1 comment

Helping disaster response teams turn AI into action across Asia

https://openai.com/index/helping-disaster-response-teams-asia
2•surprisetalk•40m ago•0 comments

Tech CEOs suddenly love blaming AI for mass job cuts. Why?

https://www.bbc.com/news/articles/cde5y2x51y8o
3•tartoran•42m ago•0 comments

About the growing verification debt in software

https://clifford.ressel.fyi/blog/cost-to-implement-vs-verify/
3•csressel•42m ago•0 comments

Dow Doubles Plastics Price Hike as Iran War Blocks Supply Route

https://www.wsj.com/livecoverage/iran-war-us-israel-news-updates-2026/card/dow-doubles-plastics-p...
2•walterbell•43m ago•0 comments

President Trump Gaggles with Press on Air Force One En Route

https://www.npr.org/2025/09/08/nx-s1-5526066/leni-riefenstahl-nazi-filmmaker-new-documentary
3•KnuthIsGod•45m ago•3 comments

Show HN: Eli5.cc – type any topic, get a simple explanation and visual diagram

https://eli5.cc
1•digi_wares•45m ago•1 comment

Social Media Addiction Trial Should Lead to Platform Redesigns

https://spectrum.ieee.org/social-media-trial
1•jruohonen•46m ago•0 comments

A look at what's possible with BPF arenas (2025)

https://lwn.net/Articles/1019885/
2•teleforce•52m ago•0 comments

Ask HN: Best bank for new startup in the US

1•thepace•53m ago•0 comments

Tribe v2: Predictive Foundation Model on Human Brain Processing Complex Stimuli

https://ai.meta.com/blog/tribe-v2-brain-predictive-foundation-model/?_fb_noscript=1
2•walterbell•55m ago•0 comments

HD Audio Driver for Windows 98SE / Me

https://github.com/andrew-hoffman/wdmhda
10•userbinator•55m ago•0 comments

Towards end-to-end automation of AI research

https://www.nature.com/articles/s41586-026-10265-5
3•baylearn•57m ago•0 comments

Excel2r – R package that migrates Excel workbooks to standalone R scripts

https://github.com/emantzoo/excel2r
2•bthallplz•59m ago•0 comments

Apple scales back its AI ambitions and sticks to selling hardware

https://www.neowin.net/news/report-apple-scales-back-its-ai-ambitions-and-sticks-to-selling-hardw...
4•bundie•1h ago•0 comments

Eval Set Generation - accelerate your eval workflow

https://dutchmanlabs.com/
1•thesarsour•1h ago•0 comments

LLMnesia – Local-first search across your AI conversations

https://chromewebstore.google.com/detail/llmnesia/leekfgbdojiaabifbjbbgiiclannjdkf
2•keiranflynn•1h ago•0 comments