Cutting-edge and innovative AI hardware research from China.
KnuthIsGod•3h ago
Looks like American sanctions are driving a new wave of innovation in China.
" This work addresses that gap by introducing the Ten- sor Manipulation Unit (TMU): a reconfigurable, near-memory hardware block designed to execute data-movement-intensive (DMI) operators efficiently. TMU manipulates long datastreams in a memory-to-memory fashion using a RISC-inspired execution model and a unified addressing abstraction, enabling broad support for both coarse- and fine-grained tensor transformations.
The proposed architecture integrates TMU alongside a TPU within a high-throughput AI SoC, leveraging double buffering and output forwarding to improve pipeline utilization. Fab- ricated in SMIC 40 nm technology, the TMU occupies only 0.019 mm2 while supporting over 10 representative TM operators. Benchmarking shows that TMU alone achieves up to 1413.43× and 8.54× operator-level latency reduction over ARM A72 and NVIDIA Jetson TX2, respectively.
When integrated with the in- house TPU, the complete system achieves a 34.6% reduction in end-to-end inference latency, demonstrating the effectiveness and scalability of reconfigurable tensor manipulation in modern AI SoCs."
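For anyone wondering what a "unified addressing abstraction" for tensor-manipulation operators amounts to in practice, here is a minimal C sketch (my own illustration, not the paper's design): a single strided loop nest, programmed per operator with a descriptor, covers transpose, slice, plain copy, and similar memory-to-memory rearrangements.

```c
/* Hypothetical sketch (not from the paper): one strided address
 * generator, parameterized per operator, illustrating how a single
 * "unified addressing" loop can cover many tensor-manipulation
 * operators in a memory-to-memory fashion. */
#include <stdio.h>
#include <stddef.h>

#define MAX_DIMS 4

typedef struct {
    size_t shape[MAX_DIMS];      /* iteration extents */
    size_t src_stride[MAX_DIMS]; /* element strides into the source */
    size_t dst_stride[MAX_DIMS]; /* element strides into the dest */
    int    ndim;
} tm_descriptor;

/* Memory-to-memory copy driven purely by the descriptor: the same
 * loop nest performs a transpose, a slice, or a plain copy depending
 * on how the strides are programmed. */
static void tm_execute(const tm_descriptor *d,
                       const float *src, float *dst)
{
    size_t idx[MAX_DIMS] = {0};
    for (;;) {
        size_t s = 0, t = 0;
        for (int k = 0; k < d->ndim; ++k) {
            s += idx[k] * d->src_stride[k];
            t += idx[k] * d->dst_stride[k];
        }
        dst[t] = src[s];

        int k = d->ndim - 1;              /* odometer-style increment */
        while (k >= 0 && ++idx[k] == d->shape[k])
            idx[k--] = 0;
        if (k < 0) break;
    }
}

int main(void)
{
    float src[2 * 3] = {0, 1, 2, 3, 4, 5};
    float dst[3 * 2];
    /* 2x3 -> 3x2 transpose: walk dst in row-major order while the
     * source strides are swapped. */
    tm_descriptor t = {
        .shape      = {3, 2},
        .src_stride = {1, 3},   /* read columns of the 2x3 source */
        .dst_stride = {2, 1},   /* write rows of the 3x2 dest */
        .ndim       = 2,
    };
    tm_execute(&t, src, dst);
    for (int i = 0; i < 6; ++i) printf("%g ", dst[i]);
    printf("\n");               /* prints: 0 3 1 4 2 5 */
    return 0;
}
```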
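The double buffering the abstract mentions is the classic ping-pong pattern. A minimal sketch of the idea, with tmu_fill() and tpu_consume() as made-up stand-ins for the real hardware interface (in silicon the two stages overlap; this sequential version just shows the buffer alternation):

```c
/* Hypothetical ping-pong double-buffering sketch: two staging buffers
 * alternate so the TMU can fill one while the TPU drains the other. */
#include <stdio.h>

#define TILE 4
#define NUM_TILES 3

/* Stand-in for the TMU writing a rearranged tile into a buffer. */
static void tmu_fill(float *buf, int tile)
{
    for (int i = 0; i < TILE; ++i)
        buf[i] = (float)(tile * TILE + i);
}

/* Stand-in for the TPU consuming a staged tile. */
static void tpu_consume(const float *buf, int tile)
{
    float sum = 0;
    for (int i = 0; i < TILE; ++i)
        sum += buf[i];
    printf("tile %d consumed, sum = %g\n", tile, sum);
}

int main(void)
{
    float ping[TILE], pong[TILE];
    float *bufs[2] = {ping, pong};

    tmu_fill(bufs[0], 0);                       /* prime buffer 0 */
    for (int t = 0; t < NUM_TILES; ++t) {
        if (t + 1 < NUM_TILES)
            tmu_fill(bufs[(t + 1) & 1], t + 1); /* overlaps in HW */
        tpu_consume(bufs[t & 1], t);
    }
    return 0;
}
```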