Show HN: Tessera – An open protocol for AI-to-AI knowledge transfer

https://github.com/incocreativedev/tessera-core

3•kirkmaddocks•2h ago

Tessera is an activation-based protocol that lets trained ML models transfer knowledge to other models across architectures. Instead of dumping weight tensors, it encodes what a model has learnt — activations, feature representations, behavioural patterns — into self-describing tokens that a receiving model can decode into its own architecture.

The reference implementation (tessera-core) is a Python/PyTorch library. Current benchmarks show positive transfer across CNN, Transformer, and LSTM pairs. It runs on CPU and the demo finishes in under 60 seconds.

Happy to answer questions about the protocol design, the wire format, or the benchmark methodology.

Comments

0xecro1•2h ago

Interesting approach. I work in embedded Linux/edge AI where we constantly struggle to move knowledge from large training models down to quantized INT8 models on constrained hardware (ARM Cortex-A class). Have you tested transfer to quantized or pruned targets? If the behavioural encoding survives that compression, this could be a much cleaner path than classical distillation for on-device deployment.

kirkmaddocks•2h ago

We haven't built quantisation-aware transfer yet, but the architecture lends itself to it better than you might expect.

Mode A (activation transfer) operates at the representation level, not the parameter level. The source model's knowledge gets projected into a 2048-dim hub space — the receiving model doesn't need to match architecturally or in precision. A 200M FP32 training model and a 5M INT8 edge model can both have UHS encoders/decoders. The hub space is agnostic to what's underneath.
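To make the hub-space idea concrete, here is a minimal sketch, not tessera-core's actual API. It assumes the encoder and decoder are plain linear maps with random weights standing in for trained ones; `make_linear`, `apply_linear`, and the widths 256 and 32 are hypothetical. Only the 2048-dim hub width comes from the description above.

```python
import math
import random

HUB_DIM = 2048  # hub width quoted in the comment above


def make_linear(out_dim, in_dim, rng):
    """Random linear map standing in for a trained encoder or decoder (assumption)."""
    scale = 1.0 / math.sqrt(in_dim)
    return [[rng.gauss(0.0, scale) for _ in range(in_dim)] for _ in range(out_dim)]


def apply_linear(weight, vec):
    """Plain matrix-vector product."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weight]


rng = random.Random(0)
SRC_DIM, DST_DIM = 256, 32  # hypothetical hidden widths of source and receiver

src_encoder = make_linear(HUB_DIM, SRC_DIM, rng)  # source activations -> hub
dst_decoder = make_linear(DST_DIM, HUB_DIM, rng)  # hub -> receiver activations

src_activation = [rng.gauss(0.0, 1.0) for _ in range(SRC_DIM)]
hub_vector = apply_linear(src_encoder, src_activation)
dst_activation = apply_linear(dst_decoder, hub_vector)

print(len(hub_vector), len(dst_activation))
```

The point of the sketch is only the shape contract: the two models never see each other's width, just the shared hub dimension.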

Mode B (behavioural) is probably the most interesting path for your use case. It transfers decision boundaries rather than activations or weights. If the quantised model can reproduce the input-output mapping, internal precision is irrelevant.
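As a rough illustration of behavioural transfer (a toy sketch, not the Mode B implementation), the student below is fitted only to the teacher's outputs on probe inputs and never sees its parameters. The teacher rule, the perceptron student, and the probe sampling are all assumptions made for the example.

```python
import random


def teacher(x):
    """Stand-in teacher: a fixed linear decision rule over 2-D inputs."""
    return 1 if 2.0 * x[0] - 1.0 * x[1] + 0.3 > 0 else -1


def train_student(probes, labels, epochs=50):
    """Perceptron fitted only to the teacher's labels, never to its weights."""
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, y in zip(probes, labels):
            if y * (w[0] * x[0] + w[1] * x[1] + b) <= 0:  # misclassified probe
                w[0] += y * x[0]
                w[1] += y * x[1]
                b += y
    return w, b


def student(params, x):
    w, b = params
    return 1 if w[0] * x[0] + w[1] * x[1] + b > 0 else -1


def sample(rng, n):
    return [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(n)]


rng = random.Random(0)
probes = sample(rng, 200)
params = train_student(probes, [teacher(x) for x in probes])

# Agreement on fresh inputs measures how well the decision boundary transferred.
held_out = sample(rng, 500)
agreement = sum(student(params, x) == teacher(x) for x in held_out) / len(held_out)
print(f"teacher/student agreement: {agreement:.2%}")
```

Agreement on held-out inputs is the quantity that matters here; nothing about the student's internal precision enters the check.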

It's similar in spirit to distillation but decoupled through the hub space: teacher and student don't need to be online simultaneously, and you get a full audit trail of what knowledge went where (which matters if you're shipping medical/industrial edge models under the EU AI Act).

The gap today is the decoder side. DecoderMLP outputs FP32. We'd need a quantisation-aware variant that respects the INT8 grid — straight-through estimator at minimum, learned quantisation boundaries ideally. We'd also want empirical drift characterisation across FP32→FP16→INT8→INT4 so you'd know your expected fidelity floor for a given target.
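A drift characterisation along these lines could start from something like the sketch below. It assumes symmetric uniform quantisation onto a signed integer grid and uses cosine drift as the fidelity measure; both are illustrative choices, and `fake_quantise` is a hypothetical helper rather than part of tessera-core. FP16 is left out because it is not a uniform integer grid, and no straight-through estimator is involved since nothing is trained here.

```python
import math
import random


def fake_quantise(values, bits):
    """Round values onto a symmetric signed integer grid, then rescale to floats."""
    qmax = 2 ** (bits - 1) - 1  # 127 for INT8, 7 for INT4
    scale = max(abs(v) for v in values) / qmax
    return [max(-qmax, min(qmax, round(v / scale))) * scale for v in values]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


rng = random.Random(0)
# Stand-in for an FP32 decoder output vector (random data, not real activations).
decoded = [rng.gauss(0.0, 1.0) for _ in range(2048)]

# Cosine drift = 1 - cosine similarity between original and quantised vectors.
drift = {bits: 1.0 - cosine(decoded, fake_quantise(decoded, bits)) for bits in (8, 4)}
for bits, d in drift.items():
    print(f"INT{bits}: cosine drift {d:.6f}")
```

In quantisation-aware training the same rounding step would sit in the forward pass, with a straight-through estimator treating it as the identity in the backward pass.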

The swarm angle is where it gets genuinely useful for edge fleets. If you've got N devices training locally on on-site data, they contribute quantised-model tokens back to a full-precision aggregator. The robust aggregation strategy (Huber-style cosine clipping) handles quantisation noise across heterogeneous devices naturally.
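One possible reading of "Huber-style cosine clipping" is sketched below; this is an assumed interpretation, not the aggregation code in the repo. Each contribution gets weight 1 when its cosine distance to a coordinate-wise median reference is within `delta`, and weight `delta / distance` beyond that. The function name, the reference choice, and `delta=0.2` are all hypothetical.

```python
import math


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def robust_aggregate(tokens, delta=0.2):
    """Weighted mean that down-weights tokens far from the median direction."""
    dim = len(tokens[0])
    # Coordinate-wise (upper) median as a reference that outliers cannot drag far.
    reference = [sorted(t[i] for t in tokens)[len(tokens) // 2] for i in range(dim)]
    weights = []
    for t in tokens:
        dist = 1.0 - cosine(t, reference)
        # Huber-style weight: full weight inside delta, shrinking as delta/dist outside.
        weights.append(1.0 if dist <= delta else delta / dist)
    total = sum(weights)
    merged = [sum(w * t[i] for w, t in zip(weights, tokens)) / total for i in range(dim)]
    return merged, weights


tokens = [
    [1.0, 0.9, 1.1],
    [0.9, 1.0, 1.0],
    [1.1, 1.0, 0.9],
    [-1.0, -1.0, 1.0],  # contribution pointing in a very different direction
]
merged, weights = robust_aggregate(tokens)
print(weights)
print(merged)
```

The three mutually consistent contributions keep full weight while the divergent one is shrunk, so a single noisy device has bounded influence on the merged vector.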

We're planning a quantisation-aware transfer module next. If you're interested in testing against real Cortex-A INT8 workloads, we'd welcome the collaboration — repo is at github.com/incocreativedev/tessera-core.
