Show HN: STDM – Make Your Documents and Data Think by Embedding LLM Instructions

https://github.com/csiro/stdm
1•benl_c•6mo ago
Hi HN, I’m Ben from CSIRO, Australia’s national science agency. We’ve been exploring how to make data and documents "think" when you use them with LLMs. We call it Self-Thinking Data Manifests (STDM). The idea is to embed plain-text instructions directly within files that tell an LLM how it should think about that data and interact with the user. We demonstrate it with PDF and HTML documents, and hope it can be extended to many other formats in the future.

Why Thinking Data?

* *Enhance PDF drag-and-drop* People already drag scientific papers and reports into LLMs to chat with them, but the interaction is often generic. STDM gives authors more control and customisation in these scenarios. It inverts custom chat-to-pdf systems: instead of building custom RAG interfaces on top of documents, we’re programming the LLM from within the document itself.

* *Author-directed interpretation* STDM helps ensure LLMs approach content with the author’s intended context and purpose, especially for complex scientific or technical data.

* *Smarter documents* Files with embedded STDM carry their own interactive logic, analysis routines, or guided explorations, making them more like mini-applications.

* *Towards in-document LLM programming* We see STDM as a step toward a future where data and instructions combine to form a kind of memory and quasi-procedural instruction set for LLMs; perhaps entire programs could live inside agentic LLM contexts using this approach.

To build an STDM you define a GOAL for the LLM, set CONSTRAINTS for interpretation, suggest REQUESTED_TOOLS (such as code_interpreter for analysis or web_retrieval for context), and optionally sketch out a CUSTOM_UI_DEFINITION (e.g. a text-based UI, UX, or specific output format). When a user loads an STDM-enabled file into a capable LLM and explicitly tells the LLM to follow these instructions, the LLM uses the embedded manifest to guide its behaviour.

A mandatory Safety Preamble within the STDM instructs the LLM to await explicit user command and consent before executing any significant actions (especially tool use), ensuring the user is in control.
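
As a rough illustration of how those fields could fit together, here is a minimal Python sketch. Only the field names (GOAL, CONSTRAINTS, REQUESTED_TOOLS, CUSTOM_UI_DEFINITION) and the idea of a Safety Preamble come from the description above. The plain-text layout, the preamble wording, the `Manifest` class, and the choice to embed the manifest as an HTML comment are all assumptions made for illustration; the v0.1 spec in the repo is the authority on the real format.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical wording; the real Safety Preamble text is defined by the spec.
SAFETY_PREAMBLE = (
    "Await an explicit user command and consent before executing any "
    "significant action, especially tool use."
)


@dataclass
class Manifest:
    """Illustrative container for the four fields named in the post."""
    goal: str
    constraints: List[str] = field(default_factory=list)
    requested_tools: List[str] = field(default_factory=list)
    custom_ui_definition: Optional[str] = None

    def render(self) -> str:
        """Render as plain text. The line layout here is an assumption."""
        lines = [
            "STDM",
            f"SAFETY_PREAMBLE: {SAFETY_PREAMBLE}",
            f"GOAL: {self.goal}",
        ]
        lines += [f"CONSTRAINT: {c}" for c in self.constraints]
        if self.requested_tools:
            lines.append("REQUESTED_TOOLS: " + ", ".join(self.requested_tools))
        if self.custom_ui_definition:
            lines.append(f"CUSTOM_UI_DEFINITION: {self.custom_ui_definition}")
        return "\n".join(lines)


def embed_in_html(html: str, manifest: Manifest) -> str:
    """Insert the manifest as an HTML comment just before the last </body>.

    Assumes the manifest text does not itself contain "-->", which would
    end the comment early.
    """
    comment = f"<!--\n{manifest.render()}\n-->\n"
    head, sep, tail = html.rpartition("</body>")
    if not sep:
        # No closing body tag found: append the comment at the end instead.
        return html + "\n" + comment
    return head + comment + sep + tail


if __name__ == "__main__":
    manifest = Manifest(
        goal="Help the reader explore the floodplain study",
        constraints=["Answer only from the document unless the user allows tools"],
        requested_tools=["code_interpreter", "web_retrieval"],
    )
    print(embed_in_html("<html><body><p>Study text</p></body></html>", manifest))
```

Putting the preamble first in the rendered text is a deliberate choice in this sketch, so a model reading top to bottom meets the consent rule before any goal or tool request.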

STDM is designed to be model-agnostic. It has been tested with GPT, Claude, and Gemini; if an LLM can read text and follow structured instructions, it should work with STDM. See it in action (save the file, upload/paste it into your LLM, then tell the LLM: Follow the STDM instructions in this document):

* Interactive Floodplain Study (HTML). This one can think about fetching live news if you allow it: https://csiro.github.io/stdm/examples/floodplain.html

* Same study (PDF). See how it thinks to answer questions based on its embedded guide: https://csiro.github.io/stdm/examples/floodplain.pdf

* The Brain (GitHub Spec v0.1, more examples, 2-min explainer video in README): https://github.com/csiro/stdm

This is an early-stage v0.1 specification and very much an experiment. We’re excited by the potential of data that can explain itself or guide its own analysis via an LLM, data that can think! We’d love to hear your thoughts. Is this a useful direction for programming LLMs or creating more dynamic documents? What are the pitfalls (we’ve focused on explicit invocation and consent as key safeguards)? How might you use data that thinks or programs its own interaction?
