It opens up absolutely bonkers capabilities.
There is tons of more complexity to sandboxing, I agree!
[0] https://github.com/Barre/ZeroFS
[1] https://github.com/Barre/ZeroFS?tab=readme-ov-file#why-nfs-a...
I've done exactly that with the Filestash [1] virtual filesystem plugin [2] that can connect to any possible storage be it sftp, s3, gdrive, dropbox, ftp, a open directory, mysql, postgres and literally any storage the possibly exist. It lets you make chroot and restrict user access on top of it [3]
ref:
[1]: https://github.com/mickael-kerjean/filestash
[2]: https://www.filestash.app/docs/guide/virtual-filesystem.html
[3]: https://imgur.com/Ewk3nAg
The whole thing is available as a MCP and been published to the openai marketplace since around Christmas but somehow I'm still pending for review.
heavyset_go•1h ago
Implementing a database abstraction as a file system for an LLM feels like an extra layer of indirection for indirection's sake: just have the LLM write some views/queries/stored procs and give it sane access permissions.
LLMs are smart enough to use databases, email, etc without needing a FUSE layer to do so, and permissions/views/etc will keep it from doing or seeing stuff it shouldn't. You'll be keeping access and permissions where they belong, and not in a FUSE layer, and you won't have to maintain a weird abstraction that's annoying/hampered with licensing issues if you want to deploy it cross platform.
Also, your simplified FUSE abstraction will not map accurately to the state of the world unless you're really comprehensive with your implementation, and at that point, you might as well be interacting directly in order to handle that state accurately.
jakobem•1h ago
I think there is a gap between “real file systems” and “non file things in a database” where mapping your application representation of things to a filesystem is useful. Basically all those platforms that let users upload files for different purposes and work with them (ex Google Drive, notion, etc). In those cases representing files to an agent via a filesystem is the more intuitive and powerful interface compared to some home grown tools that the model never saw during training.
ximeng•37m ago
jakobem•27m ago
outofpaper•11m ago