news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Mm-ctx – fast, multimodal context for agents

https://huggingface.co/posts/spillai/891696740911772

2•visioninmyblood•55m ago

Comments

visioninmyblood•55m ago

LLM-based agents handle text incredibly well, but images, videos, or PDFs with visual content are hard to interpret. mm-ctx gives your CLI agent multi-modal skills.

Try it interactively in Spaces: vlm-run/mm-ctx

Readme: https://vlm-run.github.io/mm/ PyPI: https://pypi.org/project/mm-ctx SKILL.md: https://github.com/vlm-run/skills/blob/main/skills/mm-cli-sk...

mm-ctx is meant to feel familiar: the UNIX tools we already love (find/cat/grep/wc), rebuilt for file types LLMs can't read natively and designed to work with agents via the CLI. - mm grep "invoice #1234" ~/Downloads searches across PDFs and returns line-numbered matches - mm cat <document>.pdf returns a metadata description of the file - mm cat <photo>.jpg returns a caption of the photo - mm cat <video>.mp4 returns a caption of the video

A few things we obsessed over: Speed: Rust core for the hot paths Local-first, BYO model: Uses any OpenAI-compatible endpoint: Ollama, vLLM/SGLang, LMStudio with any multimodal LLM (Gemma4, Qwen3.5, GLM-4.6V). Composable: stdin + structured outputs Drops into any agent via mm-cli-skills: Claude Code, Codex, Gemini CLI, OpenClaw.

We’d love to hear your feedback! Especially on the CLI and what file types and workflows you would like to see next.

Cherry Kearton: The eccentric influence on a young Sir David Attenborough

https://www.bbc.com/future/article/20260507-cherry-kearton-the-eccentric-influence-on-a-young-sir...

1•breve•2m ago•0 comments

Two more public disclosures, it will never stop

https://deadeclipse666.blogspot.com/2026/05/two-more-public-disclosures-it-will.html

1•Animux•2m ago•0 comments

Fuck You, Bambu Lab. Go Ahead, Sue Us

https://gamersnexus.net/fk-you-bambu-lab

2•pabs3•6m ago•0 comments

Anything that is underneath the cursor gets fed into Google's surveillance AI

https://mastodon.social/@mcc/116563821063587689

3•doener•9m ago•0 comments

RealtimePokerCalculator

https://www.mik.lt/RealtimePokerCalculator.zip

1•reztart•9m ago•0 comments

The Walled Garden of the Surveilled Web

https://kirill.korins.ky/articles/the-walled-garden-of-the-surveilled-web/

1•catap•9m ago•0 comments

Snack giant switches to black and white packaging as Iran war hits ink supplies

https://www.bbc.com/news/articles/c78k405j8pdo

1•breve•10m ago•0 comments

Show HN: Dexgram – Telegram to Codex Desktop Bridge for Windows

https://github.com/yashau/dexgram

1•yashau•11m ago•0 comments

Not so dusty: How tech is changing woodworking

https://www.bbc.com/news/articles/c747n11933eo

1•breve•12m ago•0 comments

Waymo recalls U.S. robotaxi fleet after vehicle swept away in flood

https://www.expressnews.com/business/article/waymo-recall-san-antonio-flood-22254607.php

1•zzzeek•13m ago•1 comments

OpenAI Trial – Greg Brockman's Journal

https://www.wsj.com/tech/musk-openai-trial-greg-brockman-diary-journal-6950270e

1•ilarum•16m ago•0 comments

Could At-Home Brain Stimulation Reduce Psychiatry's Reliance on S.S.R.I.s?

https://www.nytimes.com/2026/04/28/health/depression-at-home-brain-stimulation-fda.html

2•bookofjoe•16m ago•1 comments

Open source rule based guardrails for coding agents

https://github.com/falcosecurity/prempti/tree/main

1•knoxa2511•17m ago•0 comments

America is experiencing a productivity miracle

https://www.economist.com/finance-and-economics/2026/05/11/america-is-experiencing-a-productivity...

1•mackmcconnell•24m ago•0 comments

Turritopsis Dohrnii

https://en.wikipedia.org/wiki/Turritopsis_dohrnii

1•thelastgallon•25m ago•0 comments

Loading/running every LLM with 4M ctx in 3 clicks

https://old.reddit.com/r/Hugston/comments/1tbgrbb/4_million_ctx_for_every_ai_llm_model/

1•trilogic•26m ago•0 comments

DuckDB Quack Announcement [video]

https://www.youtube.com/watch?v=RQBhuL9Ve8g

1•fredguth•29m ago•0 comments

The Unmet Needs Index

https://www.convoke.bio/blog/introducing-the-unmet-needs-index

3•ray__•32m ago•0 comments

How AI Is Making Us All Dumber [video]

https://www.youtube.com/watch?v=eSABedBwZjQ

2•mooreds•33m ago•0 comments

All the demons hiding in your AIs

https://drtompollak.substack.com/p/all-the-demons-hiding-in-your-ais

1•gmays•33m ago•0 comments

Companies start getting tariff refunds after Supreme Court decision

https://www.cnbc.com/2026/05/12/trump-tariff-refunds.html

2•tcp_handshaker•34m ago•0 comments

Apple will soon start using AI-generated presenters on its Sales Coach app

https://9to5mac.com/2026/05/12/apple-will-soon-start-using-ai-generated-presenters-on-its-sales-c...

1•cdrnsf•35m ago•0 comments

Twin brothers wipe 96 government databases minutes after being fired

https://arstechnica.com/tech-policy/2026/05/drop-database-what-not-to-do-after-losing-an-it-job/

4•jnord•35m ago•1 comments

The revolt against I-Ready: Private equity-backed education software faces fury

https://www.nbcnews.com/news/education/iready-school-software-faces-parent-teacher-student-fury-r...

1•Umofomia•37m ago•0 comments

I Bought a "Junk" PSP from Japan: Here's How It Went

https://gardinerbryant.com/i-bought-a-junk-psp-from-japan-heres-how-it-went/

1•Kate0CoolLibby•37m ago•0 comments

Subvert: The music platform owned by its community

https://www.subvert.fm/

1•vectordust•38m ago•0 comments

Preview bill is now available

https://copilot-billing-preview.github.com/

1•predkambrij•40m ago•0 comments

Empathy as Principal Computation Substrate

1•mimoos•46m ago•0 comments

Two Thousand Line educational operating system released by Cornell University

https://github.com/yhzhang0128/egos-2000

3•argosopentech•50m ago•1 comments

DeepSeek V4's indexer dies at 65K. We got it to 1M on 6GB

https://arxiv.org/abs/2605.02568

5•OsamaJaber•51m ago•0 comments