Run interactive commands in Gemini CLI

https://developers.googleblog.com/en/say-hello-to-a-new-level-of-interactivity-in-gemini-cli/

86•ridruejo•6d ago

Comments

nacs•3h ago

It's nice that they mention node-pty that does most of the heavy lifting for the terminal/pseudo-tty that powers this (VSCode's terminal emulator is powered by the same library).

It looks like they've added a layer on top of node-pty to allow serializing/streaming of the contents to the terminal within the mini-terminal viewports they're allocating for the terminal rendering. I wonder if they're releasing that portion as open source?

amitav1•3h ago

I think that this feature might have taken Gemini CLI from just Temu Claude Code with higher usage limits, to actually competitive as a tool. It'll be interesting to see how well this actually works in practice.

adastra22•2h ago

Idk, the more skilled I get with Claude Code, the less I use interactive workflows.

anp•1h ago

I tend to agree but there are a few scenarios where I really want it to work. Debuggers in particular seem hard to get right for the current agents. I’ve not been able to get the various MCP servers I’ve tried to work, I’ve struck out using the debug adapter protocol from agent-authored python. The best results I’ve gotten are from prompting it to run the debugger under screen, but it takes many tool calls to iterate IME. I’m curious to see how gemini cli works for that use case with this feature.

rockwotj•31m ago

I would love to use gdb through an agent instead of directly. I spend so much time looking up commands and I sometimes skip things because I get impatient stepping over the next thing

efskap•3h ago

To be clear, the LLM is only aware of the final state of the ptty when the command exits, right? It's not a TUI computer-use model at this point from what I can tell.

jgoodhcg•3h ago

Popping into nvim to check on something really quick seems immediately useful. I think I'll still want a dedicated tab or different terminal app to have my longer lived editor open but this might be nice for validating output with test runners or checking on a database entry in psql or something.

nacs•3h ago

I'm not sure how usable neovim will be in what looks to be a 6 line high window as they show in the demo video.

mark_l_watson•2h ago

I had problems running Emacs in no window mode (emacs -nw). I should try again, or maybe just use vim.

jasonjmcghee•3h ago

I've had a pretty poor experience with Gemini.

I've had to convince it to do things it should just be able to do but thinks it can't for some reason. Like reading from a file outside of the project directory- it can do it fine, but refuses to unless you convince it that no it actually can.

Also has inserted "\n" instead of newlines on a number of occasions.

I'd argue these behaviors are much more important than being able to use interactive commands.

Liebmann5•3h ago

I second this man’s take. I’ve been using it consistently for a few months to give it a try and is definitely subpar. It can give really good answers at times however isn’t worth the time, energy, or luck to get it there.

ChrisGreenHeur•1h ago

I have had these exact issues a lot with codex (gpt-5-codex)

sunaookami•1h ago

Gemini doesn't seem to be trained on tool use (which Claude is) so it quiet often thinks it can't do something it certainly can and does a lot of nonsense. For me it fails nearly everytime while it's trying to read project files because it uses relative paths instead of absolute so I've put "For your "ReadFile" and "WriteFile" tool, you MUST use absolute paths to files" in my system instructions.

Speaking of system instructions, Gemini always forgets them or doesn't follow them. And it still puts code comments nearly everywhere, it drives me nuts.

Codex is much better at following system instructions but the CLI is..... very bad.

theshrike79•1h ago

Codex doesn’t give feedback while it’s running. It just works quietly in a way that’s not easy to interrupt if you could see it going off the rails.

Claude is better at this.

lxgr•30m ago

Gemini seems to have a poor model of both what it can and what it is allowed to do.

I’ve noticed the latter with several image generation refusals I could eventually easily talk them out of (usually by mentioning fair use in a copyright/trademark context).

orliesaurus•2h ago

As someone who has been experimenting with AI ‑powered command‑line helpers, I think adding interactive commands to the Gemini CLI is a logical step, but it won’t be useful unless the underlying model is reliable for basic tasks. Several people here noted that Gemini sometimes refuses to read files outside the project directory or mishandles newlines; those sorts of inconsistencies undermine trust.

In a world where you have 100 options, trust is of utmost importance. The CLI’s integration with node‑pty and the ability to stream pseudo‑tty output into mini‑terminal viewports is clever, and I’d love to see that layer documented or open‑sourced so other tools can build on it. I see this feature as something you’d use for short‑lived tasks like running a quick script, checking a log, or doing a one‑off database query. For longer editing sessions I’d still use a real terminal multiplexer and editor. If Google can fix the reliability issues and make the API for interactive sessions open, that would be hella good for everyone!

coderatlarge•2h ago

i’ve had little luck getting ai systems to correctly set up networking for a set of vms. they tend to go round and round with ip tables commands that don’t ultimately solve the problem. is config fundamentally harder than writing code ?

theshrike79•1h ago

Did you give them a way to check the networking rules?

If not, the model is just shooting in the dark and guessing.

caymanjim•1h ago

The best thing about this is that now Claude and Codex have to add it.

theshrike79•1h ago

I’m still waiting for Gemini to add hooks and sub-agents

selvan•1h ago

From the blog " Gemini CLI spawns a new process within a pseudo-terminal in the background, leveraging the node-pty library...So how does this virtual terminal running in the background show up on your screen? Think of it like a video stream. Our new serializer takes a snapshot of the pseudo terminal at every moment—capturing every piece of text, every color, and even the cursor's position. These snapshots are then streamed to you, allowing you to see and interact with the terminal application in real-time. It's not just a stream of text; it's a live feed."

Terminal serializer code: https://github.com/google-gemini/gemini-cli/blob/main/packag...

Uses @xterm/headless npm package.

ffsm8•1h ago

Your link 404s

selvan•42m ago

Thanks. Fixed it.

baalimago•43m ago

How much tokens does it eat up? Does the context stay concise? Who owns these "serializations" that's uploaded to google all the time?

prodigycorp•29m ago

I've gotten Claude Code to interact with the cli by using tmux. Just tell Claude to use tmux to interact with the shell. It'll use `capture pane` and `send-keys`, to view and control the interface. I t works quite well, and you can monitor the session by running `tmux attach -t <session>`.

I've gotten Claude to run a nested Claude instance this way. One of the dumbest laughs I've had was by "pranking" main Claude to think that child Claude ran `rm -rf` on the entire repo we were working on. The thing had a virtual panic attack.

thallavajhula•18m ago

Aside: The demo shows git commands being run in the CLI. I absolutely hate it when devs use a commit message that says "chore: my first commit from gemini cli" - I get that it's meant for the demo, but in general too, I've seen codebases that enforce these commit prefixes such as "chore", "feat", "bugfix" etc. Is there any real value to that? Besides wasting up the 50 character limit on the first line of the commit message, I don't see anything else being done including those. Also, non-imperative commit messages?! Come on, guys!

ilikepi•9m ago

> I've seen codebases that enforce these commit prefixes such as "chore", "feat", "bugfix" etc. Is there any real value to that?

It's a choice some teams make, presumably because _they_ see value in it (or at least think they will). The team I'm on has particular practices which I'm sure would not work on other teams, and might cause you to look at them with the same incredulity, but they work for us.

For what it's worth, the prefixes you use as examples do arise from a convention with an actual spec:

https://www.conventionalcommits.org/en/v1.0.0/

arjie•18m ago

It was very buggy for me. You kind of have to coax it into interactive use and then some of the time it got stuck pondering once I exited the app flow and returned to the Gemini CLI (not with Ctrl-F, full exit, it closes the TUI window). It's also super laggy.

To be honest, at this point having Claude Code monitor the output of a `tmux pipe-pane` is probably going to be superior.

VST3 audio plugin format is now MIT

Google flags Immich sites as dangerous

Run interactive commands in Gemini CLI

Scripts I wrote that I use all the time

Willow quantum chip demonstrates verifiable quantum advantage on hardware

Sodium-ion batteries have started to appear in cars and home storage

Accessing Max Verstappen's passport and PII through FIA bugs

Ovi: Twin backbone cross-modal fusion for audio-video generation

JMAP for Calendars, Contacts and Files Now in Stalwart

Why SSA Compilers?

Play abstract strategy board games online with friends or against bots

Karpathy on DeepSeek-OCR paper: Are pixels better inputs to LLMs than text?

The first interstellar software update: The hack that saved Voyager 1 [video]

Glasses-free 3D using webcam head tracking

The mild mannered Englishman who was the most prolific ghost hunter

Element: setHTML() method

Rivian's TM-B electric bike

Derek Sivers's database and web apps

Speculations on arenas and non-trivial destructors

Common yeast can survive Martian conditions

When You Get to Be Smart Writing a Macro

LibCube: Find new sounds from audio synths easier

HP SitePrint

InpharmD (YC W21) Is Hiring – NLP Engineer

Show HN: Cuq – Formal Verification of Rust GPU Kernels

VortexNet: Neural network based on fluid dynamics

I see a future in jj

Criticisms of “The Body Keeps the Score”

YASA beats own power density record pushing electric motor to 59kW/kg benchmark

I, Sharpie

Run interactive commands in Gemini CLI

Comments

VST3 audio plugin format is now MIT

Google flags Immich sites as dangerous

Run interactive commands in Gemini CLI

Scripts I wrote that I use all the time

Willow quantum chip demonstrates verifiable quantum advantage on hardware

Sodium-ion batteries have started to appear in cars and home storage

Accessing Max Verstappen's passport and PII through FIA bugs

Ovi: Twin backbone cross-modal fusion for audio-video generation

JMAP for Calendars, Contacts and Files Now in Stalwart

Why SSA Compilers?

Play abstract strategy board games online with friends or against bots

Karpathy on DeepSeek-OCR paper: Are pixels better inputs to LLMs than text?

The first interstellar software update: The hack that saved Voyager 1 [video]

Glasses-free 3D using webcam head tracking

The mild mannered Englishman who was the most prolific ghost hunter

Element: setHTML() method

Rivian's TM-B electric bike

Derek Sivers's database and web apps

Speculations on arenas and non-trivial destructors

Common yeast can survive Martian conditions

When You Get to Be Smart Writing a Macro

LibCube: Find new sounds from audio synths easier

HP SitePrint

InpharmD (YC W21) Is Hiring – NLP Engineer

Show HN: Cuq – Formal Verification of Rust GPU Kernels

VortexNet: Neural network based on fluid dynamics

I see a future in jj

Criticisms of “The Body Keeps the Score”

YASA beats own power density record pushing electric motor to 59kW/kg benchmark

I, Sharpie