The login process didn't work well, but it's due to my browser having a seperate container for Github.
I asked it to add a feature using Stimulus.js to my very simple gallery page written in Rails, when you hover over an image in the gallery, it should make an icon appear, and when clicking the icon, a modal should pop-up that allows the user to do cropping.
It picked up on my database models and it produced a very good result, however, it seemed to have issues verifying its solution, it particularly seemed to struggle with "on-hover".
The solution it produced was a lot better than what claude-code produced, and I liked the interface.
I also ended up hitting the limits on the Anthropic API, and it wasn't obvious to me what I should do in that case, but that's likely not the fault of Tidewave, and the Context Window Usage having a maximum of 200.0k also seemed very high, but that probably contributed to me hitting a limit.
We are going to introduce more visibility into the context limit and make it easier to summarize.
Regarding the on-hover, do you still have the snippets around that it tried? We instruct the model to dispatch events using the proper browser APIs and I wonder what it tried to do. I would love to hear more! You can ping me on Tidewave's Discord (https://discord.gg/5GhK7E54yA) or send me an email (see github.com/josevalim).
Usually only one correct and idiomatic way to do things, and rails in particular really leans on convention over configuration.
Regarding next steps, we're currently working on React integration, with Python and JavaScript server-side web frameworks coming next. If you're interested, here's our survey/waiting list: https://forms.gle/8MeXwGjpBFDeGNQw9
Our announcement post goes more into detail and explains why moving the agent to the browser means you (the developer) can spend more time out of the loop: https://tidewave.ai/blog/tidewave-web-phoenix-rails
This would be such a game-changer. I am also convinced that monorepos will become the de facto standard, since it is way easier for LLMs to navigate / implement features across the whole stack.
Question: What happens to our data - i.e. the code and context sent to your service?
Is this because data is going through a tidewave server or something, or is it just a way to create a bit of a free trial vs "now you need to pay us"?
awongh•1h ago
To me this is an obvious direction for all coding agents to go in. Right now the cursor chrome browser MCP server doesn't work very well, but it's very obvious to me that this is the direction things need to go in.
I'm not sure why focusing on a framework would fundamentally make this any better than the generic approach to getting good context directly from the browser.
towhans•1h ago
I can imagine having an in browser, framework level tool would know exactly which controller and which template generated this element and could target it more precisely while using less tool calls and less tokens. I haven't tested it yet but that is my expectation.
josevalim•1h ago
1. We annotate and understand your template language. So when you select an element, we not only know its tag and attributes, but we also know which template file rendered it. The more structured the template language is (JSX, HEEx, etc), the better job we can do, which means LLMs don't have to guess.
2. We also run code within the web framework. This means database access, built-in documentation using the exact versions in your project, access to language and framework reflection/introspection APIs, etc. So the agents have more, err, agency.
I hope this clarifies it a bit!