Ask HN: What Happened to xAI?

4•zof3•2h ago

All the other big-name model providers have been pushing stuff out the door steadily; what’s happened to xAI? Their models were never the best, but now the competition is wiping the floor with them, for coding work at least.

Is anyone here actively using a Grok model?

The SpaceX merger just happened; is that part of why they haven’t released anything in the last 5 months?

Comments

nopurpose•2h ago

Not a user, but in what sense are they getting wiped on the floor? 4th place on LMArena looks solid: https://huggingface.co/spaces/lmarena-ai/arena-leaderboard

uyzstvqs•1h ago

Grok 4.20 is currently in beta and ranking #4 on arena.ai for text generation, only beaten by Claude Opus 4.6 and Gemini 3.1 Pro. Grok 4.1 Thinking is at #9, still a top model. Grok Imagine is consistently in the top 10 for image and video generation, ranking #1 for Image-to-Video.

If you meant code generation, Grok has never really been a top model for that purpose. Claude remains the undefeated king there.

https://arena.ai/leaderboard

oren1531•1h ago

Grok's value was never really about model quality - it was the only model with real-time access to what's actually being said on X. And it's less filtered than the others, which matters for certain topics where ChatGPT/Claude will just refuse or hedge. Those are two genuinely unique things. But they're sitting on both advantages without shipping anything meaningful, and competitors will find workarounds eventually. The SpaceX merger adding organizational noise right now seems like the worst possible timing.

muzani•1h ago

Grok was the best every now and then but they committed the capital crime of trying to charge 50% more than the competitors who were only 20% or so worse.

There's kind of this weird thing in Silicon Valley where CEOs and investors keep telling people to raise prices! The definition of a moat is you can raise prices!!!

But these folks usually don't have the experience of being broke or saving for retirement and such. They don't understand that an extra $10/month is a heck of a lot of money. And so nobody really wants to try it, which has a dampening effect on the virality.

ej31•1h ago

5 months is nothing. we launched a rocket twice in that time ngl

downbad_•1h ago

I find Grok to be better than OpenAI (ChatGPT)
