Ask HN: What Happened to xAI?

4•zof3•2h ago
All the other big-name model providers have been pushing stuff out the door steadily; what’s happened to xAI? Their models were never the best, but now they are getting the floor wiped with them, for coding work at least.

Is anyone here actively using a Grok model?

The SpaceX merger just happened. Is that part of why they haven’t released anything in the last 5 months?

Comments

nopurpose•2h ago
Not a user, but in what sense are they getting wiped? 4th place on LMArena looks solid: https://huggingface.co/spaces/lmarena-ai/arena-leaderboard
uyzstvqs•1h ago
Grok 4.20 is currently in beta and ranks #4 on arena.ai for text generation, beaten only by Claude Opus 4.6 and Gemini 3.1 Pro. Grok 4.1 Thinking is at #9, still a top model. Grok Imagine is consistently in the top 10 for image and video generation, ranking #1 for Image-to-Video.

If you meant code generation, Grok has never really been a top model for that purpose. Claude remains the undefeated king there.

https://arena.ai/leaderboard

oren1531•1h ago
Grok's value was never really about model quality - it was the only model with real-time access to what's actually being said on X. And it's less filtered than the others, which matters for certain topics where ChatGPT/Claude will just refuse or hedge. Those are two genuinely unique things. But they're sitting on both advantages without shipping anything meaningful, and competitors will find workarounds eventually. The SpaceX merger adding organizational noise right now seems like the worst possible timing.
muzani•1h ago
Grok was the best every now and then but they committed the capital crime of trying to charge 50% more than the competitors who were only 20% or so worse.

There's kind of this weird thing in Silicon Valley where CEOs and investors keep telling people to raise prices! The definition of a moat is you can raise prices!!!

But these folks usually don't have the experience of being broke or saving for retirement and such. They don't understand that an extra $10/month is a heck of a lot of money. And so nobody really wants to try it, and that has a dampening effect on the virality.

ej31•1h ago
5 months is nothing. We launched a rocket twice in that time ngl
downbad_•1h ago
I find Grok to be better than OpenAI (ChatGPT)

Ask HN: Remember Fidonet?

90•ukkare•4h ago•57 comments

Ask HN: What Are You Working On? (March 2026)

278•david927•1d ago•1051 comments

The Architecture of an Exit Scam: A Technical Audit of Zszrun

4•cappyfjao•4h ago•0 comments

Ask HN: Since a week HN keeps logging me off every few days, why?

5•epolanski•5h ago•1 comments

Ask HN: How to be alone?

662•sillysaurusx•2d ago•545 comments

Ask HN: What AI content automation stack are you using in 2026?

2•jackcofounder•6h ago•2 comments

Ask HN: Please restrict new accounts from posting

700•Oras•1d ago•493 comments

Ask HN: Can I repurpose a Bluetooth voice remote as input device for a PC?

15•albert_e•2d ago•20 comments

Ask HN: Most beautiful personal blog UI you have ever seen?

133•ms7892•1d ago•53 comments

Ask HN: Let's rethink the architecture and future of Emacs

3•kurouna•2h ago•1 comments

Ask HN: Do you still run Redis and workers just for background jobs?

2•sergF•7h ago•6 comments

Why is GPT-5.4 obsessed with Goblins?

10•pants2•10h ago•6 comments

Tell HN: I'm 60 years old. Claude Code has re-ignited a passion

1063•shannoncc•3d ago•970 comments

Ask HN: Is GitHub getting less reliable, or is it just me?

9•_pdp_•18h ago•5 comments

Ask HN: Favorite Non-Spammy iPhone Games?

4•bix6•13h ago•3 comments

Why is email so resilient as a technology?

6•noemit•5h ago•5 comments

Ask HN: What game engine would you recommend for vibe coding?

5•general_reveal•14h ago•4 comments

Ask HN: Read‑only LLM tool for email triage and knowledge extraction?

2•maille•16h ago•3 comments

Ask HN: Any informed guesses on the actual size/architecture of GPT-5.4 etc.?

4•dsrtslnd23•16h ago•0 comments

Code-review-graph: persistent code graph that cuts Claude Code token usage

2•tirthkanani•20h ago•0 comments

Ask HN: Who Needs Help?

14•surprisetalk•1d ago•13 comments

A job ad for Agentic AI Advocate

4•greenpinia•1d ago•1 comments

Ask HN: Are showlang and thelang HN endpoints not being maintained?

4•freakynit•1d ago•1 comments

I replaced my freelance SaaS stack with 5 single-file HTML tools

7•AnnSri•2d ago•3 comments

Ask HN: Which book are you reading these days?

6•chistev•19h ago•18 comments

Ask HN: Anyone else feel this community has changed recently?

57•kypro•3d ago•30 comments

OpenAI might end up on the right side of history

12•shoman3003•1d ago•10 comments

Ask HN: How are you handling persistent memory across local Ollama sessions

5•null-phnix•2d ago•0 comments

All tmux sessions as a single terminal

2•lygten•1d ago•1 comments

Whisker – Self hosted e-commerce cart, pure PHP, zero dependencies

7•eLohith•2d ago•3 comments