frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Grok 4.3

https://docs.x.ai/developers/models/grok-4.3
63•simianwords•1h ago

Comments

simianwords•1h ago
https://artificialanalysis.ai/models/grok-4-3
BoorishBears•1h ago
What an exciting game we're playing, where the most popular leaderboard is completely made up and the stakes are in the trillions.
nextaccountic•1h ago
This puts Sonnet 4.6 above Opus 4.6 in the coding index.. kinda hard to trust those numbers.

(Also it puts Opus 4.7 universally above Opus 4.6, and I may be wrong but this doesn't seem to match the experience of most/many/some people. I think it's widely recognized that Anthropic is severely lacking compute and Opus 4.7 is a costs saving measure)

manmal•22m ago
Anthropic themselves have (had?) this thing where Opus is used for planning and Sonnet for coding.
Alifatisk•56m ago
Does numbers don't look exciting at all? I may have gotten spoiled by releases from Qwen, Kimi and Z.ai who keep closing the gap between closed weight SOTA models and open weight. From my experience, Grok is only useful for one thing, and that's looking up things for you and gathering a consensus on topics. That's it.

Update, I noted that Grok 4.3 is in the "Most attractive quadrant", that's cool! It is also in the top 5 highest in "AA-Omniscience Index", good! Really good.

progbits•55m ago
What's with the charts and numbers?

It says #1 for speed but then in the chart it's #2. Also says #10 for intelligence but then it's #7 in the chart.

maz1b•1h ago
I still wish they named it something else, but congratulations to the team on what seems to be a good release!

Pricing is also quite surprising, compared to comparable competitors. I guess they have tons of capacity or really want to bring over more people.

mythz•1h ago
Ok speed (202.7 tok/s) and value (1.25 -> 2.50) look great, with pretty decent intelligence.
pzo•55m ago
The problem with speed is that they usually are very fast for first few weeks and then suddenly much slower. They did such trick when they advertised Grok 4 fast ( dropped from 200 tps to 60tps)
victorbjorklund•45m ago
Wow. That is a big drop.
catcowcostume•41m ago
For the 1000th time, models do not possess Intelligence
MrDrDr•39m ago
Please elaborate.
nesk_•30m ago
Prediction is not intelligence.
mirekrusin•19m ago
Misprediction is?
exe34•29m ago
What does intelligence mean to you?
kuboble•22m ago
I don't remember the source of the quote.

But debating whether the models are intelligent is slim to debating whether a car can walk.

You can offload to the model a lot of work that until recently we thought requires intelligence. The more and better of those tasks the model can do, it's fair to call it intelligence*

Imustaskforhelp•58m ago
Pelican riding a bike here: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

(ran this on arena.ai direct chat and also tried to write this gist inspired by how simon writes his gists about pelicans)

Edit: just realized that I made pelican riding a bike instead of bicycle, which now makes sense as to why it hardened the bicycle to look tankier, going to compare this with pelican riding a bicycle if anybody else shares the pelican riding a bicycle.

gchamonlive•54m ago
https://simonwillison.net/2025/Nov/13/training-for-pelicans-...

You should probably come up with variations, like a beaver riding a scooter or something, just so see what's what :)

Imustaskforhelp•38m ago
Thanks I have generated both

beaver riding a scooter: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

pelican riding a bicycle: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

Personal opinion but the beaver one looks especially bad as compared to pelicans. Can we be for sure that this model of grok-4.3 hasn't been trained on pelican. Simonw in blog-post says that he will try with other creatures so I hope he does that but it does feel to me as the model/xAI is trying to cheat, Hope Simonw tests it out more.

Edit: Also added turtle riding a scooter, something which literally has images online or heck even teenage mutant ninja turtles and I thought that it would be able to pass this but it wasn't even able to generate this: https://gist.github.com/SerJaimeLannister/f6de26bd0d0817e056...

This literally looks more avocado than turtle. Perhaps this could be a bug from arena.ai or something else too, not sure but at this point waiting for simon's analysis.

gchamonlive•16m ago
We can never be sure of course, but I think this is a very strong indication that pelican riding a bike is indeed going into the training dataset.

Thanks for generating those!

ragchronos•58m ago
When looking at the benchmarks, this model seems to be really close to Kimi K2.6 in terms of intelligence and pricing, hitting that sweet spot. It does also have a higher AA-Omniscience index, which is something kimi and other open models lack in. Curious to see how pleasant it is to use.
alfiedotwtf•42m ago
I’ll eat my hat if it even comes close to Kimi
mirekrusin•18m ago
How would you like it? Well done?
alyxya•55m ago
Despite their attrition, this combined with their cursor partnership is likely going to make them competitive in coding agents soon.
netdur•53m ago
In court vs openai, Musk said Grok is partly trained on openai models, so it should be somehow similar to Chinese models in terms of performance and cost!
OtherShrezzing•51m ago
The tok/s stat is interesting. Since the dominant constraint on inference speed is hardware, it suggests X purchased far more compute than was really needed to serve the demand for their models.

Expensive miscalculation.

flir•1m ago
Didn't a bunch of hardware that was destined for Tesla get redirected to xAI? I'm sure I remember something like that.
artdigital•48m ago
Grok is my favorite model for chatting, and my favorite voice mode. It seems to be the only voice mode that isn't routing to a extremely cheap model (like Haiku), and has been the highest quality out of all the frontier ones. When you subscribe to SuperGrok you can also create a "council" of agents, each with their own system prompt and when you ask something, they will all get asked in parallel to come to a conclusion. Good stuff!

Just wish they would finally put some work into their apps, it's the only thing keeping me from actually subscribing to SuperGrok:

- No MCP / connected apps support. It's been teased but here we are, still not available. I can't connect Grok to anything, so I can't use it for serious work

- Projects are still not available in the app so as soon as you move something into a project, it's gone from all the native apps

- No way to add artifacts (like generated markdown docs) directly to a project, we have to export to PDF/markdown and re-import. And there isn't even a way to export artifacts. This makes serious project work hard because we can't dynamically evolve projects with new information

- No memory, no ability to look up other chats, each chat is completely new

- No voice mode in projects at all

If someone from xAI is reading this, please consider adding some of these.

afpx•34m ago
When I signed up, I accidently paid for a full year. So from time to time, I'll throw it something just to see what it produces compared to the other LLMs. And, even after all this time, it still feels like a really "dumb" model compared to the other frontier ones. But, worse, many of my system prompts make it go wacky and puke jibberish. However it was pretty cool for those couple months awhile back when it was uncensored. You could ask it about a wild conspiracy, and it would actually build the case and link you to legitimite source material. They dropped the hammer down on that real quick.
artdigital•34m ago
I also think Grok would benefit from allowing usage of "SuperGrok Heavy" (their $300 plan) in coding harnesses with included usage. Currently they give you some API credits on the Heavy plan so you can use some Grok for coding, but $300 USD value is just not there.

Not saying they should create their own grok-code harness, just allowing usage in existing ones would already be beneficial. But that's probably what the Cursor acquisition is going to do eventually

walletdrainer•18m ago
> No MCP / connected apps support. It's been teased but here we are, still not available. I can't connect Grok to anything, so I can't use it for serious work

Grok has tool use, no? Why would you also need MCP? What does MCP add?

alfiedotwtf•45m ago
If there was any model I wouldn’t trust, it wouldn’t be the ones from China, it would be the one from Elon Musk
Cthulhu_•32m ago
Thankfully it's not an either / or, I don't trust any models. This is a healthy attitude to have because you shouldn't trust anyone on the internet either, especially when it comes to specific subjects.
khalic•39m ago
This project is a gigantic waste of resources, it’s fine tuned on politics of the CEO, was used for CSAM generation and just sucks overall
spiderfarmer•25m ago
It’s a model made for 36% of Americans. The rest of the world can’t care less.
servo_sausage•15m ago
I like that there are models with divergent politics; the status quo being creepy corporate left silicon valley is not healthy or pleasant to interact with.

Even with grock it's only broadening things to creepy corporate right of silicon valley.

BoredPositron•39m ago
Yay, free tokens. I don't know why but grok always seems good fast in the free token phase and after that degrades.
happosai•38m ago
I lost the trust in them when they added the racist "what about killing of Boers in south Africa" thing to their system prompt.

No way am I going to use a model where the backing has such blatantly obvious brain washing goals.

miroljub•33m ago
Than you shouldn't be using any model from a 5-eyes country.

They all have biases, it's just that you don't like Grok bias, but are fine with Anthropic, OpenAI and Google brain washing.

vrganj•21m ago
There is no non-bias. What you call unbiased is always just a reflection of your personal biases.

That being said, I am definitely against a model that is biased to be following the ideology of a far-right extremist.

Jtarii•19m ago
Musk bought a social media company for the specific purpose of getting Trump elected by turning it into a right wing propaganda machine. Have Anthropic/OpenAI/Google done something similar to that?
sundarurfriend•37m ago
As an English-as-second-language speaker and writer, one thing Grok really shines at is capturing the tone and level of "formality" of a piece of text and the replicating it correctly. It seems to understand the little human subtleties of language in a way the other major providers don't. Chatgpt goes overly stiff and formal sounding, or ends up in a weird "aye guvnor" type informal language (Claude is sometimes better but not always).

Grok seems in general better at being "human" in ways that are hard to define: for eg. if I ask it "does this message roughly convey things correctly, to the level it can given this length", it will likely answer like a human would (either a yes or a change suggestion that sticks to the tone and length), while Chatgpt would write a dissertation on the message that still doesn't clear anything up.

Recently I've noticed that Grok seems to have gotten really good at dictation too (that feature where you click the mic to ask it something). Chatgpt has like 90-95% accuracy with my accent, the speech input on Android's Gboard something like 75%, Grok surprisingly gets something like 98% of my words correct.

djyde•26m ago
I've also noticed that when I communicate with Grok in my native language, its tone is more natural than other models. I think this is due to the advantage of being trained on a large amount of Twitter data. However, as Twitter contains more and more AI-generated content now, I'm afraid continued training will make it less natural.
thunderbong•17m ago
I'm sure Twitter knows which are the bot accounts and is surely excluding them from their model training. Twitter bots aren't a new phenomenon after all.
tornikeo•34m ago
So, we have: - claude for corps and gov - codex for devs - grok for what, roleplay, racism? Those are the two things I've ever heard grok associated with around me.
nsowz•28m ago
Grok is as progressive as any of the other models. Despite some of the highly-publicised fuck-ups, try asking Grok anything racist and see how it replies. Yes, I know you didn't try this and you won’t.
aqme28•21m ago
There is a lot of daylight in between “progressive” and “openly explicitly racist”
nsowz•18m ago
I didn’t say “progressive”; I said “as progressive”.
SanjayMehta•20m ago
100% agree. Grok may or may not be biased one way or the other as far as the US is concerned but from the rest of the world perspective it's mostly the same as any other model trained on Wikipedia.
ndr•28m ago
You should try all of them, then update your opinion about your information sources accordingly.
vrganj•23m ago
Grok for furthering the far-right filter bubble Elon has been hard at work building.
sudb•9m ago
So interestingly, I know of at least one application in a charity that deals with trafficking where grok was happy to do one-shot classification tasks where all other models refused to cooperate.

I think there's a surprising number of actually useful applications in this sort of grey area for a slightly-less guardrailed, near-frontier model (also the grok-fast models are cheap!).

mirekrusin•11m ago
All those plans from providers should be sliders – prepay more, get more in return.

Show HN: A Wheel of Fortune game you can run in the browser

https://spinorama.io/
1•Wolfmans55•3m ago•0 comments

Show HN: Modeleon – Python DSL that compiles to live Excel formulas

https://github.com/modeleonai/modeleon
2•adilkhanovkz•5m ago•0 comments

Detailed New Features of Firebird 5

https://firebirdsql.org/en/community-news/free-book-detailed-features-of-firebird-5-is-published/
1•mariuz•6m ago•0 comments

I'm 17 and built a Palantir alternative for mid-market supply chains

https://risksim.ai/
1•ottocosta•10m ago•0 comments

One Developer, Two Dozen Agents, Zero Alignment

https://maggieappleton.com/zero-alignment
1•maxua•12m ago•0 comments

Om Malik – What Microsoft's 10-Q Says About OpenAI

https://om.co/2026/05/01/what-microsofts-10-q-says-about-openai/
3•rmason•13m ago•0 comments

What Is Z-Angle Memory and Why Is Intel Developing It?

https://www.hpcwire.com/2026/02/05/what-is-z-angle-memory-and-why-is-intel-developing-it/
1•rbanffy•17m ago•0 comments

Crypto Anarchy and Virtual Communities – Tim C May

https://nakamotoinstitute.org/library/virtual-communities/
1•hamiecod•17m ago•0 comments

Greek mountain snow cover halved in past four decades due to regional warming

https://egusphere.copernicus.org/preprints/2026/egusphere-2026-327/
1•littlexsparkee•18m ago•0 comments

Your Company Is a Skill Now

https://twitter.com/arlobish/status/2046270374174925196
1•arlobish•18m ago•0 comments

The Myth That Made Social Psychology Famous

https://www.speakandregret.michaelinzlicht.com/p/social-psychologys-favourite-murder
1•Anon84•19m ago•0 comments

EPR-2 QaaS Blockchain

https://github.com/Babayagga-bite/epr2-qaas-blockchain
1•babayagga-bite•19m ago•1 comments

I Work 10 Hours a Day for Others. and No, I Don't Regret It

https://comuniq.xyz/post?t=1010
1•01-_-•20m ago•0 comments

Show HN: Perfect Bluetooth MIDI for Windows

8•mayerwin•22m ago•0 comments

Show HN: Ghidora build system, An nx/Bazel alternative

https://ghidora.hyperforge.in
1•StellaMary•22m ago•0 comments

When NASA Told Its Astronauts to Quit Smoking (2014)

https://www.smithsonianmag.com/air-space-magazine/when-nasa-ordered-astronauts-quit-smoking-18095...
2•thunderbong•24m ago•0 comments

What's in a name? Dogs or wolves, painted or wild

https://africageographic.com/stories/whats-name-dogs-wolves-painted-wild/
1•altilunium•24m ago•0 comments

New Lithium-Plasma Engine Passes Key Mars Propulsion Test

https://www.universetoday.com/articles/new-lithium-plasma-engine-passes-key-mars-propulsion-test
1•rbanffy•25m ago•0 comments

Chinese Courts Rule Companies Cannot Fire Workers Simply to Replace Them with AI

https://www.caixinglobal.com/2026-04-30/chinese-courts-rule-companies-cannot-fire-workers-simply-...
3•virgildotcodes•31m ago•0 comments

The Rotary Un-Smartphone

https://skysedge.com/telecom/RUSP/index.html
5•tzury•35m ago•0 comments

Gremlin

https://en.wikipedia.org/wiki/Gremlin
2•jumploops•35m ago•0 comments

The Ethiopian Running Secret

https://aeon.co/essays/what-ethiopian-running-says-about-the-limits-of-human-ability
2•rifish•38m ago•0 comments

Learning my lesson that Python virtual environments aren't always movable

https://utcc.utoronto.ca/~cks/space/blog/python/VenvsNotEntirelyMovable
1•ingve•40m ago•0 comments

Raspberry Pi: A Foundation Model in Your Pocket – Colossus

https://colossus.com/article/raspberry-pi-eben-upton/
1•rbanffy•43m ago•0 comments

Meta Just Killed Open-Source AI

https://www.utkarshapoorva.com/writing/meta-llama-trap/
1•utkarsh_apoorva•46m ago•0 comments

RexIDE now has minimal "integration" with Codex App [video]

https://www.youtube.com/watch?v=K3j8ydsLfWs
1•tomerbd•51m ago•0 comments

Xmemory: Benchmarking Structured AI Memory Against RAG and Hybrid RAG

https://arxiv.org/abs/2604.27906
1•alex_petrov•58m ago•0 comments

Ukraine Overtakes US, 6 EU Countries in Press Freedom Index

https://kyivindependent.com/ukraine-overtakes-united-states-on-press-freedom/
2•dgellow•1h ago•0 comments

Advanced Quantization Algorithm for LLMs

https://github.com/intel/auto-round
1•lastdong•1h ago•0 comments

From Taxman to VATmiraal: Fifty Years of Teaching Machines the Law

https://vatmiraal.be/blog/from-taxman-to-vatmiraal
1•triska•1h ago•1 comments