frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Judge Detested by Trump Will Oversee Epstein Files Case

https://newrepublic.com/post/198937/judge-chutkan-trump-hates-epstein-files-case
1•treetalker•23s ago•1 comments

Google's AI model Gemini got its name (2024)

https://blog.google/technology/ai/google-gemini-ai-name-meaning/
1•alhazrod•1m ago•0 comments

Just released NanoCoder CLI – open-source agentic coding, local-first

https://github.com/Mote-Software/nanocoder
1•mrspence•1m ago•0 comments

Show HN: Selfhostllm.org – Plan GPU capacity for self-hosting LLMs

https://selfhostllm.org/
1•erans•5m ago•0 comments

My commitment to you and our company

https://newsroom.intel.com/corporate/my-commitment-to-you-and-our-company
1•rntn•7m ago•0 comments

Word Chains

https://petargyurov.com/2025-06-22/word-chains
1•petargyurov•10m ago•0 comments

Someone keeps stealing, flying, fixing and returning this man's plane. But why?

https://www.latimes.com/california/story/2025-08-08/mystery-plane-thief
2•MBCook•10m ago•0 comments

Quacking Performance: DuckDB

https://mackle.io/posts/quacking-performance-duckdb/
1•tsaifu•10m ago•0 comments

Why firms are merging HR and IT departments

https://www.bbc.com/news/articles/cy0w8gvq84xo
1•sizesandsevens•12m ago•2 comments

A Letter to the CalyxOS Community

https://calyxos.org/news/2025/08/01/a-letter-to-our-community/
2•walterbell•14m ago•0 comments

ChatGPT users are not happy with GPT-5 launch

https://www.techradar.com/ai-platforms-assistants/chatgpt/chatgpt-users-are-not-happy-with-gpt-5-launch-as-thousands-take-to-reddit-claiming-the-new-upgrade-is-horrible
2•eagleislandsong•14m ago•1 comments

Entering Technical Debt's ZIRP Era

https://worksonmymachine.ai/p/entering-technical-debts-zirp-era
2•Stwerner•15m ago•0 comments

Princeton Eliminates Tuition for Families Making $250k a Year

https://www.bloomberg.com/news/articles/2025-08-08/princeton-eliminates-tuition-for-families-making-250-000-a-year
2•toomuchtodo•16m ago•1 comments

FOIA Lawsuit from 2017 exposes 170-page index to EpsteinFiles

https://www.bloomberg.com/news/newsletters/2025-08-08/here-s-a-look-at-what-the-fbi-s-epstein-files-would-reveal
6•ck2•17m ago•2 comments

Magnus Carlsen Commentates Grok vs. OpenAI Finale [video]

https://www.youtube.com/watch?v=vtHfJ6iYyEY
1•mbowcut2•20m ago•0 comments

A Conceptual Framework for Leveraging AI

https://journals.sagepub.com/doi/10.1177/15234223251335908
1•rmyawson•21m ago•0 comments

All known 49-year-old Apple-1 computer

https://www.apple1registry.com/en/list.html
1•elvis70•22m ago•0 comments

Show HN: Potty – A Python CLI tool to download Spotify music using yt-dlp

https://github.com/Ssenseii/spotify-yt-dlp-downloader
1•ssenssei•22m ago•0 comments

Months building this did I just reinvent the wheel?

https://landing.owlbos.com/
2•sarkarsh•25m ago•2 comments

Chatbots Go on a Delusional Spiral

https://www.nytimes.com/2025/08/08/technology/ai-chatbots-delusions-chatgpt.html
1•dougdonohoe•25m ago•0 comments

Why the Far Right Hates Churchill

https://www.wsj.com/politics/why-the-far-right-hates-churchill-20fdc710
3•throwanem•25m ago•0 comments

I Am Too Young to Be a Wife Now

https://etechx.co.ke/i-am-too-young-to-be-a-wife-now
1•Manyi•27m ago•0 comments

Algorithm and Blues: How Streaming Discourages Active Listening

https://www.harmonic.fm/blog/algorithm-blues
2•coloneltcb•29m ago•0 comments

Byte Buddy is a code generation and manipulation library for Java

https://bytebuddy.net/
1•mooreds•30m ago•0 comments

Model Evaluation

https://ampcode.com/news/model-evaluation
2•pbardea•30m ago•0 comments

Adding limestone to farmland boosts carbon capture and crop yields

https://phys.org/news/2025-08-adding-limestone-farmland-boosts-carbon.html
1•bikenaga•31m ago•1 comments

I clustered four Framework Mainboards to test LLMs

https://www.jeffgeerling.com/blog/2025/i-clustered-four-framework-mainboards-test-huge-llms
14•bobajeff•35m ago•3 comments

The Best Companies Are Dictatorships

https://writing.nikunjk.com/p/the-best-companies-are-dictatorships
2•whatatimeline•36m ago•1 comments

Hodgkin-Huxley Model

https://www.fabriziomusacchio.com/blog/2024-04-21-hodgkin_huxley_model/
2•almost-exactly•37m ago•0 comments

One billion-year-old rules of protein stability revealed

https://phys.org/news/2025-07-billion-year-protein-stability-revealed.html
2•PaulHoule•38m ago•0 comments
Open in hackernews

Google's Genie is more impressive than GPT5

https://theahura.substack.com/p/tech-things-genies-lamp-openai-cant
120•theahura•3h ago

Comments

SV_BubbleTime•1h ago
Geez… Make me pick between trusting Google, or trusting OpenAI… I’ll go with Anthropic.
sirbutters•1h ago
Honestly, same. Anthropic CEO radiates good vibes.
tekno45•1h ago
yay! security and privacy are just VIBES!!!
wagwang•59m ago
The anthropic CEO dooms all day about how AI is going to kill anyone and yet works on frontier models and gives them agentic freedom.
thegrim33•1h ago
I like how we've just collectively forgotten about the absolutely disastrous initial release of Gemini. Were the people responsible for that fired? Are they still still there making decisions? Why should I ever trust them and give them a second chance when I could just choose to use a competitor that doesn't have that history?
rvnx•1h ago
We did not forget this scam that was Google Bard, but still, it is the past now
echelon•1h ago
I know this is sarcasm, but a misstep like this by OpenAI will harm their future funding and hiring prospects.

They're supposed to be in the lead against a company 30x their size by revenue, and 10,000x their might. That lead is clearly slipping.

Despite ChatGPT penetration, it's not clear that OpenAI can compete toe to toe with a company that has distribution on every pane of glass.

While OpenAI has incredible revenue growth, they also have incredible spending and have raised at crazier and crazier valuations. It's a risky gamble, but one they're probably forced to take.

Meanwhile, Meta is hiring away all of their top talent. I'll bet that anyone that turned down offers is second guessing themselves right now.

raincole•1h ago
So where can I try out Genie 3? Did the author try it out?

If not it's just vibe^2 blogging.

password54321•1h ago
Basically free advertising for something not released.
echelon•1h ago
Genie 3 just had the Sora treatment.

Lots of press for something by invitation only.

This probably means it takes an incredible amount of resources to power in its current form. Possibly tens of H100s (or TPUs) simultaneously. It'll take time to turn that from a wasteful tech preview into a scaleable product.

But it's clearly impressive, and it did the job of making what OpenAI did look insignificant.

beepbooptheory•1h ago
> The goal of AGI is to make programs that can do lots of things.

Wait, is it?

rvnx•1h ago
We reached AGI about 30 years ago then
thewebguyd•1h ago
lol. The definition of AGI seems to change on the daily, and usually coincides with whatever the describer is trying to sell.
adeelk93•1h ago
I’d amend that to - it coincides with whatever the describer is trying to get funding for
lm28469•1h ago
That certainly how it feels to me. Every demo seems like it's presenting some kind of socially maladjusted silicon valley nerd's wet dream. Half of it doesn't interest non tech people, the other half seems designed for teenagers.

Look at this image of Zuckerberg demoing his new product: https://imgur.com/1naGLfp

Or gpt5 press release: "look at this shitty game it made", "look at the bars on this graph showing how we outperform some other model by 2% in a benchmark that doesn't actually represent anything"

mind-blight•1h ago
GPT-5 is a bit better -particularly around consistency - and a fair amount cheaper. For all of my use cases, that's a huge win.

Products using AI powered days processing (a lot of what I use it for) don't need mind blowing new features. I just want it to be better at summarizing and instruction following, and I want it to be cheaper. GPT-5 seems to knock all of that out of the park

benjiro•7m ago
> GPT-5 is a bit better -particularly around consistency - and a fair amount cheaper. For all of my use cases, that's a huge win.

What is more or less a natural evolution of LLMs... The thing is, where are my benefits as a developer?

If for instance CoPilot charges 1 Premium request for Claude and 1 Premium request for GPT-5, despite that GPT-5 is (with resource usage), supposed to be on a level of GPT 4.1 (a free model). Then (from my point of view) there is no gain.

So far from coding point of view, Claude does coding (often) still better. I made the comparison that Claude feels like a Senior dev, with years of experience, where GPT 5 feels like a academic professor, that is too focus on analytic presentation.

So while its nice to see more competition in the market, i still rank (with Copilot):

Claude > Gemini > GPT5 ... big gap ... GPT4.1 (beast mode) > GPT 4.1

LLM's are following the same progression these days like GPUs, or CPU ... Big jumps at first, then things slow down, you get more power efficiency but only marginal jumps on improvements.

Where we will see benefits, is specialized LLMs, for instance, Anthropic doing a good job for creating a programmer focused LLM. But even those gates are starting to get challenged by Chinese (open source) models, step by step.

GPT5 simply follows a trend. And within a few months, Anthropic will release something probably not much of a improvement over 4.0 but cheaper. Probably better with tool usage. And then comes GPT5.1, 6 months later, and ...

GPT-5.0 in my opinion, for a company with the funding that openAI has, needed to be beat the competition with much more impact.

pton_xd•1h ago
> "look at this shitty game it made"

This is basically every agentic coding demo I've seen to date. It's the future but man we're still years and years away.

OsrsNeedsf2P•1h ago
This article has zero substance
floren•1h ago
A substack article? Zero substance? Whaaaaaaaaaaaaaaaaat
raincole•1h ago
It's just someone noticed that people are not happy with GPT5 release and came up with an apple-to-screech-owl comparison (two completely different kinds of models, one product ready and the other internal test only) to farm clicks.
aerhardt•1h ago
Why bother with substance in the era of vibes?
aydyn•27m ago
Sounds like someone needs to come up with VibesBench.

Maybe it could just be a distilled scoring of social media sentiment the day after announcement? The more positive hype, the higher the VibeScore.

theahura•51m ago
:(
zb3•1h ago
Is Genie available for me to try? No? Then I can't tell, because I won't blindly trust Google.

Remember Imagen? They advertised Imagen 4 level quality long before releasing the original Imagen model. Not falling for this again.

bko•1h ago
It's pretty incredible a model like Genie can deduce the laws of physics from mere observation of video. Even fluid dynamics which is a notoriously difficult problem. It's not obvious that this would happen or would even be possible from this kind of architecture. It's obviously doing something deep here.

As an aside, I think it's funny that the AI Doomer crowd ignores image and video AI models when it comes to AI models that will enslave humanity. It's not inconceivable that a video model would have a better understanding of the world than an LLM. So perhaps it would grow new capabilities and sprout some kind of intent. It's super-intelligence! Surely these models if trained long enough will deduce hypnosis or some similar kind of mind control and cause mass extinction events.

I mean, the only other explanation why LLMs are so scary and likely to be the AI that kills us all is that they're trained on a lot of sci-fi novels so sometimes they'll say things mimicking sentient life and express some kind of will. But obviously that's not true ;-)

gmueckl•50m ago
These models aren't rigorously deriving the future state of a system from a quantitative model based in physical theories. Their understanding of the natural environment around is in line the innate understanding that animals and humams have that is based on the experience of living in an environment that follows deterministic patterns. It is easy learn that a river flows faster in the middle by empirical observation. But that is not correlated with a deeper understanding of hydrodynmics.
cortesoft•18m ago
What is a deeper understanding of the laws of physics other than understanding the patterns?
chairhairair•19m ago
I don't know how one would think doomers "ignore image and video AI models". They (Yudkowsky, Hinton, Kokotajlo, Scott Alexander) point at these things all the time.
surround•48m ago
> The betting markets were not impressed by GPT-5. I am reading this graph as "there is a high expectation that Google will announce Gemini-3 in August", and not as "Gemini 2.5 is better than GPT-5".

This is an incorrect interpretation. The benchmark which the betting market is based upon currently ranks Gemini 2.5 higher than GPT-5.

theahura•43m ago
EDIT: I updated the article to account for this perspective.

------

This can't be right -- they're using LMArena without style control to resolve the market, and GPT-5 is ahead right? (https://lmarena.ai/leaderboard/text/overall-no-style-control)

> This market will resolve according to the company which owns the model which has the highest arena score based off the Chatbot Arena LLM Leaderboard (https://lmarena.ai/) when the table under the "Leaderboard" tab is checked on August 31, 2025, 12:00 PM ET.

> Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.

> If two models are tied for the top arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g. if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No")

> The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.

JimDabell•27m ago
> This is an incorrect interpretation. The benchmark which the betting market is based upon currently ranks Gemini 2.5 higher than GPT-5.

You can see from the graph that Google shot way up from ~25% to ~80% upon the release of GPT-5. Google’s model didn’t suddenly get way better at any benchmarks, did it?

dcre•4m ago
It's not about Google's model getting better. It is that gpt-5 already has a worse score than Gemini 2.5 Pro had before gpt-5 came out (on the particular metric that determines this bet: Overall Text without Style Control).

https://lmarena.ai/leaderboard/text/overall-no-style-control

That graph is a probability. The fact that it's not 100% reflects the possibility that gpt-5 or someone else will improve enough by the end of the month to beat Gemini.

jeremyjh•28m ago
> Imagine asking a model a question like “what's the weather in Tibet” and instead of doing something lame like check weather.com, it does something awesome like stimulate Tibet exactly so that it can tell you the weather based on the simulation.

Was where I stopped reading.

justonceokay•1m ago
We already automate away all possible human interaction. Maybe in the future we can automate away our senses themselves.

My roommate already looks at his weather app to see what to wear instead of putting his hand out the window. Simulating the weather instead of experiencing it is just the next logical step

standardUser•18m ago
> The goal of AGI is to make programs that can do lots of things.

What do Genie and GPT have to do with AGI? I'm sure the people who stand to make billions love to squint and see their LLM as only steps away from an AGI. Or that guy at Google who fell in love with one. But the rest of us know better.

mirblitzarmaven•16m ago
> Imagine asking a model a question like “what's the weather in Tibet” and instead of doing something lame like check weather.com, it does something awesome like stimulate Tibet [...]

Let's not stimulate Tibet