frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

When Albert Einstein Moved to Princeton

https://twitter.com/Math_files/status/2020017485815456224
1•keepamovin•18s ago•0 comments

Agents.md as a Dark Signal

https://joshmock.com/post/2026-agents-md-as-a-dark-signal/
1•birdculture•2m ago•0 comments

System time, clocks, and their syncing in macOS

https://eclecticlight.co/2025/05/21/system-time-clocks-and-their-syncing-in-macos/
1•fanf2•3m ago•0 comments

McCLIM and 7GUIs – Part 1: The Counter

https://turtleware.eu/posts/McCLIM-and-7GUIs---Part-1-The-Counter.html
1•ramenbytes•6m ago•0 comments

So whats the next word, then? Almost-no-math intro to transformer models

https://matthias-kainer.de/blog/posts/so-whats-the-next-word-then-/
1•oesimania•7m ago•0 comments

Ed Zitron: The Hater's Guide to Microsoft

https://bsky.app/profile/edzitron.com/post/3me7ibeym2c2n
2•vintagedave•10m ago•1 comments

UK infants ill after drinking contaminated baby formula of Nestle and Danone

https://www.bbc.com/news/articles/c931rxnwn3lo
1•__natty__•11m ago•0 comments

Show HN: Android-based audio player for seniors – Homer Audio Player

https://homeraudioplayer.app
1•cinusek•11m ago•0 comments

Starter Template for Ory Kratos

https://github.com/Samuelk0nrad/docker-ory
1•samuel_0xK•13m ago•0 comments

LLMs are powerful, but enterprises are deterministic by nature

1•prateekdalal•16m ago•0 comments

Make your iPad 3 a touchscreen for your computer

https://github.com/lemonjesus/ipad-touch-screen
2•0y•21m ago•1 comments

Internationalization and Localization in the Age of Agents

https://myblog.ru/internationalization-and-localization-in-the-age-of-agents
1•xenator•21m ago•0 comments

Building a Custom Clawdbot Workflow to Automate Website Creation

https://seedance2api.org/
1•pekingzcc•24m ago•1 comments

Why the "Taiwan Dome" won't survive a Chinese attack

https://www.lowyinstitute.org/the-interpreter/why-taiwan-dome-won-t-survive-chinese-attack
1•ryan_j_naughton•24m ago•0 comments

Xkcd: Game AIs

https://xkcd.com/1002/
1•ravenical•26m ago•0 comments

Windows 11 is finally killing off legacy printer drivers in 2026

https://www.windowscentral.com/microsoft/windows-11/windows-11-finally-pulls-the-plug-on-legacy-p...
1•ValdikSS•26m ago•0 comments

From Offloading to Engagement (Study on Generative AI)

https://www.mdpi.com/2306-5729/10/11/172
1•boshomi•28m ago•1 comments

AI for People

https://justsitandgrin.im/posts/ai-for-people/
1•dive•29m ago•0 comments

Rome is studded with cannon balls (2022)

https://essenceofrome.com/rome-is-studded-with-cannon-balls
1•thomassmith65•35m ago•0 comments

8-piece tablebase development on Lichess (op1 partial)

https://lichess.org/@/Lichess/blog/op1-partial-8-piece-tablebase-available/1ptPBDpC
2•somethingp•36m ago•0 comments

US to bankroll far-right think tanks in Europe against digital laws

https://www.brusselstimes.com/1957195/us-to-fund-far-right-forces-in-europe-tbtb
3•saubeidl•37m ago•0 comments

Ask HN: Have AI companies replaced their own SaaS usage with agents?

1•tuxpenguine•40m ago•0 comments

pi-nes

https://twitter.com/thomasmustier/status/2018362041506132205
1•tosh•42m ago•0 comments

Show HN: Crew – Multi-agent orchestration tool for AI-assisted development

https://github.com/garnetliu/crew
1•gl2334•42m ago•0 comments

New hire fixed a problem so fast, their boss left to become a yoga instructor

https://www.theregister.com/2026/02/06/on_call/
1•Brajeshwar•44m ago•0 comments

Four horsemen of the AI-pocalypse line up capex bigger than Israel's GDP

https://www.theregister.com/2026/02/06/ai_capex_plans/
1•Brajeshwar•44m ago•0 comments

A free Dynamic QR Code generator (no expiring links)

https://free-dynamic-qr-generator.com/
1•nookeshkarri7•45m ago•1 comments

nextTick but for React.js

https://suhaotian.github.io/use-next-tick/
1•jeremy_su•47m ago•0 comments

Show HN: I Built an AI-Powered Pull Request Review Tool

https://github.com/HighGarden-Studio/HighReview
1•highgarden•47m ago•0 comments

Git-am applies commit message diffs

https://lore.kernel.org/git/bcqvh7ahjjgzpgxwnr4kh3hfkksfruf54refyry3ha7qk7dldf@fij5calmscvm/
1•rkta•50m ago•0 comments
Open in hackernews

Do you prefer GPT or Google Gemini?

2•AaronSwift1•5mo ago
and why

Comments

fbhabbed•5mo ago
GPT5-Thinking if I need a precise answer with the least possible amount of mistakes.

GPT5-Pro is the real deal.

Gemini if I need creative insights and a pleasant talk, but this comes at the cost of more mistakes (and it's hella stubborn).

Hopefully Gemini 3.0 will fix this.

Topfi•5mo ago
Generally I currently rely on GPT-5 Thinking (Medium Reasoning) for most tasks because of all the models currently available, the GPT-5 series (and GPT-4.1 before that) have been the most reliable in following instructions to the letter, doing no less, but more importantly, no more.

Both Claude models (from 3.5 Sonnet to 4.1 Opus) and Gemini 2.5 Pro have historically always taken a lot more liberties, which some users find appealing, but which I have come to not want when relying on a model for consistent output. I can see why some find great value in a model already implementing e.g. an auth provider when requesting the frontend for a login page, but for guiding a model, I personally prefer something to not happen if I didn't explicitly requested such behavior. Especially Claude as part of agentic coding workflows has a tendency to simply try e.g. a different package then what was requested, which some users may not notice. Found this very funny when Claude 4 Sonnet once fully reimplemented an infinite canvas as @xyflow failed to install properly. I'd rather a model error out there/ask for the user in the loop to confirm.

In regard to instruction following, while all three Frontier providers do well with their context windows, GPT-5 models are still a bit more preferable for me, despite having only 400k vs 1million, simply because what is there can not just be recalled, but will be adhered to reliably as well.

GPT-5 also seems a bit better regarding CSS, though I have far to limited UI taste to actually make a solid judgement on that front and styling is of course subjective.

Additionally, when benchmarking all three frontier models side by side, I have yet to find a coding task that GPT-5 cannot solve but the others can. I did however find certain cases where my initial instructions were lacking/not comprehensive enough, leading to all three having issues completing a task. In those cases, I found that Gemini 2.5 Pro when provided with the code base does best at rewriting an existing prompt. These then usually have far higher success rates when provided to GPT-5 Thinking (Medium Reasoning) or to a lesser extend when using one of the Claude 4 models. However, these Gemini provided prompts also occasionally contain inventions/hallucinations", so I must always triple check prompts when doing this.

For context, the main coding problem I am using model assistance for at the moment is some poorly designed Figma inspired real time syncing code with some overly odd edge cases, courtesy of my limited skillset.

For none-coding stuff, I have in the last semester mainly relied on Gemini 2.5 (sometimes Pro, often Flash) for creating nice summaries of lectures. I found any other models (doesn't matter whether OpenAIs previous models or anything from Anthropic, Mistral, Deepseek, Qwen, etc.) less suitable, mainly because these tended to output far to strong summaries, often truncating what is absolutely vital information. Gemini models are far more willing to actually output an extensive, maybe a bit to verbose summary, but I'd rather remove a few lines. I haven't yet gotten enough experience with GPT-5 as a summarization tool as the semester is only just starting, so cannot say how well OpenAIs newest series does there, but from very limited experience, GPT-5-mini has potential here to replace Gemini 2.5 Flash as my go to.