frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Focus and Context and LLMs

https://taras.glek.net/posts/focus-and-context-and-llms/
16•tarasglek•5h ago

Comments

quantum_state•2h ago
Context is all you need :-)
tarasglek•2h ago
Indeed, that was my original working title
summarity•17m ago
I found the same in my personal work. I have o3 chats (as in OAI's Chat interface) that are so large they crash the site, yet o3 still responds without hallucination and can debug across 5k+ LOC. I've used it for DSP code, to debug a subtle error in a 800+LOC Nim macro that sat in a 4k+ LOC module (it found the bug), work on compute shaders for audio analysis, work on optimizing graphics programs and other algorithms. Once I "vibe coded" (I hate that term) a fun demo using a color management lib I wrote, which encoded the tape state for a brainfuck interpreter in the deltaE differences between adjacent cells. Using the same prompts replayed in Claude chat and others doesn't even get close. It's spooky.

Yet when I use the Codex CLI, or agent mode in any IDE it feels like o3 regresses to below GPT-3.5 performance. All recent agent-mode models seem completely overfitted to tool calling. The most laughable attempt is Mistral's devstral-small - allegedly the #1 agent model, but going outside of scenarios you'd encounter in SWEbench & co it completely falls apart.

I notice this at work as well, the more tools you give any model (reasoning or not), the more confused it gets. But the alternative is to stuff massive context into the prompts, and that has no ROI. There's a fine line to be walked here, but no one is even close it yet.

__mharrison__•7m ago
Building complex software is certainly possible with no coding and minimal promoting.

This YT video (from 2 days ago) demonstrates it https://youtu.be/fQL1A4WkuJk?si=7alp3O7uCHY7JB16

The author builds a drawing app in an hour.

emorning3•7m ago
The article summed itself up as 'Context is everything".

But the article itself also makes the point that a human assistant was also necessary. That's gonna be my take away.

Simplified Transformers

https://github.com/bobby-he/simplified_transformers
1•JPLeRouzic•10s ago•0 comments

Show HN: Fontweaver – AI Generated Fonts

https://fontweaver.com/about-us
1•lcmchris•26s ago•0 comments

OpenAI scraping Reddit through redlib instances

https://hcrypt.net/2025/06/08/scrapers.html
1•udev4096•1m ago•0 comments

Rohde and Schwarz AMIQ Modulation Generator Teardown

https://tomverbeure.github.io/2025/04/26/RS-AMIQ-Teardown-Analog-Deep-Dive.html
1•iamsrp•3m ago•0 comments

VoxeLibre (Formerly MineClone2)

https://content.luanti.org/packages/wuzzy/mineclone2/
1•l2dy•5m ago•0 comments

Mastering Modern Time Series Forecasting: Guide to Statistical, ML and DL Models

https://valeman.gumroad.com/l/MasteringModernTimeSeriesForecasting
1•nabla9•5m ago•0 comments

Installing SteamOS on Lenovo Legion Go S [video]

https://www.youtube.com/watch?v=i0ht2HJBQwc
1•Risse•5m ago•0 comments

Launching the BeOS on Hitachi Flora Prius Systems (1999)

http://testou.free.fr/www.beatjapan.org/mirror/www.be.com/support/guides/hitachi_boot.html
1•doener•8m ago•1 comments

Design as You Go: The Case Study of Chenab Railway Bridge (pdf)

https://link.springer.com/epdf/10.1007/s40098-025-01270-y?sharing_token=ZlVt8RfLjl3NuVrUmrFpc_e4RwlQNchNByi7wbcMAY4ypeD59rkCV9uWHxjOjJ-SWm6doZdM4bnXn88WVvT2s8OxYqJrHGTgf5N6RIC62nEXKRVC-61E3oRnjg6f31TG8UVvDSaqTlGReO4L-DNXZ88bZkvK4WbYkyRO1T1TP9w%3D
1•rustoo•12m ago•0 comments

A simple ray tracer written in the meson.build language

https://github.com/annacrombie/meson-raytracer
1•fanf2•13m ago•0 comments

Show HN: ClearTok – Automatically remove reposted videos

https://tiktokrepostremover.com
1•auroroa•13m ago•0 comments

BeOS – The Forgotten '90s Operating System (Retrospective and Demo) [video]

https://www.youtube.com/watch?v=MzosnPSETzk
1•doener•15m ago•0 comments

Visualize Data Structures in VS Code

https://www.youtube.com/shorts/3O6BFlOiFRg
1•Brysonbw•19m ago•0 comments

How to Run Private and Uncensored LLMs Offline – Dolphin Llama 3

https://www.youtube.com/watch?v=eiMSapoeyaU
1•Brysonbw•20m ago•0 comments

Enterprises are getting stuck in AI pilot hell, say Chatterbox Labs execs

https://www.theregister.com/2025/06/08/chatterbox_labs_ai_adoption/
1•rntn•21m ago•0 comments

SnitchBench – Learn if your AI model will rat you out to the feds

https://github.com/t3dotgg/SnitchBench
1•empressplay•22m ago•0 comments

Boulder Dash Was Created

https://spillhistorie.no/2025/06/06/how-boulder-dash-was-created/
1•megamike•22m ago•0 comments

The Computer Chronicles – HyperCard (1987) [video]

https://www.youtube.com/watch?v=FquNpWdf9vg
1•dpapathanasiou•26m ago•0 comments

It's 1970, you're Thompson/Ritchie applying to YC

1•pootietangus•27m ago•2 comments

A Demonstrator's Guide to Understanding Riot Munitions

https://crimethinc.com/2021/01/04/a-demonstrators-guide-to-understanding-riot-munitions-and-how-to-defend-against-them
1•nabla9•27m ago•0 comments

Bento, a Steam Deck in a Keyboard

https://old.reddit.com/r/SteamDeckModded/comments/1l4etg3/introducing_bento_a_steam_deck_in_a_keyboard/
1•croes•29m ago•0 comments

Timeouts and Cancellation for Humans (2018)

https://vorpus.org/blog/timeouts-and-cancellation-for-humans/
1•pkkm•31m ago•0 comments

Show HN: WhisperBuddy, Privacy-first AI-transcription app built after my layoff

https://whisperbuddy.com
1•nghialuong•34m ago•1 comments

NakaPay – Accept Bitcoin Lightning payments in your business with ease

https://www.nakapay.app/
1•hubavka•35m ago•1 comments

The Role of the Human Brain in Programming

https://www.youtube.com/watch?v=1WC8dxMC4Xw
1•stevekrouse•44m ago•0 comments

Show HN: Free CLI to download any YouTube videos in HD

https://github.com/pH-7/Download-Simply-Videos-From-YouTube
1•pierres7•44m ago•0 comments

Show NH: AI Dream analyzer in process oriented psychology paradigm

https://dreampower.app/
1•prosheprostogo•46m ago•1 comments

Federal agents do immigration raids in LA. Trump orders National Guard response

https://laist.com/news/federal-agents-immigration-raids-across-la
2•sizzle•46m ago•0 comments

AI "reasoning" models don't reason at all

https://twitter.com/RubenHssd/status/1931389580105925115
1•doener•49m ago•1 comments

Exploring vocabulary alignment of neurons in Llama-3.2-1B

https://grgv.xyz/blog/neurons1/
1•coolvision•50m ago•0 comments