Ask HN: How would you write evals for chat apps running open models?

1•hv_•1d ago
Hi all,

I'm interviewing for a certain Half-Life provider (full-stack role, application layer) that prides itself on serving open models. I think there is a decent chance I'll be asked how to design a chat app in the systems design interview, and my biggest knowledge gap is writing evals.

A chat app is so dynamic that it is difficult to home in on specifics for the evals beyond correct tool usage.
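To make the tool-usage part concrete, here is a rough sketch of what such an eval could look like. Everything in it is an assumption for illustration: `ToolCall`, `EvalCase`, `score_case`, `run_evals`, the `get_weather` tool, and the `fake_model` stand-in are hypothetical names, not any provider's API. A real harness would call the served model and parse its tool-call output instead.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class ToolCall:
    name: str
    args: dict = field(default_factory=dict)


@dataclass
class EvalCase:
    prompt: str
    expected: Optional[ToolCall]  # None means no tool should be called


def score_case(case: EvalCase, actual: Optional[ToolCall]) -> float:
    """1.0 for an exact match, 0.5 for the right tool with wrong args, else 0.0."""
    if case.expected is None or actual is None:
        # Both must be None to count as correct.
        return 1.0 if case.expected is actual else 0.0
    if actual.name != case.expected.name:
        return 0.0
    return 1.0 if actual.args == case.expected.args else 0.5


def run_evals(cases: list, model: Callable[[str], Optional[ToolCall]]) -> float:
    """Average score over all cases; `model` maps a prompt to a ToolCall or None."""
    scores = [score_case(case, model(case.prompt)) for case in cases]
    return sum(scores) / len(scores)


def fake_model(prompt: str) -> Optional[ToolCall]:
    # Hypothetical stand-in for a real model call: always answers "Paris".
    if "weather" in prompt:
        return ToolCall("get_weather", {"city": "Paris"})
    return None


CASES = [
    EvalCase("weather in Paris?", ToolCall("get_weather", {"city": "Paris"})),
    EvalCase("weather in Oslo?", ToolCall("get_weather", {"city": "Oslo"})),
    EvalCase("tell me a joke", None),
]

if __name__ == "__main__":
    print(f"tool-usage score: {run_evals(CASES, fake_model):.2f}")
```

The partial credit for "right tool, wrong arguments" is just one possible design choice; it separates routing mistakes from argument-extraction mistakes, which tend to need different fixes.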

Thanks for reading!

Cheers

How Chinese High School Students Made It to Google Gemma Hackathon Finals

https://aicamp.substack.com/p/how-we-made-it-to-google-gemma-hackathon
1•inlandrookie•3m ago•0 comments

Cold Showers

https://github.com/hwayne/awesome-cold-showers
1•lemper•12m ago•0 comments

Show HN: Enhanced DCA Trading Bot in Go – 24% returns vs. 12% classic DCA

https://github.com/Zmey56/enhanced-dca-bot
1•Zmey56•15m ago•0 comments

Merg – Deep Research for Media

https://mergai.vercel.app
2•garygao333•15m ago•1 comments

Browse the web in Markdown using an HTML->Markdown language model

https://leidnedya.github.io/markweb/
1•otherayden•16m ago•1 comments

Canada became the centre of a measles outbreak in North America

https://www.bbc.com/news/articles/c4g8d39gdr0op
1•dataminer•16m ago•0 comments

ESP32-Faikin: ESP32 based module to control Daikin aircon units

https://github.com/revk/ESP32-Faikin
2•todsacerdoti•16m ago•0 comments

Project Gemini – new internet technology for interconnected text documents

https://geminiprotocol.net
2•andsoitis•19m ago•0 comments

Ask HN: Will transformer based LLMs hit an improvement ceiling?

1•jaguar75•20m ago•0 comments

Cosmograph: Visualize big networks within seconds

https://cosmograph.app/
1•matteodelabre•22m ago•0 comments

Dirt to Airplanes: Making Aluminium

https://maurycyz.com/projects/al/
2•nothacking_•22m ago•0 comments

Door Wide AI: The 64M Users McDonald's Left Behind

https://www.vitraag.com/2025/07/20/door-wide-ai/
1•vaibhavb•22m ago•1 comments

Homo Floresiensis

https://en.wikipedia.org/wiki/Homo_floresiensis
3•kaycebasques•22m ago•0 comments

qman – a more modern manual page viewer for our terminals

https://github.com/plp13/qman
1•pabs3•26m ago•0 comments

Retrieval Embedding Benchmark

https://huggingface.co/spaces/embedding-benchmark/RTEB
1•fzliu•29m ago•0 comments

ARMv8 AArch64/ARM64 Full Beginner's Assembly Tutorial

https://mariokartwii.com/armv8/
2•andsoitis•30m ago•0 comments

FFmpeg School of Assembly Language

https://github.com/FFmpeg/asm-lessons
2•vismit2000•33m ago•0 comments

At least 67 killed while waiting for aid in Gaza, officials say

https://news.sky.com/story/at-least-67-killed-while-waiting-for-aid-in-gaza-officials-say-13399225
12•mhga•37m ago•1 comments

It's Easier to Get Mad About One Tree Than It Is Deforestation

https://www.bloomberg.com/news/articles/2025-07-18/it-s-easier-to-get-mad-about-one-tree-than-it-is-deforestation
1•petethomas•40m ago•0 comments

Fallout programmer Tim Cain on the game's memory model [video]

https://www.youtube.com/watch?v=6kB_fko6SIg
1•wk_end•45m ago•0 comments

Speeding up zsh and Oh-My-Zsh (2018)

https://blog.jonlu.ca/posts/speeding-up-zsh
1•vinhnx•48m ago•0 comments

Show HN: A free hostel in the heart of Switzerland

https://stayinginbern.ch
1•chagaif•48m ago•0 comments

Sonos Radio Issue

https://status.sonos.com
1•langfo•50m ago•1 comments

US Navy, Coast Guard Shipbuilding in Disarray and No US Commercial Shipbuilding [video]

https://www.youtube.com/watch?v=HOHKog66DaA
3•toomuchtodo•51m ago•0 comments

Scientists want to build 'living' computers–powered by live brain cells

https://www.nationalgeographic.com/science/article/brain-cells-organoids-computers-ai-energy
6•Bluestein•58m ago•1 comments

System-wide outage with Alaska Airlines

https://old.reddit.com/r/AlaskaAirlines/comments/1m57oij/comment/n49ylia/
4•tobinfekkes•1h ago•3 comments

Open Source Radar – Share, Collab, Find Software Projects

https://opensourceradar.org
1•ReddBird•1h ago•6 comments

Archaeologists find evidence of Europe's oldest lake settlement

https://www.independent.co.uk/news/science/archaeology/lake-ohrid-albania-oldest-human-settlement-b2790762.html
5•Bluestein•1h ago•0 comments

WWII Veteran Recalls Discovering a Nazi Concentration Camp [video]

https://www.youtube.com/watch?v=LGWHf8Pe320
1•thomassmith65•1h ago•0 comments

Science is almost ready to "redefine the second" with this new research

https://www.neowin.net/news/science-is-almost-ready-to-redefine-the-second-with-this-new-research/
6•Bluestein•1h ago•0 comments