
Show HN: LLMKit – Compare LLMs side-by-side with real-time streaming

https://www.llmkit.cc/model-comparison/gpt-4-vs-claude-3-5-sonnet
2•chieund•1h ago
Hi HN!

I built LLMKit after getting frustrated with choosing the right LLM for different projects. Instead of guessing or relying on benchmarks that don't match real use cases, I wanted to see actual performance with my own prompts.

What it does:

• Compare up to 5 models simultaneously (GPT-4, Claude, Gemini, etc.)
• Real-time streaming comparison - watch models race to respond
• Custom scoring weights based on your priorities (speed vs cost vs quality)
• System prompt support for production-realistic testing
• TTFT (Time to First Token) metrics for latency-sensitive apps
• No signup required, API keys stay in your browser
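To make the custom scoring weights concrete, here is a minimal sketch of how speed, cost, and quality could be folded into one ranking. This is not LLMKit's actual code: the names (`ModelResult`, `Weights`, `scoreModels`), the 0..1 quality field, and the "best value divided by this value" normalization are all assumptions chosen for illustration.

```typescript
// Hypothetical shapes; LLMKit's real data model is not shown in the post.
interface ModelResult {
  name: string;
  ttftMs: number;   // time to first token, in milliseconds
  costUsd: number;  // cost of the request, in US dollars
  quality: number;  // assumed to be a 0..1 rating supplied by the user
}

interface Weights {
  speed: number;
  cost: number;
  quality: number;
}

// Rank models by a weighted average of normalized speed, cost, and quality.
function scoreModels(
  results: ModelResult[],
  w: Weights
): { name: string; score: number }[] {
  const total = w.speed + w.cost + w.quality;
  if (total <= 0) throw new Error("weights must sum to a positive number");

  // Lower is better for TTFT and cost, so the best model in the run gets 1.0
  // and the others get a fraction of that.
  const minTtft = Math.min(...results.map((r) => r.ttftMs));
  const minCost = Math.min(...results.map((r) => r.costUsd));

  return results
    .map((r) => {
      const speed = r.ttftMs > 0 ? minTtft / r.ttftMs : 1;
      const cost = r.costUsd > 0 ? minCost / r.costUsd : 1;
      const score =
        (w.speed * speed + w.cost * cost + w.quality * r.quality) / total;
      return { name: r.name, score };
    })
    .sort((a, b) => b.score - a.score); // highest score first
}
```

With weights like `{ speed: 1, cost: 0, quality: 0 }` the ranking reduces to "lowest TTFT wins"; shifting weight toward cost or quality reorders the same results without rerunning the prompts.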

The "aha moment" was adding streaming comparison - seeing GPT-4 start fast but Claude catch up, or watching cost-effective models perform surprisingly well. It's like A/B testing but for LLMs.

Built with Next.js + TypeScript. The streaming implementation was tricky - had to handle different provider formats (OpenAI vs Anthropic) and parallel SSE connections.
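For anyone curious what "different provider formats" means in practice, here is a rough sketch of normalizing the two SSE streams into plain text. It is an illustration, not LLMKit's implementation: `Provider`, `extractDelta`, and `parseChunk` are hypothetical names, and the payload shapes assumed are OpenAI's chat completion chunks (`choices[0].delta.content`, ending with `[DONE]`) and Anthropic's `content_block_delta` events carrying a `text_delta`.

```typescript
// Hypothetical names throughout; this is not LLMKit's actual code.
type Provider = "openai" | "anthropic";

// Extract the text delta (if any) from one SSE "data:" payload.
function extractDelta(provider: Provider, data: string): string | null {
  if (data === "[DONE]") return null; // OpenAI end-of-stream sentinel

  let event: any;
  try {
    event = JSON.parse(data);
  } catch {
    return null; // ignore malformed or partial payloads
  }

  if (provider === "openai") {
    return event?.choices?.[0]?.delta?.content ?? null;
  }

  // Anthropic sends typed events; only text deltas carry output text.
  if (
    event?.type === "content_block_delta" &&
    event.delta?.type === "text_delta"
  ) {
    return event.delta.text ?? null;
  }
  return null;
}

// Split a buffered SSE chunk into "data:" payloads and collect their text.
function parseChunk(provider: Provider, chunk: string): string {
  let text = "";
  for (const line of chunk.split("\n")) {
    const trimmed = line.trim();
    if (!trimmed.startsWith("data:")) continue; // skip "event:" lines and blanks
    const delta = extractDelta(provider, trimmed.slice(5).trim());
    if (delta) text += delta;
  }
  return text;
}
```

Once every provider is reduced to the same "text delta" shape, running several streams side by side is mostly a matter of starting one reader per model (for example with `Promise.all`) and recording the time of the first non-empty delta for TTFT. A real reader would also need to buffer payloads that are split across network chunks, which this sketch skips.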

I built a retro mini PC which uses game cartridges [video]

http://youtube.com/watch?v=iJbJDowBfi4
1•abeisgreat•1m ago•1 comments

Genesis Open Source Embodied AGI Simulation, Rust (Mamba-3, Not Transformers)

1•RGBra•2m ago•0 comments

Reddit comment led police to identify Brown University shooter

https://9to5mac.com/2025/12/22/reddit-comment-led-police-to-identify-brown-university-shooter/
1•akyuu•4m ago•0 comments

Free to Post, Impossible to Hide: The End of Anonymous Marketplaces

https://medium.com/@lea.leumassart/free-to-post-impossible-to-hide-the-end-of-anonymous-marketpla...
1•pbacdf•5m ago•0 comments

Meilisearch: Make the S3-streaming snapshots an Enterprise Edition feature

https://github.com/meilisearch/meilisearch/pull/6057
1•iruoy•7m ago•0 comments

How AI Collapses and Rebuilds Marketplace Moats

https://www.caseyaccidental.com/p/when-agents-attack-how-ai-collapses
1•gmays•7m ago•0 comments

Banana is Generating Antimatter, And I Detected It [video]

https://www.youtube.com/watch?v=ZOTsDmeM0No
2•jmward01•9m ago•1 comments

Show HN: Cardly – a tiny card-first app to capture people's Gift Cards

https://www.cardlyai.app/
1•Pastaza•9m ago•1 comments

Show HN: Making SVG Sparkline Component with an Agent to Graph Token Usage

https://bsky.app/profile/verdverm.com/post/3makhu3nbm22n
1•verdverm•9m ago•0 comments

Show HN: Find games with few -but positive- reviews based on games that you like

https://www.notsoaaa.com/
1•AmbroseBierce•12m ago•0 comments

Show HN: LLVM-jutsu: Anti-LLM obfuscation pass

https://github.com/thebabush/llvm-jutsu
1•babush•13m ago•0 comments

Welcome to Kenya's Great Carbon Valley: a bold new gamble to fight climate change

https://www.technologyreview.com/2025/12/22/1130153/geothermal-energy-carbon-capture-kenya-climat...
1•rbanffy•13m ago•0 comments

How the Cybertruck's design may have trapped crash survivors in flames

https://www.washingtonpost.com/technology/interactive/2025/cybertruck-crash-design-lawsuit/
2•Jtsummers•13m ago•0 comments

Paperbacks and TikTok

https://calnewport.com/on-paperbacks-and-tiktok/
1•zdw•13m ago•0 comments

Lua 5.5 Released

https://www.lua.org/manual/5.5/readme.html#changes
2•todsacerdoti•14m ago•1 comments

Best way to annotate large parquet LLM logs without full rewrites?

1•platypii•17m ago•0 comments

The Program 2025 annual review: How much money does an audio drama podcast make?

https://programaudioseries.com/the-program-results-7/
2•I-M-S•17m ago•1 comments

ChatGPT Is a Search Engine

https://queryburst.com/blog/how-chatgpt-works/
1•AznHisoka•18m ago•0 comments

Power outage paralyzes Waymo robotaxis when traffic lights go out

https://arstechnica.com/cars/2025/12/power-outage-paralyzes-waymo-robotaxis-when-traffic-lights-g...
2•chirau•18m ago•1 comments

Reducing contrails reduces CO2 effect of air travel 73%, adds only 0.08% to cost [video]

https://www.youtube.com/watch?v=QoOVqQ5sa08
1•CGMthrowaway•23m ago•0 comments

Algorithmic Personalization Causes Inaccurate Generalization and Overconfidence

https://psycnet.apa.org/fulltext/2026-31272-001.html
1•PaulHoule•23m ago•0 comments

Inquiry ongoing after UK government hacked, says minister

https://www.bbc.co.uk/news/articles/cj4qpwprw9vo
1•GaryBluto•23m ago•0 comments

Older Americans Quit Weight-Loss Drugs in Droves

https://www.nytimes.com/2025/12/21/health/older-people-glp1-weight.html
3•bookofjoe•23m ago•1 comments

Samsung Biologics to buy US drug production facility from GSK for $280M

https://www.reuters.com/business/healthcare-pharmaceuticals/samsung-biologics-buy-us-drug-product...
2•randycupertino•24m ago•0 comments

The Fisherman and the Businessman

https://kevquirk.com/blog/the-fisherman-and-the-businessman/
1•0x54MUR41•24m ago•0 comments

A zero-dependency approach to archival, interactive research

https://tjid3.org/tech
1•TimothyMJones•25m ago•1 comments

MacSync Stealer variant finds a way to bypass Apple malware protections

https://9to5mac.com/2025/12/22/macsync-stealer-variant-finds-a-way-to-bypass-apple-malware-protec...
1•zdw•26m ago•0 comments

Intel x86 considered harmful [pdf]

https://blog.invisiblethings.org/papers/2015/x86_harmful.pdf
2•throwoutway•26m ago•0 comments

National Portrait Gallery Buys Rare Photographs of Ada Lovelace for the UK

https://www.ianvisits.co.uk/articles/national-portrait-gallery-saves-rare-photographs-of-ada-love...
3•ianvisits•26m ago•1 comments

Around 1k systems compromised in ransomware attack on Romanian water agency

https://www.theregister.com/2025/12/22/around_1000_systems_compromised_in/
1•GaryBluto•26m ago•0 comments