frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Kimi vendor verifier – verify accuracy of inference providers

https://www.kimi.com/blog/kimi-vendor-verifier
80•Alifatisk•2h ago

Comments

OsamaJaber•1h ago
Good to see this exist. Inference providers quietly swap quant levels. Most users never check. A standard verifier from the model maker is the right move, would love to see other labs ship the same
bobbiechen•1h ago
If I understand correctly, threat model here seems to be to protect against accidental issues that would impact performance, but doesn't cover malicious actor.

For example, Sketchy Provider tells you they are running the latest and greatest, but actually is knowingly running some cheaper (and worse) model and pocketing the difference. These tests wouldn't help since Sketchy Provider could detect when they're being tested and do the right thing (like the Volkswagen emissions scandal). Right?

j-bos•1h ago
Seems like a great challenge for all these systems, see fromtier labs serving quants when under hesvy load.
gpm•21m ago
Yes and no.

For a truly malicious actor, you're right. But it shifts it from "well we aren't obviously committing fraud by quantizing this model and not telling people" to "we're deliberately committing fraud by verifying our deployment with one model and then serving customer requests with another".

I suspect there's a lot of semi-malicious actors who are only happy to do the former.

seism•1h ago
A test that runs for 15 hours on a high powered rig is going to be hard to reproduce or scale. But I think this addresses a widespread concern, which affects all kinds of cloud services. What you ping is not necessarily what you get.
curioussquirrel•1h ago
After Anthropic, Moonshot is another model provider who restricts tweaking of sampling parameters. I do like the idea of the vendor verifier, though.

Tim Cook to become Apple Executive Chairman. John Ternus to become CEO

https://www.apple.com/newsroom/2026/04/tim-cook-to-become-apple-executive-chairman-john-ternus-to...
474•schappim•53m ago•199 comments

AI Resistance Is Growing

https://stephvee.ca/blog/artificial%20intelligence/ai-resistance-is-growing/
188•speckx•1h ago•132 comments

Qwen3.6-Max-Preview: Smarter, Sharper, Still Evolving

https://qwen.ai/blog?id=qwen3.6-max-preview
452•mfiguiere•7h ago•237 comments

Kimi vendor verifier – verify accuracy of inference providers

https://www.kimi.com/blog/kimi-vendor-verifier
80•Alifatisk•2h ago•6 comments

We got 207 tok/s with Qwen3.5-27B on an RTX 3090

https://github.com/Luce-Org/lucebox-hub
94•GreenGames•2h ago•25 comments

GitHub's fake star economy

https://awesomeagents.ai/news/github-fake-stars-investigation/
665•Liriel•13h ago•335 comments

ggsql: A Grammar of Graphics for SQL

https://opensource.posit.co/blog/2026-04-20_ggsql_alpha_release/
312•thomasp85•8h ago•71 comments

Deezer says 44% of songs uploaded to its platform daily are AI-generated

https://techcrunch.com/2026/04/20/deezer-says-44-of-songs-uploaded-to-its-platform-daily-are-ai-g...
231•FiddlerClamp•5h ago•228 comments

Modern Rendering Culling Techniques

https://krupitskas.com/posts/modern_culling_techniques/
47•krupitskas•1d ago•5 comments

Kefir C17/C23 Compiler

https://sr.ht/~jprotopopov/kefir/
81•conductor•2d ago•4 comments

Quantum Computers Are Not a Threat to 128-Bit Symmetric Keys

https://words.filippo.io/128-bits/
64•hasheddan•4h ago•35 comments

Kimi K2.6: Advancing open-source coding

https://www.kimi.com/blog/kimi-k2-6
481•meetpateltech•6h ago•242 comments

F-35 is a masterpiece built for the wrong war

https://warontherocks.com/cogs-of-war/the-f-35-is-a-masterpiece-built-for-the-wrong-war/
92•anjel•1h ago•109 comments

10 years ago, someone wrote a test for Servo that included an expiry in 2026

https://mastodon.social/@jdm_/116429380667467307
164•luu•1d ago•97 comments

Bloom (YC P26) Is Hiring

https://www.ycombinator.com/companies/trybloom/jobs
1•RayFitzgerald•4h ago

Writing string.h functions using string instructions in asm x86-64

https://pmasschelier.github.io/x86_64_strings/
21•thaisstein•3d ago•2 comments

WebUSB Extension for Firefox

https://github.com/ArcaneNibble/awawausb
169•tuananh•9h ago•151 comments

M 7.4 earthquake – 100 km ENE of Miyako, Japan

https://earthquake.usgs.gov/earthquakes/eventpage/us6000sri7/
237•Someone•11h ago•105 comments

We accepted surveillance as default

https://vivianvoss.net/blog/why-we-accepted-surveillance
242•speckx•4h ago•108 comments

Atlassian enables default data collection to train AI

https://letsdatascience.com/news/atlassian-enables-default-data-collection-to-train-ai-f71343d8
426•kevcampb•9h ago•99 comments

Brussels launched an age checking app. Hackers took 2 minutes to break it

https://www.politico.eu/article/eu-brussels-launched-age-checking-app-hackers-say-took-them-2-min...
83•axbyte•12h ago•62 comments

The Work Runs on Different Maps

https://yusufaytas.com/the-work-runs-on-different-maps
27•yusufaytas•1d ago•1 comments

Tim Cook Stepping Down

https://www.macrumors.com/2026/04/20/tim-cook-stepping-down/
34•schappim•57m ago•3 comments

I learned Unity the wrong way

https://darkounity.com/blog/how-i-learned-unity-the-wrong-way
110•lelanthran•4d ago•45 comments

Not buying another Kindle

https://www.androidauthority.com/amazon-kindle-2026-3657863/
238•mikhael•6h ago•198 comments

Figma's woes compound with Claude Design

https://martinalderson.com/posts/figmas-woes-compound-with-claude-design/
80•martinald•11h ago•68 comments

Sauna effect on heart rate

https://tryterra.co/research/sauna-effect-on-heart-rate
319•kyriakosel•7h ago•173 comments

OpenClaw isn't fooling me. I remember MS-DOS

https://www.flyingpenguin.com/build-an-openclaw-free-secure-always-on-local-ai-agent/
250•feigewalnuss•13h ago•283 comments

OpenAI ad partner now selling ChatGPT ad placements based on "prompt relevance"

https://www.adweek.com/media/exclusive-leaked-deck-reveals-stackadapts-playbook-for-chatgpt-ads/
8•jlark77777•12m ago•0 comments

Show HN: Alien – Self-hosting with remote management (written in Rust)

83•alongub•6h ago•29 comments