
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models

https://m-arriola.com/bd3lms/
72•t55•9mo ago

Comments

notrealyme123•9mo ago
This was posted here already a few weeks ago.
holoduke•9mo ago
Whenever I try to read and understand this paper, I feel extremely dumb. I have my degree in CS, but this is just too complex for me to understand.
AlexCoventry•9mo ago
Ask ChatGPT o3 about anything you don't understand, then ask it about anything in its responses you don't understand. Keep drilling down until you do understand. It takes patience, but you can learn a lot very fast this way.
echelon•9mo ago
ChatGPT o3 understands the latest literature and isn't going to hallucinate weird details or make incorrect analogies or math?

I'd worry about learning the wrong things.

Ey7NFZ3P0nzAe•9mo ago
I disagree. It's all about rephrasing information that is in the paper. Possibly a few other papers too.
vessenes•9mo ago
o3 with a PDF or in deep research mode is excellent. Especially if you’re disciplined about sticking to what’s in the research. But really, it’s excellent, better than benchmarks indicate, I’d say.
AlexCoventry•9mo ago
Actually, in the past few days o3 has proven fairly unreliable for me. I've gone back to o1-pro. But when I wrote the above it was reasonably reliable.
evertedsphere•9mo ago
an undergraduate degree in a field is not enough to understand recent research in a specialised subfield of a subfield and you shouldn't beat yourself up over that

there's nothing wrong with you, you just need the right background and you can go get that. see e.g. the fast.ai course

smrtinsert•9mo ago
Do you mean the fast.ai stable diffusion lectures? The initial series doesn't get too deep at all from what I remember.
IncreasePosts•9mo ago
Might want to study some stats or other math.
tippytippytango•9mo ago
I wouldn’t beat yourself up over it. Very few papers can be understood without reading a significant amount of the neighboring literature and the history of how that work came to be. There are norms and customs and a kind of academic language in every community that you won’t be able to see unless you’ve read a lot from that community. Even if you have the right math level it’s tricky.

A single paper is part of a conversation, not something that stands alone. Trying to read one random paper is like finding a 1000-page thread on an obscure topic that has been running for 10+ years and reading only the last page. It won’t make any sense without reading back a ways.

nh23423fefe•9mo ago
Read the references depth-first until the leaves are obvious!
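
A minimal sketch of that reading order, assuming a hypothetical citation graph stored as a dictionary. The paper labels, the `references` mapping, and the `understood` set are all made up for illustration and are not the actual reference list of any paper. The function walks the references depth-first and lists each paper only after everything it cites, stopping at anything you already understand (the "obvious" leaves).

```python
from typing import Dict, List, Set


def reading_order(
    paper: str,
    references: Dict[str, List[str]],
    understood: Set[str],
) -> List[str]:
    """Return the papers to read, prerequisites first and `paper` last."""
    order: List[str] = []
    visited: Set[str] = set()

    def visit(current: str) -> None:
        # Skip papers already queued (also guards against citation cycles)
        # and papers that are already "obvious" to the reader.
        if current in visited or current in understood:
            return
        visited.add(current)
        # Go as deep as possible into the references before reading this one.
        for ref in references.get(current, []):
            visit(ref)
        order.append(current)

    visit(paper)
    return order


# Hypothetical citation graph: each key cites the papers in its list.
references = {
    "target-paper": ["background-a", "background-b"],
    "background-a": ["foundations"],
    "background-b": [],
    "foundations": [],
}

# Assume the reader already understands "background-b".
plan = reading_order("target-paper", references, understood={"background-b"})
print(plan)
```

With this toy graph the plan starts at the deepest unfamiliar reference and ends with the paper you originally wanted to read.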
blurbleblurble•9mo ago
Wow.

I can't wait to see ideas from the diffusion image generation world (like controlnet) work their way into language models.

joejoo•9mo ago
There are already a few models that are diffusion-based.
soulofmischief•9mo ago
I've built diffusion-based text models. It's old hat and not necessarily the most performant way to generate text. However, it does produce interesting results, and I'd love to test some ideas at scale.
gitroom•9mo ago
Yeah, I always end up lost in papers like this too, even with my CS degree. The research keeps leveling up nonstop.