Softmax: Why neural networks need non-linearity? life isn't straight-line simple

https://blog.sparsh.dev/softmax-activation-function/

9•sparshrestha•1d ago

Comments

sparshrestha•1d ago

Math functions that calculate the weighted sum of a neuron's inputs and add a bias to give non-linearity to the neuron's output.

microtonal•44m ago

It doesn't though. Wx + b is an affine transformation, which is just a linear transformation + translation: https://en.wikipedia.org/wiki/Affine_transformation
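A minimal sketch of why this matters, assuming small hand-picked weights (W1, b1, W2, b2 and x are all hypothetical values chosen for illustration): stacking two affine layers with no activation in between collapses into a single affine map, so depth alone adds no non-linearity.

```python
def affine(W, b, x):
    """Return W x + b for a matrix W (list of rows) and vectors b, x."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) + b_i
            for row, b_i in zip(W, b)]

def compose(W2, b2, W1, b1):
    """Collapse x -> W2 (W1 x + b1) + b2 into one affine map (W, b)."""
    cols = list(zip(*W1))
    # W = W2 W1
    W = [[sum(a * c for a, c in zip(row, col)) for col in cols] for row in W2]
    # b = W2 b1 + b2
    b = [sum(a * c for a, c in zip(row, b1)) + b2_i for row, b2_i in zip(W2, b2)]
    return W, b

# Hypothetical weights and input, not taken from the article.
W1, b1 = [[1.0, 2.0], [0.0, -1.0]], [0.5, 1.0]
W2, b2 = [[2.0, 0.0], [1.0, 3.0]], [-1.0, 0.0]
x = [3.0, -2.0]

two_layers = affine(W2, b2, affine(W1, b1, x))  # layer 2 applied to layer 1
W, b = compose(W2, b2, W1, b1)
one_layer = affine(W, b, x)                     # single equivalent layer

print(two_layers, one_layer)
```

Both calls give the same vector, which is the sense in which Wx + b on its own cannot supply the non-linearity.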

microtonal•52m ago

This is a weird post: it talks about non-linear functions, but then presents softmax as one. Softmax is rarely used as a direct non-linearity inside a neural network [1]; it is mostly used in the last layer as softmax regression, which has a linear decision boundary. You can easily show this in the two-class case (logistic function). The decision boundary is σ(a) = 0.5, where a = wx + b, so we have

    1/(1+e^-a) = 0.5

It can be shown trivially that -a must be 0 (since e^0 = 1), so the decision boundary is wx + b = 0, which is linear.
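The same argument can be checked numerically. This is only a sketch under assumed values: the weights w, the bias b, and the test points are hypothetical. It shows that thresholding the logistic output at 0.5 gives exactly the same labels as checking the sign of wx + b.

```python
import math

def sigmoid(a):
    """Logistic function 1 / (1 + e^-a)."""
    return 1.0 / (1.0 + math.exp(-a))

def logit(w, b, x):
    """Affine score a = w . x + b."""
    return sum(w_i * x_i for w_i, x_i in zip(w, x)) + b

# Hypothetical parameters and points, chosen for illustration only.
w, b = [2.0, -1.0], 0.5
points = [[0.0, 0.0], [1.0, 3.0], [-1.0, -1.0], [0.25, 1.0]]

scores = [logit(w, b, x) for x in points]
by_probability = [sigmoid(a) >= 0.5 for a in scores]  # threshold the output
by_hyperplane = [a >= 0.0 for a in scores]            # side of w . x + b = 0

print(scores)
print(by_probability == by_hyperplane)
```

The last point lies exactly on the line 2*x1 - x2 + 0.5 = 0, where the logistic output is 0.5.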

From the title I'd expect the article to show that softmax classifiers have linear decision boundaries, and to use that as motivation for introducing a non-linearity in a hidden layer.

[1] You could of course argue that softmax as used in attention is a non-linearity in the attention layer, but it is used differently than a direct application of a non-linearity like ReLU, GELU, etc. to an affine transformation.
