Open Source Model, GrafoConnect+vMF destroy completly standard transformer in NLP, Biology, Materials. This is a new attention ->
https://zenodo.org/records/20446506
This type of attention work different of transformers, this VAL, MAE, etc starts low but in epoch 3 jump insane to close the train keep the overfitting controled. I'm sorry if anything isn't corret but i have 0 background ate code even a HIGH school man (6 Class), all MY IDEIA (GRAFO CONNECT + vMF) where i put in PAINT my ideas and transfere to AI, u can reply the results, THEY are not fake.