
My kingdom for renaming this paper to something like "Tensor Product Attention is a Memory-Efficient Approach for Long-Sequence Language Modeling"


If you don’t like the title, wait till you see the acronym: "… we introduce the Tensor ProducT ATTenTion Transformer (T6), a new model architecture…"


There is a famous transformer model named T5 from Google, and also S4, S5, and S6 (Mamba) in the LLM space, so this kind of naming is not unusual.


Yes, but T5 is at least a normal acronym: Text-To-Text Transfer Transformer (albeit a bit forced)


That it's not unusual tells us that too many researchers in the field are chasing citations and fame at the expense of doing quality work.


Mm. That, or they all share a sense of humour/in-jokes: I'm sure I'm not the only one here who immediately thought of "GOTO is all you need" and "Attention considered harmful".


Right. But then, what they did to the title to make it collapse down to T6 is even worse than what I did to my nickname back in high school to squeeze in a long-forgotten in-joke about our city's municipal sanitation department (MPO).


Ironically, both are true!


"... is all you need" isn't unusual either, and yet GGP isn't happy about it (and I understand why)


I propose T-POT (Tensor Product attentiOn Transformer)


TPOT already exists in the ML field; it was a somewhat popular AutoML package a few years ago, if I remember correctly, and it still seems to be around: https://github.com/EpistasisLab/tpot2



