In this third video of our Transformer series, we’re diving deep into the concept of Linear Transformations in Self Attention ...
This is a subject I struggled with the first time I took it. Ironically, that was the engineering version. It wasn't until I took the rigorous, axiomatic version that everything clicked.