This repository contains the code accompanying the paper "Linear Transformers Are Secretly Fast Weight Programmers", published at ICML'21. It also contains the logs of all synthetic experiments.