General machine learning research papers and discussions.
Created by @taylanturker
Kaplan, J., McCandlish, S., Henighan, T., et al.
Understanding these scaling laws is crucial for anyone planning to train large models. The predictability is remarkable.
Vaswani, A., Shazeer, N., Parmar, N., et al.
The paper that started it all. If you work in ML and havent read this, stop everything and read it now.