Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization Paper β’ 2509.23202 β’ Published Sep 27 β’ 27 β’ 3
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper β’ 2305.07759 β’ Published May 12, 2023 β’ 36 β’ 10