Toward Joint Language Modeling for Speech Units and Text Paper • 2310.08715 • Published Oct 12, 2023
CCNet: Extracting High Quality Monolingual Datasets from Web Crawl Data Paper • 1911.00359 • Published Nov 1, 2019
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech Paper • 2205.12446 • Published May 25, 2022