FlaxBigBird for Dutch language
Currently, there are only a very limited amount of long-range sequence models for other languages than English on the hub. The goal of this project is to create a strong Dutch FlaxBigBird model.
Model
A randomly initialized FlaxBigBird model.
Datasets
Available training scripts
Scripts to pre-train Flax models are here: https://github.com/huggingface/transformers/tree/master/examples/flax/language-modeling
Scripts for fine-tuning and evaluation for FlaxBigBird just got released here: https://github.com/huggingface/transformers/tree/master/examples/research_projects/jax-projects/big_bird Thanks to @vasudevgupta
(Optional) Desired project outcome
The desired project output is a strong Dutch FlaxBigBird model in Dutch. For downstream tasks that require long sequences (e.g. long text classification).
