---
tags:
- molecular
- chemistry
- tokenizer
---

# MoleBERT Tokenizer

This model is a molecular tokenizer trained on Hf-based TMC.

Trained from scratch on 150k training molecules for 30 epochs.

Code from: https://github.com/Rich-XGK/GTMGC

## Usage

```python
from models.mole_bert_tokenizer import MoleBERTTokenizerCollator, MoleBERTTokenizer

tokenizer = MoleBERTTokenizer.from_pretrained(...)