facebook/bart-large has roughly 0.4 billion parameters. Therefore, even in float32 it should fit in 14 GB of VRAM.
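As a rough sanity check on that estimate, here is a back-of-the-envelope sketch. The helper `estimate_memory_gb` is hypothetical, and the "4 copies" figure for training is an assumption (weights, gradients, and two Adam moment buffers, all in float32). Activations, which depend on batch size and sequence length, are not counted, so real usage will be higher.

```python
def estimate_memory_gb(num_params, bytes_per_param=4, training=False):
    """Rough memory estimate in GB (1 GB = 1e9 bytes), excluding activations."""
    # Inference: one copy of the weights.
    # Training (assumed Adam): weights + gradients + two optimizer moment buffers.
    copies = 4 if training else 1
    return num_params * bytes_per_param * copies / 1e9


NUM_PARAMS = 0.4e9  # approximate parameter count quoted above

weights_gb = estimate_memory_gb(NUM_PARAMS)
training_gb = estimate_memory_gb(NUM_PARAMS, training=True)

print(f"float32 weights only: ~{weights_gb:.1f} GB")
print(f"float32 weights + grads + Adam states: ~{training_gb:.1f} GB")
```

Under these assumptions both numbers stay well below 14 GB, so what remains for activations is what decides whether a given batch size fits.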
Yeah. I think so, too.
When using DataParallel in a multi-GPU environment, unexpected out-of-memory errors can occur. It is somewhat more complex to set up, but using FSDP or a similar method might help avoid this.