torch
deepspeed
sentencepiece
tensorboardX
datasets
transformers