torch transformers datasets pillow soundfile sentencepiece