Hi all, I have a question: fine-tuning the model "Paraformer语音识别-中文-通用-16k-离线-large-长音频版" fails with the error below. Any help would be appreciated. The error is as follows:
[31c7ad19e26c] 2023-04-15 15:33:52,255 (abs_task:1638) INFO: [valid] Batch sampler: LengthBatchSampler(N-batch=1051, batch_bins=1000, sort_in_batch=descending, sort_batch=descending)
[31c7ad19e26c] 2023-04-15 15:33:52,255 (abs_task:1640) INFO: [valid] mini-batch sizes summary: N-batch=1051, mean=13.6, min=5, max=28
[31c7ad19e26c] 2023-04-15 15:33:52,381 (trainer:283) INFO: 1/50epoch started
Traceback (most recent call last):
  File "finetune.py", line 35, in <module>
    modelscope_finetune(params)
  File "finetune.py", line 22, in modelscope_finetune
    trainer.train()
  File "/opt/conda/lib/python3.7/site-packages/modelscope/trainers/audio/asr_trainer.py", line 168, in train
    self.trainer.run()
  File "/root/FunASR/funasr/tasks/abs_task.py", line 1134, in run
    cls.main_worker(args)
  File "/root/FunASR/funasr/tasks/abs_task.py", line 1443, in main_worker
    distributed_option=distributed_option,
  File "/root/FunASR/funasr/train/trainer.py", line 298, in run
    distributed_option=distributed_option,
  File "/root/FunASR/funasr/train/trainer.py", line 603, in train_one_epoch
    retval = model(**batch)
  File "/opt/conda/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1110, in _call_impl
    return forward_call(*input, **kwargs)
  File "/root/FunASR/funasr/models/e2e_asr_paraformer.py", line 1075, in forward
    encoder_out, encoder_out_lens, text, text_lengths
  File "/root/FunASR/funasr/models/e2e_asr_paraformer.py", line 524, in _calc_att_loss
    ignore_id=self.ignore_id)
ValueError: too many values to unpack (expected 4)
This issue has been fixed. Please update funasr: https://github.com/alibaba-damo-academy/FunASR#installation
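
For anyone who runs into the same message: `ValueError: too many values to unpack (expected 4)` is what Python raises when a call returns more values than there are names on the left-hand side of the assignment. The traceback suggests this happens at the call ending in `ignore_id=self.ignore_id)` inside `_calc_att_loss`, which would be consistent with a caller and a helper coming from mismatched versions. The snippet below is only a generic sketch of that failure mode. The helper names (`make_targets_v1`, `make_targets_v2`, `unpack_four`) are hypothetical and are not FunASR code.

```python
def make_targets_v1():
    # Hypothetical older helper: returns exactly four values.
    return "ys_in", "ys_out", "in_lens", "out_lens"


def make_targets_v2():
    # Hypothetical newer helper: returns a fifth value as well.
    return "ys_in", "ys_out", "in_lens", "out_lens", "extra"


def unpack_four(helper):
    """Unpack the helper's result into four names, as the failing call site does."""
    try:
        a, b, c, d = helper()
    except ValueError as exc:
        # Raised when the helper returns a different number of values than 4.
        return f"ValueError: {exc}"
    return "ok"


if __name__ == "__main__":
    print(unpack_four(make_targets_v1))  # caller and helper agree
    print(unpack_four(make_targets_v2))  # caller expects 4, helper returns 5
```

If the message persists after updating, it may be worth checking that Python is importing the updated copy of the package (the traceback above loads it from `/root/FunASR`) rather than an older install.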