
Has anyone run into the following problem when running speech inference with the ModelScope Cantonese model?

Traceback (most recent call last):
  File "kantts/bin/text_to_wav.py", line 234, in <module>
    args.lang,
  File "kantts/bin/text_to_wav.py", line 161, in text_to_wav
    am_infer(symbols_file, am_ckpt, output_dir, se_file)
  File "/root/KAN-TTS/kantts/bin/infer_sambert.py", line 222, in am_infer
    line[1], fsnet, ling_unit, device, se=se
  File "/root/KAN-TTS/kantts/bin/infer_sambert.py", line 87, in am_synthesis
    [inputs_sy, inputs_tone, inputs_syllable, inputs_ws], dim=-1
RuntimeError: stack expects each tensor to be equal size, but got [5] at entry 0 and [21] at entry 1
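
Reading the message against the last frame: the stack call receives the list [inputs_sy, inputs_tone, inputs_syllable, inputs_ws], so entry 0 (inputs_sy) appears to have 5 elements and entry 1 (inputs_tone) 21. A minimal sketch of a check that could be run on the four feature sequences before they are stacked is shown below. It is plain Python rather than KAN-TTS code: the helper name find_length_mismatch is hypothetical, and the lengths used for inputs_syllable and inputs_ws are made up for illustration, since the traceback does not report them.

```python
# Hypothetical diagnostic helper (not part of KAN-TTS): compare the lengths
# of the per-token feature sequences before they are stacked together.

def find_length_mismatch(features):
    """Return {name: length} if the sequences differ in length, else {}."""
    lengths = {name: len(seq) for name, seq in features.items()}
    if len(set(lengths.values())) <= 1:
        return {}
    return lengths


# Lengths 5 and 21 come from the error message (entries 0 and 1).
# The lengths of the last two sequences are assumptions for this example.
example = {
    "inputs_sy": [0] * 5,
    "inputs_tone": [0] * 21,
    "inputs_syllable": [0] * 21,
    "inputs_ws": [0] * 21,
}

mismatch = find_length_mismatch(example)
if mismatch:
    print("Feature sequences differ in length:", mismatch)
```

Running a check like this on the real inputs would show which of the four sequences disagree, which narrows the problem to how that particular feature is produced for the input text.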

Lucidly 2024-01-22 17:17:00 52 0
0 answers