
Is there anything wrong with this modelscope-funasr configuration file?

# network architecture
# encoder related
encoder: sanm
encoder_conf:
    output_size: 512
    attention_heads: 4
    linear_units: 2048
    num_blocks: 50
    dropout_rate: 0.1
    positional_dropout_rate: 0.1
    attention_dropout_rate: 0.1
    input_layer: pe
    pos_enc_class: SinusoidalPositionEncoder
    normalize_before: true
    kernel_size: 11
    sanm_shfit: 0
    selfattention_layer_type: sanm

# decoder
decoder: paraformer_decoder_sanm
decoder_conf:
    attention_heads: 4
    linear_units: 2048
    num_blocks: 16
    dropout_rate: 0.1
    positional_dropout_rate: 0.1
    self_attention_dropout_rate: 0.1
    src_attention_dropout_rate: 0.1
    att_layer_num: 16
    kernel_size: 11
    sanm_shfit: 0

# frontend related
frontend: wav_frontend
frontend_conf:
    fs: 16000
    window: hamming
    n_mels: 40
    frame_length: 25
    frame_shift: 10
    lfr_m: 7
    lfr_n: 6

# model: funasr.models.paraformer.model:Paraformer
model: paraformer
model_conf:
    ctc_weight: 0.0
    lsm_weight: 0.1
    length_normalized_loss: true
    predictor_weight: 1.0
    predictor_bias: 1
    sampling_ratio: 0.75

accum_grad: 4
grad_clip: 5
max_epoch: 29
val_scheduler_criterion:
    - valid
    - loss
best_model_criterion:
-   - valid
    - loss
    - min
keep_nbest_models: 5

optim: adam
optim_conf:
    lr: 0.0005
scheduler: warmuplr
scheduler_conf:
    warmup_steps: 30000

specaug: specaug_lfr
specaug_conf:
    apply_time_warp: false
    time_warp_window: 5
    time_warp_mode: bicubic
    apply_freq_mask: true
    freq_mask_width_range:
    - 0
    - 30
    lfr_rate: 6
    num_freq_mask: 1
    apply_time_mask: true
    time_mask_width_range:
    - 0
    - 12
    num_time_mask: 1

predictor: cif_predictor_v2
predictor_conf:
    idim: 512
    threshold: 1.0
    l_order: 1
    r_order: 1
    tail_threshold: 0.45

dataset_conf:
    data_names: speech,text
    data_types: kaldi_ark,text
    filter_conf:
        speech_length_min: 50
        speech_length_max: 2000
        token_speech_ratio_min: 0.2
        token_speech_ratio_max: 10
    shuffle: True
    shuffle_conf:
        shuffle_size: 2048
        sort_size: 1204
    batch_conf:
        batch_type: token
        batch_size: 8000
    num_workers: 8

log_interval: 100
normalize: global_mvn
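
Several values in this config depend on one another, so a quick cross-check may help narrow down where a problem could be. The following is a minimal sketch and not part of FunASR: the dictionary just re-types the relevant values from the config, and `check_config` is a hypothetical helper. It assumes that LFR stacks `lfr_m` consecutive frames (so the encoder input dimension is `n_mels * lfr_m`) and keeps one stacked frame every `lfr_n` input frames, that `specaug_conf.lfr_rate` should equal `lfr_n`, that the predictor `idim` should equal the encoder `output_size`, and that `att_layer_num` should not exceed the decoder `num_blocks`.

```python
# Values re-typed from the config in this post; nothing is read from FunASR itself.
config = {
    "encoder_conf": {"output_size": 512},
    "decoder_conf": {"num_blocks": 16, "att_layer_num": 16},
    "frontend_conf": {"n_mels": 40, "frame_shift": 10, "lfr_m": 7, "lfr_n": 6},
    "specaug_conf": {"lfr_rate": 6},
    "predictor_conf": {"idim": 512},
}


def check_config(cfg):
    """Hypothetical helper: return derived values and a list of detected issues."""
    fe = cfg["frontend_conf"]
    derived = {
        # Assumption: LFR concatenates lfr_m frames into one feature vector.
        "encoder_input_dim": fe["n_mels"] * fe["lfr_m"],
        # Assumption: LFR keeps one stacked frame every lfr_n input frames.
        "output_frame_shift_ms": fe["frame_shift"] * fe["lfr_n"],
    }

    issues = []
    # Assumption: the CIF predictor consumes the encoder output directly.
    if cfg["predictor_conf"]["idim"] != cfg["encoder_conf"]["output_size"]:
        issues.append("predictor idim differs from encoder output_size")
    # Assumption: SpecAug's lfr_rate is meant to mirror the frontend's lfr_n.
    if cfg["specaug_conf"]["lfr_rate"] != fe["lfr_n"]:
        issues.append("specaug lfr_rate differs from frontend lfr_n")
    # Assumption: attention layers are a subset of the decoder blocks.
    dec = cfg["decoder_conf"]
    if dec["att_layer_num"] > dec["num_blocks"]:
        issues.append("att_layer_num exceeds decoder num_blocks")
    return derived, issues


derived, issues = check_config(config)
print(derived)
print(issues or "no inconsistencies found by these checks")
```

Under these assumptions the config is internally consistent, with a 280-dimensional encoder input and one output frame every 60 ms. One point that may be worth verifying separately: the published Paraformer recipes commonly use `n_mels: 80` (a 560-dimensional input), so a pretrained checkpoint or CMVN file built for that setup would probably not match `n_mels: 40`.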

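The optimizer settings (`lr: 0.0005`, `warmup_steps: 30000`) can be made more concrete by tabulating the learning rate over training. This sketch assumes that `warmuplr` follows the Noam-style rule used by ESPnet's `WarmupLR`, that is, lr(step) = lr * warmup_steps^0.5 * min(step^-0.5, step * warmup_steps^-1.5). Whether the FunASR version in use matches this exactly should be checked against its source. `warmup_lr` is a hypothetical helper name.

```python
def warmup_lr(step, base_lr=0.0005, warmup_steps=30000):
    """Assumed Noam-style schedule: linear ramp up to base_lr, then step**-0.5 decay."""
    if step < 1:
        raise ValueError("step must be >= 1")
    scale = min(step ** -0.5, step * warmup_steps ** -1.5)
    return base_lr * warmup_steps ** 0.5 * scale


# Print the learning rate at a few points before, at, and after the warmup boundary.
for step in (1, 1000, 15000, 30000, 120000):
    print(step, warmup_lr(step))
```

Under this assumption the rate climbs linearly to 0.0005 at step 30000 and has halved again by step 120000.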
三分钟热度的鱼 2024-01-17 18:48:23