Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
Yifan Peng
pyf98
AI & ML interests
Multimodal LLMs, Speech-to-Speech, Speech Recognition
models 48
pyf98/DPHuBERT
Updated • 4
pyf98/fisher_callhome_spanish_e_branchformer
Automatic Speech Recognition • Updated • 5
pyf98/fisher_callhome_spanish_conformer
Automatic Speech Recognition • Updated • 2
pyf98/slurp_entity_e_branchformer
Automatic Speech Recognition • Updated • 6
pyf98/aidatatang_200zh_e_branchformer_e16
Automatic Speech Recognition • Updated • 2
pyf98/librispeech_100_transducer_e_branchformer
Automatic Speech Recognition • Updated • 1
pyf98/librispeech_100_transducer_conformer
Automatic Speech Recognition • Updated • 5 • 1
pyf98/jsut_e_branchformer
Automatic Speech Recognition • Updated • 6
pyf98/aishell_ctc_e_branchformer_e12
Automatic Speech Recognition • Updated • 1
pyf98/aishell_ctc_conformer_e15_linear1024
Automatic Speech Recognition • Updated • 4 • 2
datasets 0
None public yet