Skip to content

Commit

Permalink
update SLP-P12.md
Browse files Browse the repository at this point in the history
  • Loading branch information
abikaki committed Oct 20, 2024
1 parent 2d82d9d commit 75e35a0
Showing 1 changed file with 8 additions and 2 deletions.
10 changes: 8 additions & 2 deletions sections/2024/main/SLP-P12.md
Original file line number Diff line number Diff line change
Expand Up @@ -34,7 +34,7 @@

## Robust Speech Recognition and Adaptation

![Section Papers](https://img.shields.io/badge/Section%20Papers-0-42BA16) ![Preprint Papers](https://img.shields.io/badge/Preprint%20Papers-0-b31b1b) ![Papers with Open Code](https://img.shields.io/badge/Papers%20with%20Open%20Code-0-1D7FBF) ![Papers with Video](https://img.shields.io/badge/Papers%20with%20Video-0-FF0000)
![Section Papers](https://img.shields.io/badge/Section%20Papers-23-42BA16) ![Preprint Papers](https://img.shields.io/badge/Preprint%20Papers-13-b31b1b) ![Papers with Open Code](https://img.shields.io/badge/Papers%20with%20Open%20Code-3-1D7FBF) ![Papers with Video](https://img.shields.io/badge/Papers%20with%20Video-0-FF0000)

| **Title** | **Repo** | **Paper** | **Video** |
|-----------|:--------:|:---------:|:---------:|
Expand All @@ -54,4 +54,10 @@
| Synthetic Conversations Improve Multi-Talker ASR | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446589-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446589) | :heavy_minus_sign: |
| Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition | [![GitHub](https://img.shields.io/github/stars/cs20s030/stable_distillation?style=flat)](https://github.com/cs20s030/stable_distillation) | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446335-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446335) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2312.12783-b31b1b.svg)](https://arxiv.org/abs/2312.12783) | :heavy_minus_sign: |
| Towards High-Performance and Low-Latency Feature-based Speaker Adaptation of Conformer Speech Recognition Systems | [![GitHub Page](https://img.shields.io/badge/GitHub-Page-159957.svg?style=flat)](https://jjdean321.github.io/FastAdapt/) | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10448488-E4A42C.svg)](https://ieeexplore.ieee.org/document/10448488) | :heavy_minus_sign: |
| Progressive Unsupervised Domain Adaptation for ASR Using Ensemble Models and Multi-Stage Training | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10448438-E4A42C.svg)](https://ieeexplore.ieee.org/document/10448438) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2402.04805-b31b1b.svg)](https://arxiv.org/abs/2402.04805) | :heavy_minus_sign: |
| Progressive Unsupervised Domain Adaptation for ASR Using Ensemble Models and Multi-Stage Training | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10448438-E4A42C.svg)](https://ieeexplore.ieee.org/document/10448438) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2402.04805-b31b1b.svg)](https://arxiv.org/abs/2402.04805) | :heavy_minus_sign: |
| Sparsely Shared LoRA on Whisper for Child Speech Recognition | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10447004-E4A42C.svg)](https://ieeexplore.ieee.org/document/10447004) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2309.11756-b31b1b.svg)](https://arxiv.org/abs/2309.11756) | :heavy_minus_sign: |
| Cross-Speaker Encoding Network for Multi-Talker Speech Recognition | [![GitHub](https://img.shields.io/github/stars/kjw11/CSEnet-ASR?style=flat)](https://github.com/kjw11/CSEnet-ASR) | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446249-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446249) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2401.04152-b31b1b.svg)](https://arxiv.org/abs/2401.04152) | :heavy_minus_sign: |
| Max-Margin Transducer Loss: Improving Sequence-Discriminative Training Using a Large-Margin Learning Strategy | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446322-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446322) | :heavy_minus_sign: |
| Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10447240-E4A42C.svg)](https://ieeexplore.ieee.org/document/10447240) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2309.10707-b31b1b.svg)](https://arxiv.org/abs/2309.10707) | :heavy_minus_sign: |
| FusDom: Combining In-Domain and Out-of-Domain Knowledge for Continuous Self-Supervised Learning | [![GitHub](https://img.shields.io/github/stars/cs20s030/fusdom?style=flat)](https://github.com/cs20s030/fusdom) | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10448147-E4A42C.svg)](https://ieeexplore.ieee.org/document/10448147) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2312.13026-b31b1b.svg)](https://arxiv.org/abs/2312.13026) | :heavy_minus_sign: |
| AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition | :heavy_minus_sign: | [![IEEE Xplore](https://img.shields.io/badge/IEEE-10446721-E4A42C.svg)](https://ieeexplore.ieee.org/document/10446721) <br/> [![arXiv](https://img.shields.io/badge/arXiv-2403.11578-b31b1b.svg)](https://arxiv.org/abs/2403.11578) | :heavy_minus_sign: |

0 comments on commit 75e35a0

Please sign in to comment.