Stars: 15
Forks: 5
Language: Python
Last Updated: Apr 03, 2023
Similar Repos
Language | Stars | Description | Updated At | |
---|---|---|---|---|
Python | 2 | Implementation of the paper "Problem-agnostic speech embeddings for multi-speaker text-to-speech with SampleRNN". | Feb 09, 2022 | |
Python | 160 | PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation | Apr 23, 2023 | |
Python | 40 | Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker … | May 30, 2023 | |
Python | 82 | PyTorch re-implementation of Speech-Transformer | Sep 24, 2022 | |
Python | 65 | Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊ | Nov 30, 2022 | |
Python | 51 | (pytorch) multi speaker TTS, | Jun 06, 2022 | |
Python | 16 | SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems | May 01, 2023 | |
Python | 8 | PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition | May 29, 2022 | |
Python | 53 | Transformer implementation specialized in speech recognition tasks using Pytorch. | Apr 20, 2023 | |
Python | 28 | Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking | Apr 17, 2023 | |
Python | 3 | Text to Speech (TTS) with synthetical speaker embeddings | Aug 21, 2021 | |
Python | 7 | An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper | Mar 28, 2023 | |
None | 99 | SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model | May 12, 2023 | |
Python | 135 | PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised … | Apr 20, 2023 | |
Python | 51 | A multi-speaker, multilingual speech generation tool | Apr 25, 2023 | |
Python | 14 | Official pytorch implementation of paper "Remote Sensing Image Captioning Based on Multi-Layer Aggregated Transformer" | May 11, 2023 | |
Python | 56 | Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". | May 04, 2023 | |
Python | 720 | A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese. | May 17, 2023 | |
Python | 663 | PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020) | May 10, 2023 | |
Python | 2 | PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020) | Apr 01, 2024 | |
Jupyter Notebook | 2 | Speech Transformer Network Implementation for Automatic Speech Recognition | Oct 29, 2023 | |
Python | 2 | 🤖Transformer TTS: Implementation of a non-autoregressive Transformer-based text-to-speech. | May 24, 2023 | |
None | 4 | speaker recognition / speaker verification models in pytorch implementation | May 09, 2022 | |
Python | 81 | Implementation of Swin Transformer with Pytorch | Apr 10, 2023 | |
JavaScript | 6 | Build custom Speech to Text model with speaker diarization capabilities | Jul 29, 2022 | |
Python | 47 | Implementation of Multi speaker TTS | Dec 23, 2022 | |
Python | 21 | Pytorch implementation of NeurIPS'22 paper "Hierarchical Graph Transformer with Adaptive Node Sampling" | Apr 03, 2023 | |
Python | 99 | PyTorch reimplementation of the paper "MaxViT: Multi-Axis Vision Transformer" [arXiv 2022]. | Oct 13, 2022 | |
Jupyter Notebook | 2 | Pytorch implementation of Transformer | Feb 17, 2023 | |
Python | 19 | Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch | Jan 29, 2023 | |
Python | 231 | implementation of music transformer with pytorch (ICLR2019) | Apr 26, 2023 | |
Python | 23 | A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning | Apr 20, 2023 | |
Python | 3 | PyTorch implementation of TacoSpawn, Speaker Generation | May 07, 2022 | |
Python | 44 | Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer" | Sep 05, 2022 | |
Python | 919 | 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech. | Aug 14, 2022 | |
None | 4 | 🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech. | Apr 23, 2023 | |
Python | 104 | Implementation of Deformable Attention in Pytorch from the paper "Vision Transformer with Deformable Attention" | Aug 11, 2022 | |
Jupyter Notebook | 3 | Pytorch implementation for CRNN paper for text recognition from images | Aug 02, 2022 | |
Python | 36 | The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis” | Oct 12, 2022 | |
None | 31 | Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch | Jul 15, 2022 | |
Python | 10 | Transformer for text summarization implemented in pytorch | Jun 18, 2021 | |
Python | 7 | PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper | Mar 27, 2023 | |
None | 2 | A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper. | Jan 24, 2021 | |
Python | 89 | Official PyTorch implementation of our AAAI22 paper: TransMEF: A Transformer-Based Multi-Exposure Image Fusion Framework via … | Jun 02, 2022 | |
Jupyter Notebook | 4 | PyTorch implementation of Levenshtein Transformer | Aug 09, 2022 | |
Jupyter Notebook | 2 | Minimalistic PyTorch implementation of transformer | Jan 21, 2023 | |
Jupyter Notebook | 172 | Text to Speech with PyTorch (English and Mongolian) | Apr 29, 2023 | |
Python | 4 | PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper | Oct 20, 2022 | |
Python | 3 | An implementation using PyTorch for the paper "Trusted Multi-View Classification" | Sep 27, 2022 | |
Python | 2 | Audio process homework, including speaker recognition & speech to text. | Jan 07, 2020 |