# speech-commands

Here are 21 public repositories matching this topic...

YuanGongND / ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

audio deep-learning pytorch representation-learning audio-classification keyword-spotting speech-commands speech-classification

Updated May 21, 2023
Jupyter Notebook

Audio-WestlakeU / audiossl

A library built for easier audio self-supervised training and downstream task evaluation

pytorch audio-classification audioset nsynth speech-commands audio-datasets self-supervised-learning voxceleb1 urbansound8k pytorch-lightning audio-representation audio-self-supervised-learning audio-pretraining

Updated Aug 27, 2024
Python

dobby-seo / Wav2Keyword

Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It reports state-of-the-art results on the Speech Commands dataset, V1 and V2.

transfer-learning keyword-spotting fine-tuning state-of-the-art kws speech-commands

Updated Jan 11, 2023
Python

nyumaya / nyumaya_audio_recognition

Classify audio with neural nets on embedded systems like the Raspberry Pi

raspberry-pi machine-learning embedded-systems hotword-detection keyword-spotting audio-recognition wake-word-detection speech-commands hotword

Updated Apr 10, 2024
Python

philsyn / DiffWave-unconditional

PyTorch reimplementation of DiffWave unconditional generation: a high-quality waveform synthesizer.

waveform speech pytorch speech-synthesis waveform-generation speech-commands waveform-generator diffwave

Updated Apr 13, 2021
Python

ace19-dev / tensorflow-speech-recognition-challenge

Kaggle Competitions: TensorFlow Speech Recognition Challenge

audio tensorflow kaggle-competition speech-recognition speech-commands

Updated Mar 4, 2018
Python

htqin / BiFSMN

PyTorch implementation of BiFSMN, IJCAI 2022

keyword-spotting binary-neural-networks speech-commands

Updated Feb 10, 2023
Python

isadrtdinov / kws-attention

Attention-based model for keyword spotting

deep-learning pytorch attention-mechanism keyword-spotting speech-commands

Updated Aug 9, 2021
Python

shitian-ni / speech-recognition-transfer-learning

Speech command recognition with DenseNet, using transfer learning from UrbanSound8K, in Keras/TensorFlow

tensorflow keras kaggle speech-recognition densenet transfer-learning dilatednet speech-commands

Updated Jan 19, 2018
Python

usc-sail / gen-dmcca

Generalized Deep Multiset Canonical Correlation Analysis for Multiview Learning of Speech Representations

multiview-learning speech-commands speech-command-recognition deep-multiset-cca speech-embeddings

Updated Apr 9, 2019
Python

danieleninni / small-footprint-keyword-spotting

Effective processing pipeline and advanced neural network architectures for small-footprint keyword spotting

data-science machine-learning deep-learning cnn speech-recognition rnn resnet attention-mechanism audio-classification keyword-spotting conformer speech-commands

Updated Mar 2, 2023
Python

manojsvgit / Voice_Based_Email_For_Blind

A Python-based application designed specifically for visually impaired users, enabling them to seamlessly send and receive emails using intuitive speech commands. This innovative solution enhances accessibility and independence by allowing users to manage their email communication effortlessly, utilizing voice recognition technology to ensure a us.

machine-learning natural-language-processing accessibility voice-recognition speech-to-text user-experience assistive-technology email-client command-line-interface python-development email-automation speech-commands voice-user-interface python-libraries project-for-visually-impaired

Updated Oct 9, 2024
Python

tuanio / audio-classification

Audio Classification with AlexNet and Speech Commands dataset

pytorch speech-recognition alexnet audio-classification speech-commands pytorch-lightning

Updated May 5, 2022
Python

mryndzionek / kws_cli

Small footprint, standalone, zero dependency, offline keyword spotting (KWS) CLI tool.

cli lightweight machine-learning voice-commands pytorch speech-recognition machinelearning hotword-detection keyword-spotting c-language wake-word-detection onnx kws speech-commands hotword-detector word-spotting tinyml wake-word edgeml

Updated Aug 4, 2024
C

epfluegel / TalkMaths

A Vocola 2 (DNS) extension for creating and editing mathematics (in LaTeX) by voice, using a ZOO interface (Zoomable Online Outliner) such as WorkFlowy or Dynalist.

latex voice-commands speech-recognition workflowy dynalist speech-commands spoken-digits vocola spoken-maths

Updated Dec 26, 2020

aminul-huq / Speech_Command_Recognition

Multi-class classification of speech command data. The dataset comes from the Kaggle speech recognition challenge, and the implementation uses PyTorch.

speech speech-recognition kaggle-dataset multiclass-classification speech-commands pytorch-implementation speech-command-recognition

Updated Jun 21, 2020
Python

reddiedev / 197z-kws

Zero-shot keyword spotting with the KWS test dataset using ImageBind

zero-shot kws pytorch-audio speech-commands imagebind

Updated Jun 12, 2023
Jupyter Notebook

Akash100997 / Keyword_Spotting

This project is about spotting a keyword from the Google Speech Commands Dataset.

tensorflow keras mfcc speech-commands

Updated Jun 28, 2021
Python

hoang1007 / FRIDAY

Female Replacement Intelligent Digital Assistant Youth

android assistant speech-recognition speech-commands

Updated Jun 27, 2023

Bill2015 / Speech-Chinese-Model-Agent

A model-based agent for Chinese speech recognition.

agent rule-based model-based speech-commands chines speech-command-recognition

Updated Jun 26, 2021
Python
