This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
-
Updated
Dec 20, 2023 - Jupyter Notebook
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
WhisPad is a note management tool where you can write or dictate your notes using local or API AI models (supports speaker diarization). Rewrite your texts using different styles, dive in using AI, translate, summarize, create mind maps, node graphs and even quizs and flashcards based on each note. A powerful companion for researchers and students.
Exploration of different audio features and CNN-based architectures for building an effective Speech Emotion Recognition (SER) system. The goal is to improve the accuracy of detecting emotions embedded in speech signals. The repository contains code, notebook, and detailed explanations of the experiments conducted.
Whisper AI is an automated speech recognition (ASR) system. It is open source and can be access via GitHub or HuggingFace. This is the simplest way to implement Whisper AI via Github using python Google Colab Notebook.
Add a description, image, and links to the speech-processing topic page so that developers can more easily learn about it.
To associate your repository with the speech-processing topic, visit your repo's landing page and select "manage topics."