QuickDigest AI facilitates seamless interaction with various data formats, real-time web search, and creative image generation for advertising
-
Updated
Jun 19, 2024 - Python
QuickDigest AI facilitates seamless interaction with various data formats, real-time web search, and creative image generation for advertising
An interactive AI voice agent that can capture and transcribe speech in real-time, generate intelligent responses using the DeepSeek R1 (7B model) AI, and convert the responses back to natural speech for immediate playback. The agent maintains conversation context and supports cross-platform usage on macOS, Linux, and Windows.
DiscordNPC lets you interact with ChatGPT through a Discord voice channel, enabling a natural conversation.
PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-podcasts (Project for the AssemblyAI Winter 2022 Hackathon).
Retrieval Augmented Generation (RAG) on audio data with LangChain
Audio transcription UI for OpenAI Whisper, GPT4o Transcribe and AssemblyAI APIs
TagGPT: A simple ChatGPT based multimodal dialog generation engine that can "see/draw" and "hear/speak"
Transcription and translation scripts for Lex Fridman podcast about DeepSeek, at 2025-02-03
Transform podcast listening with our Podcast Summarizer Project! This innovative tool transcribes audio, extracts key content, and provides user-friendly summaries. The project utilizes AssemblyAI and Listen Notes APIs for transcription and episode details. Simply input an episode ID, click "Download Episode Summary," and experience podcast content
Python-based system designed to transcribe audio files, split the transcripts into manageable chunks, create text embeddings using HuggingFace models, and employ advanced question-answering models for retrieval-based QA.
A powerful Speech-to-Text API built with Django REST Framework and AssemblyAI. Textor-AI provides enterprise-grade transcription capabilities with advanced features like multi-language support, real-time status tracking, and comprehensive transcription management.
Imagine an application that autonomously take down notes for you during meetings, lectures, and conversations. Check this out...
VirtuAI Helper is a Python AI program that executes scripts based on user input, converses using OpenAI’s GPT-3, controls multimedia, navigates websites, and accepts text/voice inputs. It integrates VoiceVox Engine, a Japanese text-to-speech software with over 60 text-to-speech models you can choose.
A basic web-app for image classification using Streamlit and TensorFlow.
Advanced speech-to-text platform leveraging AssemblyAI's powerful API for accurate audio transcription and analysis.
Murf AI's 30 Days of AI Voice Agents Challenge
A webapp project for lablab.ai hackathon
RealTimeTranscriber is an application that leverages the AssemblyAI platform to perform real-time transcription of audio input.
Python Speech-To-Text projects using AssemblyAI API
Add a description, image, and links to the assemblyai topic page so that developers can more easily learn about it.
To associate your repository with the assemblyai topic, visit your repo's landing page and select "manage topics."