OpenAI Whisper Transcription and Summary Tool

Overview

This project is an AI-powered transcription and summarization tool built using Streamlit for the front end and OpenAI Whisper for audio transcription. The application allows users to upload or record audio files, which are then transcribed into text, summarized, and analyzed for sentiment. It also integrates various AI tools and APIs to provide fact-checking, math operations, and research capabilities from multiple sources such as Wikipedia, DuckDuckGo, YouTube, and PubMed.

Features

  1. Audio Upload and Recording:

    • Users can either upload audio files or directly record audio through the interface.
    • Supported audio formats: mp3, mp4, mpeg, mpga, m4a, wav, webm (see the Pydub sketch after this list).
  2. OpenAI Whisper for Transcription:

    • Audio is transcribed with OpenAI Whisper, and the raw transcript is automatically corrected for grammar and spelling (see the transcription sketch after this list).
  3. AI-Powered Summary:

    • Transcriptions are automatically summarized with the OpenAI GPT-4 model using a map-reduce summarization method (see the summarization sketch after this list).
  4. Sentiment Analysis:

    • The transcription is analyzed to determine the sentiment, generating a report based on the content of the audio.
  5. Fact Checking:

    • Integrated tools allow fact-checking of the transcription using various research databases such as Wikipedia, DuckDuckGo, and PubMed.
  6. Text Statistics:

    • Displays word count, character count, and word frequency of the transcription.
  7. QA Search:

    • Searches previous transcripts and summaries stored in a Pinecone vector database to find related content and insights.
  8. User Authentication:

    • Built-in user authentication with the ability to create and manage accounts. User data is stored securely in a database.
  9. Conversation Buffer Memory:

    • A chat interface lets users interact with AI models conversationally, maintaining context through a conversation buffer memory (see the memory sketch after this list).
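
The snippets below are minimal sketches of how these features could be wired up; file names and variable names are illustrative, not taken from the repository. First, uploaded audio in any of the supported formats can be normalized with Pydub (which relies on ffmpeg being installed) before transcription:

    from pydub import AudioSegment

    # Load whatever the user uploaded (webm, m4a, mpeg, ...) and re-export
    # it as mp3 so the transcription step always sees a single format.
    audio = AudioSegment.from_file("uploaded_recording.webm")
    audio.export("normalized_recording.mp3", format="mp3")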
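
A sketch of the transcription and correction step, assuming the current openai Python client; the repository may pin an older client with a slightly different call:

    import os
    from openai import OpenAI

    client = OpenAI(api_key=os.getenv("OPENAI_API_KEY"))

    # Transcribe the uploaded audio with Whisper.
    with open("normalized_recording.mp3", "rb") as audio_file:
        transcript = client.audio.transcriptions.create(model="whisper-1", file=audio_file)

    # Ask GPT-4 to correct grammar and spelling without changing the meaning.
    corrected = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": "Fix grammar and spelling; keep the meaning unchanged."},
            {"role": "user", "content": transcript.text},
        ],
    )
    print(corrected.choices[0].message.content)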
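
A sketch of map-reduce summarization with LangChain and GPT-4; import paths differ between LangChain releases, so treat this as one plausible arrangement rather than the repository's exact code:

    from langchain.chains.summarize import load_summarize_chain
    from langchain.chat_models import ChatOpenAI
    from langchain.docstore.document import Document
    from langchain.text_splitter import RecursiveCharacterTextSplitter

    transcript_text = "..."  # the corrected transcript from the previous step

    # "Map": summarize each chunk; "reduce": merge the partial summaries.
    splitter = RecursiveCharacterTextSplitter(chunk_size=2000, chunk_overlap=200)
    docs = [Document(page_content=chunk) for chunk in splitter.split_text(transcript_text)]

    llm = ChatOpenAI(model_name="gpt-4", temperature=0)
    chain = load_summarize_chain(llm, chain_type="map_reduce")
    summary = chain.run(docs)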
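
A sketch of how the chat tab could keep conversational context with LangChain's ConversationBufferMemory; the prompts here are illustrative:

    from langchain.chains import ConversationChain
    from langchain.chat_models import ChatOpenAI
    from langchain.memory import ConversationBufferMemory

    llm = ChatOpenAI(model_name="gpt-4", temperature=0)
    chat = ConversationChain(llm=llm, memory=ConversationBufferMemory())

    # Earlier turns stay in the buffer, so follow-ups keep their context.
    chat.predict(input="List the action items mentioned in the transcript.")
    chat.predict(input="Who was responsible for the first one?")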

Architecture

  • Frontend: Built with Streamlit for an interactive and user-friendly interface.
  • Backend: Uses OpenAI Whisper for transcription and OpenAI GPT-4 for summarization and sentiment analysis.
  • Database: Transcriptions, summaries, and other metadata are stored in a database using custom functions for user management and transcript handling.
  • Vector Store: Pinecone stores embeddings of transcriptions and summaries so related content can be retrieved efficiently for QA search (see the Pinecone sketch after this list).
  • APIs and Tools:
    • OpenAI GPT-4 for text generation, correction, and summarization.
    • DuckDuckGo, Wikipedia, and PubMed for research and fact-checking (see the research-agent sketch after this list).
    • Pydub for audio processing.
    • LangChain for orchestrating AI models and connecting tools.
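
A sketch of the QA search path, assuming the classic langchain Pinecone wrapper, the older pinecone-client (v2) init call, and a pre-existing index named "transcripts"; the index name, environment, and query are assumptions:

    import os

    import pinecone
    from langchain.embeddings import OpenAIEmbeddings
    from langchain.vectorstores import Pinecone

    pinecone.init(api_key=os.getenv("PINECONE_API_KEY"), environment="us-east-1-aws")

    # Embed the query and pull back the most similar stored transcripts and summaries.
    store = Pinecone.from_existing_index("transcripts", OpenAIEmbeddings())
    hits = store.similarity_search("What was decided about the release date?", k=3)
    for doc in hits:
        print(doc.page_content[:200])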
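
A sketch of how the research and fact-checking tools could be wired together with a LangChain agent; only DuckDuckGo and Wikipedia are shown, and the exact tool classes and agent setup vary between LangChain versions:

    from langchain.agents import AgentType, initialize_agent
    from langchain.chat_models import ChatOpenAI
    from langchain.tools import DuckDuckGoSearchRun, WikipediaQueryRun
    from langchain.utilities import WikipediaAPIWrapper

    llm = ChatOpenAI(model_name="gpt-4", temperature=0)
    tools = [DuckDuckGoSearchRun(), WikipediaQueryRun(api_wrapper=WikipediaAPIWrapper())]

    # A ReAct-style agent chooses a tool, runs the lookup, and reports back.
    agent = initialize_agent(tools, llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION)
    agent.run("Fact-check this claim from the transcript: the first Moon landing was in 1969.")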

Installation and Setup

  1. Clone the repository:

    git clone https://github.com/doniaskima/OpenAI-Transcription-Tool
  2. Install dependencies using pip:

    pip install -r requirements.txt
  3. Set up environment variables:

    • Create a .env file with your OpenAI API key (see the loading sketch after these steps):
      OPENAI_API_KEY=your-openai-api-key
      
  4. Run the Streamlit application:

    streamlit run app.py
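
A minimal sketch of how the key from .env can be picked up at startup, assuming python-dotenv is among the dependencies:

    import os

    from dotenv import load_dotenv

    load_dotenv()  # reads .env from the project root
    openai_api_key = os.getenv("OPENAI_API_KEY")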

Usage

  1. Login or Create Account: Authenticate through the user authentication tab.
  2. Upload or Record Audio: Choose between uploading an audio file or recording one directly through the interface.
  3. Generate Transcription: Click "Generate Transcript and Summary" to transcribe the audio and generate a summary, sentiment analysis, and other insights.
  4. Interact with the Transcript: Review the transcription, summary, and fact-check results. Chat with AI based on the transcript data.
  5. View Previous Transcriptions: Use the "Previous Transcriptions" tab to review and interact with earlier sessions.

Technologies Used

  • Streamlit: For creating the front-end interface.
  • OpenAI Whisper: For audio transcription.
  • OpenAI GPT-4: For text summarization, sentiment analysis, and fact-checking.
  • LangChain: To orchestrate multiple AI models and tools.
  • Pinecone: Vector database for storing and querying transcripts.
  • Pandas: For data manipulation and word frequency analysis.
  • Pydub: For handling and processing audio files.

Future Improvements

  • Integration of more AI tools for advanced fact-checking.
  • Support for additional languages in transcription.
  • Enhanced user interface and experience for better usability.
