
NextJS D-ID Starter Kit

(Screenshot of the NextJS D-ID Starter Kit)

This project demonstrates how to integrate the D-ID API and OpenAI's Whisper into a Next.js application. It lets users interact with AI-powered avatars through text input or voice commands, showcasing the potential of conversational AI interfaces.

Features

  • Interactive avatar selection
  • Text-to-speech functionality
  • Voice input using OpenAI's Whisper for transcription
  • Integration with OpenAI's GPT model for conversational responses
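
The last two features boil down to two OpenAI calls: Whisper for speech-to-text and a GPT model for the reply. As a rough sketch of the transcription half, assuming the official openai Node SDK (v4+) and a hypothetical app/api/transcribe/route.ts (the actual route names in this repo may differ):

    // app/api/transcribe/route.ts (hypothetical route name)
    import OpenAI from "openai";

    const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });

    export async function POST(request: Request) {
      // The browser sends the recorded audio as multipart form data.
      const formData = await request.formData();
      const audio = formData.get("audio") as File;

      // Whisper speech-to-text via the official SDK.
      const transcription = await openai.audio.transcriptions.create({
        model: "whisper-1",
        file: audio,
      });

      return Response.json({ text: transcription.text });
    }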

Getting Started

Prerequisites

  • Node.js (v14 or later)
  • npm (v6 or later)
  • A D-ID API key
  • An OpenAI API key

Setting up the project

  1. Clone this repository:

    git clone https://github.com/WillKre/nextjs-whisper-d-id.git
    cd nextjs-whisper-d-id
    
  2. Install dependencies:

    npm install
    
  3. Set up environment variables: run npm run setup-env, or create a .env file in the root directory and add the following (see the note on variable visibility after this list):

    D_ID_API_KEY=your_d_id_api_key
    OPENAI_API_KEY=your_openai_api_key
    NEXT_PUBLIC_OPENAI_API_KEY=your_openai_api_key
    

    Then populate the values:

    • To obtain a D-ID API key, sign up at D-ID's website and navigate to the API section in your account settings.
    • For an OpenAI API key, create an account at OpenAI's website and generate an API key in your account dashboard.
  4. Start the development server:

    npm run dev
    
  5. Open http://localhost:3000 in your browser to see the application.
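
A note on the environment variables from step 3: in Next.js, only variables prefixed with NEXT_PUBLIC_ are inlined into the browser bundle; the rest stay server-side. This means NEXT_PUBLIC_OPENAI_API_KEY is visible to anyone who loads the app, so treat it accordingly. For example:

    // Server-side code (such as API routes) can read any variable:
    const dIdKey = process.env.D_ID_API_KEY; // undefined in the browser

    // Client components only see NEXT_PUBLIC_-prefixed variables,
    // which are baked into the JavaScript bundle at build time:
    const openAiKey = process.env.NEXT_PUBLIC_OPENAI_API_KEY;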

Usage

  1. Select an avatar from the dropdown menu.
  2. Choose a voice for the avatar.
  3. Type text in the "Repeat" input to make the avatar speak that text.
  4. Use the "Chat" input to have a conversation with the AI-powered avatar:
    • Type your message and press enter, or
    • Click the microphone icon to use voice input (requires microphone permission in the browser; a capture sketch follows below).
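
Under the hood, browser voice input is typically captured with the MediaRecorder API and posted to a server route for transcription. A minimal sketch, assuming a hypothetical /api/transcribe route; the repo's actual component code may differ:

    // Inside an async click handler for the microphone icon:
    const stream = await navigator.mediaDevices.getUserMedia({ audio: true });
    const recorder = new MediaRecorder(stream);
    const chunks: Blob[] = [];

    recorder.ondataavailable = (event) => chunks.push(event.data);
    recorder.onstop = async () => {
      const audio = new Blob(chunks, { type: recorder.mimeType });
      const formData = new FormData();
      formData.append("audio", audio, "recording.webm");

      // Hand the recording to the server for Whisper transcription.
      const res = await fetch("/api/transcribe", { method: "POST", body: formData });
      const { text } = await res.json();
      console.log("Transcribed:", text);
    };

    recorder.start();
    // ...and call recorder.stop() when the user clicks the icon again.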

Voice Selection

The project is set up to use ElevenLabs, a leading voice service, for generating realistic voices. The available voices are fetched dynamically from the ElevenLabs API.

If you want to use voices from other providers or add more options:

  1. Update the API endpoint in app/api/voices/route.ts:

    const response = await fetch("https://api.elevenlabs.io/v1/voices", {
      // ... existing headers ...
    });

    Replace the URL with the endpoint of your chosen voice provider.

  2. Ensure that the response is mapped to match the expected format (a fuller end-to-end sketch follows this list):

    const transformedVoices = data.voices.map((voice) => ({
      voice_id: voice.voice_id,
      name: voice.name,
    }));
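
Putting both steps together, here is a sketch of what a swapped-out app/api/voices/route.ts could look like. The endpoint URL, auth header, environment variable, and response shape ({ voices: [{ id, label }] }) are all placeholders for your chosen provider:

    export async function GET() {
      const response = await fetch("https://api.example-voices.com/v1/voices", {
        headers: { Authorization: `Bearer ${process.env.VOICE_PROVIDER_API_KEY}` },
      });
      const data = await response.json();

      // Map the provider's shape onto the { voice_id, name } format
      // the rest of the app expects.
      const voices = data.voices.map((voice: { id: string; label: string }) => ({
        voice_id: voice.id,
        name: voice.label,
      }));

      return Response.json({ voices });
    }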

Project Structure

  • app/: Contains the main application code and API routes.
  • components/: React components used throughout the application.
  • utils/: Utility functions and helper modules.
  • types/: Shared TypeScript types used across the project.
  • styles/: Global styles and Tailwind CSS configuration.
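
With the App Router, API routes such as app/api/voices/route.ts are file-system based: the folder path becomes the URL, and the file exports one handler per HTTP method. A hypothetical example:

    // app/api/hello/route.ts is served at GET /api/hello
    export async function GET() {
      return Response.json({ message: "hello" });
    }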

Technology Stack

  • Next.js: React framework for building the application
  • D-ID API: For generating interactive avatars
  • OpenAI API: For GPT-based conversation and Whisper transcription
  • NextUI: UI component library
  • Tailwind CSS: For styling
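
For orientation on the D-ID side: an avatar speaks by creating a "talk" from a source image and a script. A minimal sketch based on D-ID's public /talks endpoint; the exact payload and provider settings this starter uses may differ, so consult D-ID's docs:

    // Hypothetical server-side call; the field values are placeholders.
    const response = await fetch("https://api.d-id.com/talks", {
      method: "POST",
      headers: {
        Authorization: `Basic ${process.env.D_ID_API_KEY}`,
        "Content-Type": "application/json",
      },
      body: JSON.stringify({
        source_url: "https://example.com/avatar.jpg", // the selected avatar image
        script: {
          type: "text",
          input: "Hello there!",
          provider: { type: "elevenlabs", voice_id: "your_voice_id" },
        },
      }),
    });
    const talk = await response.json(); // returns an id you can poll for the rendered video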

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
