Skip to content

AudioIntel - Audio/Video Intelligence, Transcripts, Summary, and much more

Notifications You must be signed in to change notification settings

mtwn105/audio-intel

Repository files navigation

πŸŽ™οΈ AudioIntel

Transform audio into actionable intelligence with our powerful AI platform. AudioIntel helps you extract valuable insights from audio content through transcription, analysis, and AI-powered features.

✨ Features

  • 🎡 Multiple Input Methods

    • Upload audio files (MP3, WAV)
    • Record directly in browser
    • Analyze YouTube videos
  • πŸ€– AI-Powered Analysis

    • Smart summaries and key takeaways
    • Sentiment analysis
    • Speaker identification
    • Actionable insights generation
  • πŸ“ Content Generation

    • Automatic blog post creation
    • Interactive chat with transcripts
    • Key sections identification
  • πŸ” Advanced Features

    • Timeline view with precise timestamps
    • Multi-speaker detection
    • Searchable transcripts
    • Real-time sentiment tracking

πŸš€ Getting Started

Prerequisites

  • Node.js 18+
  • npm or yarn
  • AssemblyAI API key

Installation

  1. Clone the repository
git clone https://github.com/yourusername/audio-intel.git
cd audio-intel
  1. Install dependencies
npm install
# or
yarn install
  1. Set up environment variables
cp .env.example .env

Required environment variables:

ASSEMBLYAI_API_KEY=your_api_key
NEXT_PUBLIC_APP_URL=http://localhost:3000
UPLOADTHING_TOKEN=your_uploadthing_token
GOOGLE_GENERATIVE_AI_API_KEY=your_google_generative_ai_api_key
GOOGLE_TRANSLATE_API_KEY=your_google_translate_api_key
BETTER_AUTH_SECRET=your_better_auth_secret
BETTER_AUTH_BASE_URL=http://localhost:3000
DATABASE_URL=your_database_url
  1. Run the development server
npm run dev
# or
yarn dev

Open http://localhost:3000 to see the application.

πŸ› οΈ Tech Stack

πŸ“ Project Structure

src/
β”œβ”€β”€ app/                    # Next.js app router pages
β”‚   β”œβ”€β”€ actions/           # Server actions
β”‚   β”œβ”€β”€ api/               # API routes
β”‚   └── intels/           # Intel pages
β”œβ”€β”€ components/            # React components
β”œβ”€β”€ lib/                   # Utility functions
β”œβ”€β”€ hooks/                # Custom React hooks
β”œβ”€β”€ public/               # Public assets
└── types/                # TypeScript types

πŸ”‘ Key Features Implementation

Audio Processing

  • Uses AssemblyAI for advanced audio transcription and analysis
  • Supports multiple audio formats and YouTube URLs
  • Real-time processing status updates

Transcript Analysis

  • Speaker diarization
  • Sentiment analysis
  • Key points extraction
  • Blog post generation

Interactive Features

  • Real-time chat with transcript context
  • Searchable transcript interface
  • Timeline-based navigation

🀝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the project
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

πŸ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

πŸ‘¨β€πŸ’» Author

Made with ❀️ by Amit Wani

πŸ™ Acknowledgments

  • AssemblyAI for their powerful audio intelligence API
  • shadcn/ui for the beautiful UI components
  • The open-source community for their amazing tools and libraries

πŸ“± Screenshots

Multiple Sources - Audio file, Record file & YouTube

audioFile recordAudio youtube

Overview

overview

Summary

summary

Transcript

transcript

Chat

chat

Blog

blog

History

history

πŸ”— Links