GitHub - video-db/Director: Intelligent agents for your video library

Intelligent agents for your video library
View Demo »

Report Bug · Request Feature · New Agent Request

🧐 What is The Director?

The Director is an AI-powered framework that lets you interact with your video and audio collections using natural language. Forget complex tools—just tell The Director what you want, and it gets it done.

Whether you’re working with social media clips, lectures, movies, YouTube videos, or any other content, The Director enables you to:

Summarize videos in seconds.
Search for specific moments.
Create clips instantly.
Add overlays, generate thumbnails, and much more. All powered by VideoDB’s scalable "video-as-data" infrastructure.

For example, a simple command like: Upload this video and send the highlights to my Slack, sets everything in motion.

Built with flexibility in mind, The Director is perfect for developers, creators, and teams looking to harness AI to simplify media workflows and unlock new possibilities. 📺 Watch: Intro video

Demos.Compilation-Compressed.mp4

⭐️ Key Features

🤖Build Smart Video Agents

Create custom AI agents that handle tedious tasks for you:

Summarize videos in seconds.
Search and index your media library.
Organize and clip your content effortlessly.

🎨 A New Way to Interact

Experience a sleek, chat-based interface with built-in video playback and intuitive controls. It’s like having a personal assistant for your media.

🔍 Smarter Media Analysis

Connect seamlessly with powerful AI tools like LLMs, databases, and GenAI APIs, while VideoDB ensures your video infrastructure is reliable and scalable.

🧩 Customizable and Flexible

Easily add new features and tools to your workflow. Whether you want to run it locally or on your cloud, The Director adapts to your needs.

⚙️ Architecture Overview

Director's architecture brings together:

Backend Reasoning Engine: Handles workflows and decision-making. Chat-Based UI: Engage with your media library conversationally. Video Player: Advanced playback and interaction tools. Collection View: Organize and browse your media effortlessly.

🧠 Reasoning Engine

At the heart of The Director is its Reasoning Engine, a powerful core that drives intelligent decision-making and dynamic workflows. It acts as the brain behind the agents, enabling them to process commands, interact with data, and deliver meaningful outputs.

How It Works

Contextual Understanding: The engine analyzes user inputs and maintains context, ensuring smooth and coherent interactions with agents.
Dynamic Agent Orchestration: Based on the user’s needs, it identifies and activates the right agents to complete tasks efficiently.
Modular Processing: Tasks are broken into smaller steps, allowing agents to collaborate and deliver accurate results in real time.

Key Capabilities

Multi-Agent Coordination: Seamlessly integrates multiple agents to handle complex workflows, such as summarizing, editing, and searching videos.
Real-Time Updates: Provides live progress and feedback as tasks are being completed.
Extensible Design: Easily adaptable to include custom logic or connect to external APIs for more advanced capabilities.

See It in Action

The Reasoning Engine works in tandem with the chat-based UI, making video interaction intuitive and efficient. For example:

Input: "Create a clip of the funniest scene in this video and share it on Slack."
Output: The engine orchestrates upload, scene detection, clipping, and sharing agents to deliver results seamlessly.

For a closer look, check out the detailed architecture diagram below:

Explore how the Reasoning Engine powers The Director to simplify and supercharge your media workflows.

🏃 Getting Started

📺 Watch: Setup video

Prerequisites

Python 3.9 or higher
Node.js 22.8.0 or higher
npm

Installation

1. Clone the repository:

git clone https://github.com/video-db/Director.git
cd Director

2. Run the setup script:

./setup.sh

This script will:

Install Node.js 22.8.0 using nvm

Install Python and pip

Set up virtual environments for both frontend and backend.

3. Configure the environment variables:

Edit the .env files to add your API keys and other configuration options.

Supported platforms:

Mac
Linux
Windows (WSL)

💬 Running the Application

To start both the backend and frontend servers:

make run

Backend: http://127.0.0.1:8000
Frontend: http://127.0.0.1:8080

For specific tasks:

Backend only: make run-be
Frontend only: make run-fe

📘 Creating a New Agent

To create a new agent in Director, follow these steps:

Copy the template: Duplicate sample_agent.py in Director/backend/director/agents/ and rename it.
Update class details:
- Rename the class.
- Update agent_name and description
Implement logic:
- Update parameters and docstring
- Implement your agent's logic
- Update the run() method.
Handle output and status updates:
- Use appropriate content types (TextContent, VideoContent, ImageContent, SearchResultContent)
- Update self.output_message.actions for progress indicators
- Use push_update() to emit progress events
- Set content status (progress, success, error) and messages
Implement error handling:
- Set error status and messages if issues occur
Finalize the response:
- Call self.output_message.publish() to emit final state and persist session
- Return an AgentResponse with result, message, and data
Register the agent:
- Import your new agent class in Director/backend/director/handler.py
- Add it to the self.agents list in ChatHandler

Remember to consider creating reusable tools if your agent's functionality could be shared across multiple agents.

📖 Documentation

Serve Locally

To serve the documentation on port 9000:

source backend/venv/bin/activate  
make install-be
mkdocs serve -a localhost:9000

To build the documentation:

mkdocs build

🤝 Contributing

Contributions are what make the open source community such an amazing place to be learn, inspire, and create. Any contributions you make are greatly appreciated.

Fork the Project
Create your Feature Branch (git checkout -b feature/AmazingFeature)
Commit your Changes (git commit -m 'Add some AmazingFeature')
Push to the Branch (git push origin feature/AmazingFeature)
Open a Pull Request

Name		Name	Last commit message	Last commit date
Latest commit History 236 Commits
.github		.github
backend		backend
docs		docs
frontend		frontend
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
mkdocs.yml		mkdocs.yml
render.yaml		render.yaml
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧐 What is The Director?

⭐️ Key Features

🤖Build Smart Video Agents

🎨 A New Way to Interact

🔍 Smarter Media Analysis

🧩 Customizable and Flexible

⚙️ Architecture Overview

🧠 Reasoning Engine

How It Works

Key Capabilities

See It in Action

🏃 Getting Started

Prerequisites

Installation

Supported platforms:

💬 Running the Application

📘 Creating a New Agent

📖 Documentation

Serve Locally

🤝 Contributing

About

Releases

Packages

Contributors 4

Languages

License

video-db/Director

Folders and files

Latest commit

History

Repository files navigation

🧐 What is The Director?

⭐️ Key Features

🤖Build Smart Video Agents

🎨 A New Way to Interact

🔍 Smarter Media Analysis

🧩 Customizable and Flexible

⚙️ Architecture Overview

🧠 Reasoning Engine

How It Works

Key Capabilities

See It in Action

🏃 Getting Started

Prerequisites

Installation

Supported platforms:

💬 Running the Application

📘 Creating a New Agent

📖 Documentation

Serve Locally

🤝 Contributing

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages