Welcome to MAVEN! 🚀 Your go-to AI-powered video creation suite that transforms text into engaging video content. Using cutting-edge AI technologies, MAVEN crafts professional videos by generating coherent narratives, matching visuals, and natural speech synthesis.
- 🌐 Real-time web search for up-to-date information
- 🤖 Multi-agent system powered by OpenAI Swarm
- 📊 Intelligent filtering and organization of search results
- 📝 Enhanced content generation with factual, current information
- 🔄 Toggle to enable/disable web search functionality
Generate videos in multiple languages:
- 🇬🇧 English
- 🇮🇹 Italian
- 🇪🇸 Spanish
- 🇫🇷 French
- 🇩🇪 German
- 📝 Essay generation using GPT-4 or GPT-3.5
- 🎨 Image generation using DALL-E 2 or DALL-E 3
- 🗣️ Text-to-speech conversion in multiple languages
- 🎥 Video compilation with synchronized audio and images
- ⏱️ Adjustable video length (30s, 1min, 4min)
- 🖼️ Configurable number of images (1-10)
- 🔄 Choice of language models and image generation models
- 🌓 Light/Dark theme support
- 📱 Responsive Material-UI design
- 🔔 Toast notifications for status updates
- 📊 Dynamic progress tracking
- 🔄 Detailed status updates for each generation step
- ⏳ Progress bar showing completion percentage
- 🎯 Current step indication
- Clone the repository
- Install backend dependencies:
cd backend
pip install -r requirements.txt
- Install frontend dependencies:
cd frontend
npm install
- Set up your environment variables:
- Copy
.env.example
to create a new.env
file:
cp .env.example .env
- Replace
your-openai-api-key
in the.env
file with your actual OpenAI API key - You can get an API key from OpenAI's website
- Copy
- Start the backend server:
cd backend
python api.py
- Start the frontend development server:
cd frontend
npm start
- Access MAVEN:
- Local development:
http://localhost:3030
- Network access:
http://your-ip:3030
- Local development:
-
Video Generation Protection
- Required security key for video generation
-
OpenAI API Key Management
- Optional API key input in UI
- Fallback to environment variable if not provided
- Secure handling of API keys
-
Web Research 🌐 (New in v0.2.0)
- Multi-agent system searches the web for up-to-date information
- Intelligent filtering and organization of search results
- Enhanced content generation with factual, current information
-
Essay Generation 📝
- AI generates a coherent essay based on your topic and web research
- Length is adjusted based on your selected video duration
-
Image Generation 🎨
- Essay is divided into meaningful segments
- Each segment is converted into a detailed image prompt
- DALL-E generates stunning visuals based on these prompts
-
Audio Generation 🔊
- Essay is converted to natural-sounding speech
- Multiple language support with native pronunciation
-
Video Compilation 🎬
- Images are sequenced with perfect timing
- Audio is synchronized with the visuals
- Final video is rendered in high-quality MP4 format
POST /generate
: Start video generationGET /status/{task_id}
: Check generation statusGET /video/{task_id}
: Download generated video
OPENAI_API_KEY
: Your OpenAI API keyOUTPUT_DIR
: Directory for generated files (default: "output")
- Fork the repository
- Create a feature branch
- Commit your changes
- Push to the branch
- Create a Pull Request
- YouTube Integration 🎥:
- Automatic video upload to YouTube after generation
- OAuth2 authentication for YouTube API
- Customizable video metadata (title, description, tags)
- Privacy settings configuration (public, unlisted, private)
- Playlist management
- Upload status tracking and notifications
This project is licensed under the MIT License - see the LICENSE file for details.