Vinci Clips

AI-powered video clipping platform that automatically transforms long-form videos into engaging short clips optimized for social media platforms.

Vinci Clips is an open-source platform that leverages artificial intelligence to analyze video content, generate accurate transcriptions, and automatically identify the most engaging segments for creating viral short-form content. The platform streamlines the content creation workflow for creators, marketers, and businesses looking to maximize their video content's reach across multiple social media platforms.

Watch the Demo Loom On Youtube

Demo of one of the Features: Segregate Clips via AI

Key Features

Core Functionality

Intelligent Video Analysis: AI-powered content analysis using Google Gemini API
Automatic Transcription: Speaker diarization with precise timestamp alignment
Smart Clip Generation: AI suggests optimal clip segments based on content analysis
Multi-Format Support: Support for major video formats with automatic conversion

Content Processing

Video-to-Audio Conversion: High-quality audio extraction using FFmpeg
Thumbnail Generation: Automatic video thumbnail creation for quick preview
Status Tracking: Real-time processing status with comprehensive error handling
Batch Processing: Support for multiple video uploads with queue management

User Interface

Intuitive Dashboard: Clean, responsive interface built with Next.js and Tailwind CSS
Drag-and-Drop Upload: Simple file upload with progress tracking (up to 2GB)
Video Playback: Integrated video player with transcript synchronization
Mobile Responsive: Optimized experience across desktop and mobile devices

Architecture

┌─────────────────┐    ┌─────────────────┐    ┌─────────────────┐
│   Frontend      │    │   Backend       │    │   External      │
│   Next.js       │◄──►│   Express API   │◄──►│   Services      │
│   React/TS      │    │   Node.js       │    │                 │
└─────────────────┘    └─────────────────┘    └─────────────────┘
         │                        │                        │
         │                        ▼                        │
         │              ┌─────────────────┐                │
         │              │   Database      │                │
         │              │   LocalDB       │                │
         │              └─────────────────┘                │
         │                                                 │
         │              ┌─────────────────┐                │
         └──────────────│   File Storage  │                │
                        │   Local system  │                │
                        └─────────────────┘                │
                                                           │
                        ┌─────────────────┐                │
                        │   AI Services   │◄───────────────┘
                        │   Gemini API    │
                        └─────────────────┘

Technology Stack:

Frontend: Next.js 15, React, TypeScript, Tailwind CSS, Shadcn/ui
Backend: Node.js, Express.js, MongoDB with Mongoose
AI/ML: Google Gemini API for transcription and analysis
Media Processing: FFmpeg for video/audio conversion and manipulation
Cloud Storage: Google Cloud Storage with signed URL access
Infrastructure: Docker-ready with environment-based configuration

Getting Started

Prerequisites

Before running Vinci Clips, ensure you have the following installed:

Node.js (version 18.0.0 or higher)
FFmpeg (installed and available in your system PATH)
MongoDB (local installation or cloud instance)

Additionally, you'll and API keys for:

Google Gemini API (for AI transcription services)

Installation

Clone the repository

bash
git clone https://github.com/tryvinci/vinci-clips.git
cd vinci-clips

Install dependencies

bash
# Install dependencies for both frontend and backend
npm run install:all

Configure environment variables

Create a .env file in the backend directory:
```
bash
cd backend
cp .env.example .env
```
Edit backend/.env with your actual values:
```
env
# Server Configuration
PORT=8080

# AI Services
GEMINI_API_KEY=your-gemini-api-key
```
Note: For Docker deployment, see docker-setup.md for different environment configuration.

Start the application (install concurrently)

bash
# Start both frontend and backend
npm start

# Or start individually:
npm run start:backend  # Backend on port 8080
npm run start:frontend # Frontend on port 3000

Access the application

Open your browser and navigate to http://localhost:3000

Usage

Basic Workflow

Upload Video: Drag and drop a video file (up to 2GB) onto the upload interface
Processing: The system automatically:
- Converts video to audio format
- Uploads files to cloud storage
- Generates video thumbnails
- Creates AI-powered transcription with speaker identification
Review Transcript: View the generated transcript with timestamp alignment
Generate Clips: Use AI-suggested segments or manually select time ranges for clip creation
Download Results: Access generated clips from cloud storage with direct download links

API Usage

The platform provides a RESTful API for programmatic access:

javascript
// Upload a video
POST /api/upload
Content-Type: multipart/form-data

// Get transcript status
GET /api/transcripts/:id

// Generate clip
POST /api/clips/generate
{
  "transcriptId": "...",
  "startTime": 30,
  "endTime": 90
}

For detailed API documentation, see API Reference.

Development

Project Structure

vinci-clips/
├── backend/                 # Express.js API server
│   ├── src/
│   │   ├── models/         # MongoDB schemas
│   │   ├── routes/         # API endpoints
│   │   └── index.js        # Server entry point
|   └── storage/db.json     # All your data is stored here
│   └── uploads             # All your videos are stored here
│   └── package.json
├── frontend/               # Next.js application
│   ├── src/
│   │   ├── app/           # App router pages
│   │   ├── components/    # React components
│   │   └── lib/           # Utility functions
│   └── package.json
├── package.json           # Root package.json for scripts
└── README.md

Development Commands

bash
# Development
npm run dev              # Start both services in development mode
npm run start:backend    # Start backend only
npm run start:frontend   # Start frontend only

# Production
npm run build           # Build both applications
npm start              # Start both services in production mode

# Testing
npm test               # Run test suites
npm run lint           # Run ESLint checks

Testing

bash
# Backend tests
cd backend && npm test

# Frontend tests
cd frontend && npm test

# End-to-end tests
npm run test:e2e

Deployment

Docker Deployment

bash
# Build and run with Docker Compose
docker-compose up --build

# Production deployment
docker-compose -f docker-compose.prod.yml up -d

Environment Variables

For local deployment, ensure all environment variables are properly configured:

Variable	Description	Required
`PORT`	Backend server port	No (default: 8080)
`GEMINI_API_KEY`	Google Gemini API key	Yes

Contributing

We welcome contributions to Vinci Clips! Please see our Contributing Guidelines for details on:

Code of conduct
Development workflow
Pull request process
Issue reporting guidelines

Development Setup

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Make your changes
Add tests for new functionality
Ensure all tests pass (npm test)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Development Status

Core Platform (Completed)

Video upload with drag-and-drop interface (2GB limit)
FFmpeg-based video processing and thumbnail generation
Google Cloud Storage integration with signed URLs
AI transcription using Google Gemini API with speaker diarization
MongoDB data persistence with comprehensive status tracking
React/Next.js frontend with responsive design
Basic clip generation from transcript segments
Streamer's Webcam And Gameplay Video into a reel conversion

Caption System (Recently Added)

TikTok/Reels style caption generation with 5 popular styles
SRT-based FFmpeg subtitle rendering
Word-level timestamp conversion from segment data
Caption preview integration in reframe workflow

Planned Improvements

High Priority

Enhanced word-level timestamp precision (Issue #19)
Advanced caption styles based on social media research (Issue #20)
Real-time caption preview with video overlay (Issue #21)

Medium Priority

Intelligent reframing with subject detection (Issue #22)
Smooth camera movement for reframed videos (Issue #23)
Smart fallback mechanisms for complex scenarios (Issue #24)

Future Enhancements

Speaker-aware caption positioning (Issue #25)
LLM-enhanced clip suggestion engine (Issue #26)
Performance caching for transcripts and ML models (Issue #27)
Modular caption style plugin system (Issue #28)

See GitHub Issues for detailed technical specifications and implementation plans.

License

This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0). See the LICENSE file for details.

The AGPL-3.0 license ensures that any modifications or derivatives of this software, including those running on servers, must also be made available under the same license terms.

Support

Community Support

GitHub Issues: Report bugs or request features
Discussions: Join community discussions
Documentation: Read the full documentation

Commercial Support

For enterprise deployments, custom development, or commercial licensing options, please contact us at support@tryvinci.com.

Acknowledgments

Google Gemini API for powerful AI transcription capabilities
FFmpeg for reliable video processing
Next.js and Vercel for excellent development experience
MongoDB for flexible data storage
Open Source Community for inspiration and contributions

Built by the Vinci team. Made possible by the open source community.

Vinci clips

Vinci Clips

Watch the Demo Loom On Youtube

Demo of one of the Features: Segregate Clips via AI

Key Features

Core Functionality

Content Processing

User Interface

Architecture

Getting Started

Prerequisites

Installation

Usage

Basic Workflow

API Usage

Development

Project Structure

Development Commands

Testing

Deployment

Docker Deployment

Environment Variables

Contributing

Development Setup

Development Status

Core Platform (Completed)

Caption System (Recently Added)

Planned Improvements

License

Support

Community Support

Commercial Support

Acknowledgments

Contributors

Vinci Clips

Watch the Demo Loom On Youtube

Demo of one of the Features: Segregate Clips via AI

Key Features

Core Functionality

Content Processing

User Interface

Architecture

Getting Started

Prerequisites

Installation

Usage

Basic Workflow

API Usage

Development

Project Structure

Development Commands

Testing

Deployment

Docker Deployment

Environment Variables

Contributing

Development Setup

Development Status

Core Platform (Completed)

Caption System (Recently Added)

Planned Improvements

License

Support

Community Support

Commercial Support

Acknowledgments

Contributors

Related Repositories