
Installation

This guide covers installing Speakora on various platforms.

  • Python 3.10+ (required for SeamlessM4T)
  • Git for cloning the repository
  • FFmpeg for video/audio processing
  • Node.js 18+ (optional, for frontend development)
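A quick way to confirm these prerequisites before installing is a small shell check. The sketch below is not part of Speakora: `version_ge`, `report`, and `check_prereqs` are hypothetical helper names, and it assumes the interpreters are available as `python3` and `node` on your PATH.

```shell
# Succeeds if version $1 (e.g. "3.11.4") is at least MAJOR.MINOR $2 (e.g. "3.10").
# Compares numerically, so 3.9 is correctly treated as older than 3.10.
version_ge() {
    have_major=${1%%.*}
    have_rest=${1#*.}
    have_minor=${have_rest%%.*}
    want_major=${2%%.*}
    want_rest=${2#*.}
    want_minor=${want_rest%%.*}
    [ "$have_major" -gt "$want_major" ] ||
        { [ "$have_major" -eq "$want_major" ] && [ "$have_minor" -ge "$want_minor" ]; }
}

# Prints one aligned status line per tool.
report() {
    printf '%-8s %s\n' "$1" "$2"
}

check_prereqs() {
    # Python 3.10+ is required
    if command -v python3 >/dev/null 2>&1; then
        py=$(python3 -c 'import sys; print("%d.%d" % sys.version_info[:2])')
        if version_ge "$py" 3.10; then
            report python3 "ok ($py)"
        else
            report python3 "too old ($py, need 3.10+)"
        fi
    else
        report python3 "missing"
    fi

    # Git and FFmpeg only need to be present
    for tool in git ffmpeg; do
        if command -v "$tool" >/dev/null 2>&1; then
            report "$tool" "ok"
        else
            report "$tool" "missing"
        fi
    done

    # Node.js 18+ is optional (frontend development only)
    if command -v node >/dev/null 2>&1; then
        node_v=$(node --version)
        node_v=${node_v#v}
        if version_ge "$node_v" 18.0; then
            report node "ok ($node_v)"
        else
            report node "too old ($node_v, need 18+)"
        fi
    else
        report node "missing (optional)"
    fi
}

check_prereqs
```

Note that the minimum version must be passed as MAJOR.MINOR (for example `18.0`, not `18`) for this simple parser to work.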

For best performance, a GPU is recommended:

Platform        Requirements
NVIDIA          CUDA 11.8+ with 8GB+ VRAM
Apple Silicon   macOS 12.0+ with Metal/MPS
AMD             ROCm 5.0+

# Clone the repository
git clone https://github.com/rennerdo30/speakora.git
cd speakora
# Run the setup script
./setup.sh
# Start the application
./start.sh

The setup script will:

  1. Create a Python virtual environment
  2. Install all Python dependencies
  3. Install frontend dependencies (if Node.js is available)
  4. Build the frontend (if Node.js is available)
  5. Create necessary directories

1. Create a Virtual Environment

python3.10 -m venv venv
source venv/bin/activate # Linux/macOS
# or
venv\Scripts\activate # Windows
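Before installing packages, it can help to confirm that the virtual environment is actually active, so that `pip` does not install into the system Python. The standard `activate` script exports the `VIRTUAL_ENV` variable, which the sketch below inspects. `venv_status` is a hypothetical helper name, and the snippet applies to Linux/macOS shells only.

```shell
# Prints whether a Python virtual environment is active in this shell,
# based on the VIRTUAL_ENV variable set by venv's activate script.
venv_status() {
    if [ -n "${VIRTUAL_ENV:-}" ]; then
        echo "active: $VIRTUAL_ENV"
    else
        echo "inactive"
    fi
}

venv_status
```

If this prints `inactive`, run the `source venv/bin/activate` step again in the same terminal.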

2. Install Dependencies

pip install --upgrade pip setuptools wheel
pip install -r requirements.txt

3. Install Development Dependencies (Optional)

pip install -r requirements-dev.txt

4. Build the Frontend

cd frontend
npm install
npm run build
cd ..
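Since Node.js is optional, a manual install may want to skip the frontend build when it is missing, much as the setup script is described as doing. The following is only a sketch of that idea, not the setup script's actual code; `have_node` and `build_frontend` are hypothetical names.

```shell
# Succeeds only when both node and npm are on PATH.
have_node() {
    command -v node >/dev/null 2>&1 && command -v npm >/dev/null 2>&1
}

# Builds the frontend if Node.js is available, otherwise skips it.
build_frontend() {
    if ! have_node; then
        echo "Node.js not found; skipping frontend build"
        return 0
    fi
    # Run in a subshell so the caller's working directory is unchanged
    (cd frontend && npm install && npm run build)
}
```

Call `build_frontend` from the repository root, where the `frontend` directory lives.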

5. Download Models

./run.sh download --model-size medium

Available model sizes:

Size     Model            VRAM    Quality
medium   SeamlessM4T v1   ~8GB    Good
large    SeamlessM4T v2   ~16GB   Best
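The table above can be turned into a rough rule of thumb for choosing a size from the available VRAM. This is a sketch under stated assumptions, not Speakora's own selection logic: `pick_model_size` is a hypothetical helper, and the 16000 MiB cutoff is an approximation of the "~16GB" figure.

```shell
# Prints a model size for a given amount of VRAM in MiB.
# Assumption: roughly 16 GB or more can hold "large"; anything less gets "medium".
pick_model_size() {
    vram_mib=$1
    if [ "$vram_mib" -ge 16000 ]; then
        echo large
    else
        echo medium
    fi
}

pick_model_size 24576   # a 24 GB card
pick_model_size 8192    # an 8 GB card
```

The result can be passed straight to the download command, for example `./run.sh download --model-size "$(pick_model_size 24576)"`. With much less than 8 GB of VRAM, even `medium` may not fit, and running on the CPU may be the safer choice.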

Install FFmpeg via Homebrew:

brew install ffmpeg
Install FFmpeg and Python 3.10 via apt (Debian/Ubuntu):

sudo apt update
sudo apt install ffmpeg python3.10 python3.10-venv

On Windows:

  1. Install Python 3.10+ from python.org
  2. Install FFmpeg from ffmpeg.org
  3. Add FFmpeg to your PATH

For containerized deployment, see the Docker guide.

Check device detection:

./run.sh info
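On NVIDIA systems, the driver's own `nvidia-smi` tool offers an independent cross-check of what the application reports. The sketch below prints the largest VRAM size (in MiB) among the visible GPUs. `max_vram_mib` is a hypothetical helper, and the snippet says nothing about Apple Silicon or AMD devices.

```shell
# Reads one integer (MiB) per line on stdin and prints the largest value.
# Non-numeric lines are ignored.
max_vram_mib() {
    max=0
    while read -r mib; do
        [ "$mib" -gt "$max" ] 2>/dev/null && max=$mib
    done
    echo "$max"
}

if command -v nvidia-smi >/dev/null 2>&1; then
    # One line per GPU, total memory in MiB without header or unit suffix
    nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits | max_vram_mib
else
    echo "nvidia-smi not found; no NVIDIA GPU visible to this shell"
fi
```

A result of `0` means `nvidia-smi` ran but reported no usable memory figure, which usually points to a driver problem rather than an application problem.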

Force CPU mode:

export SEAMLESS_DEVICE=cpu
./start.sh

Use a smaller model:

# In config/default.yaml
model:
  size: "medium"

Manually download models:

./run.sh download --model-size medium