Pakistan's First

MBZ Voice SDK

Speak. Think. Respond. Seamlessly.

100% FREE

A powerful developer tool that enables you to integrate voice input, AI understanding (via Gemini), and spoken responses into any modern web app.

Easy Installation

Get Started in Seconds

Simple commands, powerful results. Clone the repository and you're ready to go.

Frontend Setup

Integrate the SDK into your web application with just a few lines of code.

frontend.js

import { MBZVoiceAgent } from "./mbz-voice-sdk.js"

const agent = new MBZVoiceAgent({
  apiUrl: "http://localhost:8000/ask",
  lang: "en-US",
  speak: true
})

// Start listening
document.getElementById("listen-btn").onclick = () => {
  agent.listen()
}

Backend Setup

Set up the backend with Python and FastAPI in minutes.

main.py

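from fastapi import FastAPI, Request  # Request is needed for the handler's type annotation
import google.generativeai as genai
import os

app = FastAPI()
# Read the Gemini API key from the environment
genai.configure(api_key=os.getenv("GEMINI_API_KEY"))
model = genai.GenerativeModel("gemini-1.5-flash")

@app.post("/ask")
async def ask(request: Request):
    # Expect a JSON body of the form {"query": "..."}
    data = await request.json()
    query = data.get("query")
    # Forward the query to Gemini and return its text reply
    response = model.generate_content(query)
    return {"answer": response.text}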
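The handler reads `data.get("query")` and passes it straight to the model, so a missing or blank query would reach Gemini as `None` or an empty string. A minimal sketch of a validation step, assuming you want to reject such requests early; `extract_query` is a hypothetical helper name and is not part of the SDK:

```python
def extract_query(data):
    """Return the trimmed query string from a decoded JSON body.

    Hypothetical helper (not part of MBZ Voice SDK). Raises ValueError when
    the body is not a JSON object or when "query" is missing, not a string,
    or blank.
    """
    if not isinstance(data, dict):
        raise ValueError("request body must be a JSON object")
    query = data.get("query")
    if not isinstance(query, str) or not query.strip():
        raise ValueError('"query" must be a non-empty string')
    return query.strip()
```

Inside the endpoint, a `ValueError` from this helper could be translated into a 400 response, for example by raising FastAPI's `HTTPException`.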

Core Features

Powerful Capabilities

Explore the powerful features that make MBZ Voice SDK the perfect choice for your voice-enabled applications

All-in-One Voice Solution

Advanced voice recognition with high accuracy
Gemini AI integration for intelligent responses
Natural text-to-speech capabilities
Privacy-focused design with local processing
Lightning-fast response times
Community-driven development

Powerful Features

Everything You Need

Voice Recognition

Easily capture user speech with browser-native voice recognition capabilities. Works in modern browsers and devices that support native speech recognition.

AI Understanding

Process speech with Google's Gemini AI to generate intelligent, context-aware responses to user queries.

Voice Response

Convert AI responses to natural-sounding speech with text-to-speech for a better user experience.

Privacy Focused

The backend runs on your own servers, and the SDK does not store your data or share it with third parties without your explicit permission. Queries are sent to the Gemini API to generate responses.

Community Driven

Open source with an active community of developers. Contribute, suggest features, and help shape the future of the SDK.

Interactive Demo

Try It Yourself

See how the MBZ Voice SDK can power conversational interfaces in your applications

Why Choose Us

Compare and See the Difference

See how MBZ Voice SDK stacks up against other voice recognition solutions

Features	MBZ Voice SDK (Pakistan's First)	Other Voice SDKs (Commercial)	Browser APIs (Basic)
Voice Recognition	+
AI Integration	+
Text-to-Speech
Customizable
Easy Integration	+
Local Processing
Price	Free	$$$	Free
Open Source

Installation Guide

Get Started

Setting Up the Backend

1. Clone the Repository

git clone https://github.com/ProMBZ/mbz-voice-sdk.git

2. Install Dependencies

cd mbz-voice-sdk/backend
pip install -r requirements.txt

3. Set Up Environment Variables

# Create a .env file in the backend directory
GEMINI_API_KEY=your_api_key_here
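Note that `os.getenv`, as used in the `main.py` example, only sees variables already present in the process environment; it does not read a `.env` file by itself. The repository may handle this differently (for example with a dotenv library), so treat the following as a minimal standard-library sketch under that assumption. `load_env_file` and `require_api_key` are hypothetical helper names, not part of the SDK:

```python
import os


def load_env_file(path=".env", environ=os.environ):
    """Copy KEY=VALUE pairs from a .env-style file into `environ`.

    Hypothetical helper: skips blank lines and # comments, never overwrites
    variables that are already set, and returns the pairs it added.
    """
    loaded = {}
    try:
        with open(path, encoding="utf-8") as fh:
            lines = fh.read().splitlines()
    except FileNotFoundError:
        return loaded  # No .env file: rely on the existing environment
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        value = value.strip().strip('"').strip("'")
        if key and key not in environ:
            environ[key] = value
            loaded[key] = value
    return loaded


def require_api_key(environ=os.environ):
    """Return GEMINI_API_KEY or fail early with a clear message."""
    key = environ.get("GEMINI_API_KEY", "").strip()
    if not key:
        raise RuntimeError("GEMINI_API_KEY is not set")
    return key
```

Calling `load_env_file()` followed by `require_api_key()` before `genai.configure(...)` would surface a missing key at startup rather than on the first request.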

4. Start the Server

uvicorn main:app --reload
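Once the server is running, you can check the endpoint without the browser. The sketch below uses only the Python standard library and assumes the server listens on `http://localhost:8000/ask` (the URL used in the frontend example) and accepts `{"query": ...}` and returns `{"answer": ...}` as in `main.py`. The function names `build_ask_request` and `ask` are illustrative, not part of the SDK:

```python
import json
from urllib import request


def build_ask_request(query, api_url="http://localhost:8000/ask"):
    """Build a POST request carrying {"query": ...} as a JSON body."""
    body = json.dumps({"query": query}).encode("utf-8")
    return request.Request(
        api_url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def ask(query, api_url="http://localhost:8000/ask", timeout=30):
    """Send the query to the backend and return the "answer" field."""
    with request.urlopen(build_ask_request(query, api_url), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))["answer"]


if __name__ == "__main__":
    # Requires the backend to be running and GEMINI_API_KEY to be set
    print(ask("Say hello in one short sentence."))
```

Separating request construction from sending keeps the payload shape easy to inspect without a live server.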

Developer Resources

API Documentation

Constructor

const agent = new MBZVoiceAgent({
  apiUrl: "/ask",      // Backend API endpoint
  speak: true          // Enable text-to-speech
})

apiUrl

The endpoint URL for your backend API that processes voice input

Default: "/ask"

speak

Whether to enable text-to-speech for AI responses

Default: true

Coming Soon

Package Distribution

NPM Package

Soon you'll be able to install MBZ Voice SDK directly from npm with a simple command.

npm install mbz-voice-sdk

PIP Package

The backend will soon be available as a Python package for easy installation.

pip install mbz-voice-sdk

Free Forever

100% Free & Open Source

MBZ Voice SDK is completely free to use in your projects. No hidden fees, no usage limits, no credit card required.

Free to Use

Use in commercial and personal projects with no restrictions

Open Source

View, modify, and contribute to the source code on GitHub

No Extra API Keys

You only need a Gemini API key for the AI functionality