🤖 AI Assistant

A modern AI assistant featuring voice control, voice interaction, camera analysis, code generation, and device management. Built with Python and PyQt6, providing a seamless and intuitive interface for AI-powered tasks.

🎥 Live Demo

AI Assistant in Action

Experience the seamless interaction with voice commands, real-time responses, and intelligent assistance.

✨ Key Features Demonstrated:

🎤 Voice Interaction: Natural conversation with wake word activation
🤖 AI Processing: Real-time responses powered by Gemini AI
💻 Smart Interface: Clean, modern UI with intuitive controls
⚡ Quick Actions: Efficient command processing and execution

🌟 Quick Preview

Voice Commands

Wake word activation and voice response

AI Processing

Real-time AI-powered interactions

🌟 Features Overview

graph TD
    A["🤖 AI Assistant"] --> B["🎤 Voice"]
    A --> C["📸 Camera"]
    A --> D["💻 Code"]
    A --> E["⚙️ Config"]
    A --> F["📱 Devices"]
    A --> G["🔗 Apps"]
    
    B --> B1(["Recognition"])
    B --> B2(["TTS"])
    B --> B3(["STT"])
    
    C --> C1(["Analysis"])
    C --> C2(["Upload"])
    C --> C3(["discription"])

    
    D --> D1(["Complete"])
    D --> D2(["Highlight"])
    
    E --> E1(["Settings"])
    E --> E2(["API Keys"])
    
    F --> F1(["Bluetooth"])
    F --> F2(["Serial"])
    F --> F3(["Wifi"])
    
    G --> G1(["Launch"])
    G --> G2(["Commands"])

    %% Color definitions
    classDef root fill:#2c3e50,stroke:#2c3e50,color:#fff
    classDef main fill:#3498db,stroke:#2980b9,color:#fff
    classDef sub fill:#e8f4f8,stroke:#3498db,color:#2c3e50

    %% Apply colors
    class A root
    class B,C,D,E,F,G main
    class B1,B2,B3,C1,C2,C3,D1,D2,E1,E2,F1,F2,F3,G1,G2 sub

Key Features

Category	Features
🎤 Voice	Wake word detection, Speech recognition, Text-to-speech, Noise reduction
📸 Camera	Real-time analysis, Image upload, Custom prompts, Gemini Vision
💻 Code	Smart completion, Syntax highlighting, Editor integration
⚙️ Config	API setup, Voice settings, UI preferences, Storage
📱 Devices	Bluetooth control, Port detection, Serial communication
🔗 Apps	Custom commands, App launching, Command sequences

🎤 Voice Control

Wake Word Detection: Activate with "computer" using Picovoice Porcupine
Speech Recognition: Accurate voice-to-text with noise reduction
Text-to-Speech: Natural voice responses with multiple voice options
Noise Reduction: WebRTC-based voice activity detection

📸 Camera Features

Real-time Analysis: Live camera feed processing
Image Upload: Support for image file analysis
Custom Prompts: Tailored image analysis queries
Gemini Vision: Powered by Google's Gemini AI for image understanding

💻 Code Generation

Smart Completion: Context-aware code suggestions
Syntax Highlighting: Clear code visualization
Editor Integration: Custom editor configuration
Code Simulation: Typing simulation for demonstrations

⚙️ Settings Management

API Configuration: Gemini and Picovoice API key management
Voice Settings: Language and voice customization
UI Preferences: Theme and display options
Persistent Storage: Settings auto-save and recovery

📱 Device Management

Bluetooth Control: Serial communication with devices
Device Discovery: Automatic port detection
Connection Management: Connect/disconnect functionality
Custom Commands: Device-specific command handling

🔗 App Integration

Custom Commands: User-defined command sequences
App Launching: Quick access to favorite applications
Command Sequences: Multi-step automation
Settings Persistence: Saved configurations across sessions

🔑 Quick API Setup

🗣️ Wake Word (Picovoice)
Get Key →
Free: Default wake words & basic usage

🧠 AI Features (Gemini)
Get Key →
Free: Gemini Pro with generous limits

Add keys to .env:

PICOVOICE_API_KEY=xxxxx
GEMINI_API_KEY=xxxxx

🚀 Quick Start

Prerequisites

Python 3.8 or higher
pip (Python package installer)
Microphone (for voice features)
Camera (for image analysis)

Installation

Clone the repository:

git clone https://github.com/moego0/ai-assistant.git
cd ai-assistant

Create and activate virtual environment:

# Windows
python -m venv venv
venv\Scripts\activate

# Linux/Mac
python3 -m venv venv
source venv/bin/activate

Install dependencies:

pip install -r requirements.txt

Configure API Keys:
- Launch the application
- Navigate to Settings
- Add required API keys:
  - Gemini API key (AI features)
  - Porcupine key (wake word)

🎯 Usage Guide

Voice Interaction Flow

sequenceDiagram
    participant User
    participant Assistant
    participant API
    
    User->>Assistant: Say "Computer"
    Assistant->>User: "Yes?"
    User->>Assistant: Speak Command
    Assistant->>API: Process Command
    API->>Assistant: Response
    Assistant->>User: Voice & Text Response

🎤 Voice Commands

Activate: Say "computer"
Speak your command/question
Receive voice and text response

📸 Image Analysis

Access camera via camera icon
Options:
- Real-time analysis
- Image upload
- Custom analysis prompts

💻 Code Generation

Describe your code needs
Get formatted, syntax-highlighted code
Copy or save generated code

⚙️ Configuration

Settings Panel

Category	Options
Voice	Gender (Male/Female)
Language	English/Arabic/Bilingual
API Keys	Gemini, Porcupine
Model	Multiple Gemini models

Default Configuration

{
    "voice_gender": "male",
    "speech_language": "en-US",
    "vad_aggressiveness": 3
}

🛠️ Development

Requirements

Check requirements.txt for full dependency list:

PyQt6 >= 6.4.2
google-generativeai >= 0.3.0
SpeechRecognition >= 3.10.0
And more...

Project Structure

ai-assistant/
├── AI_Assistant.py    # Main application
├── requirements.txt   # Dependencies
├── README.md         # Documentation
├── LICENSE          # MIT License
└── .gitignore       # Git ignore rules

🤝 Contributing

Contributions welcome! Please:

Fork the repository
Create feature branch
Commit changes
Push to branch
Open pull request

📄 License

This project is licensed under the MIT License - see LICENSE file.

🙏 Acknowledgments

Google Gemini - AI capabilities
PyQt6 - UI framework
Picovoice - Wake word detection
Open source community

🏠 Device Control

Supported Devices

Smart Lights (Philips Hue, LIFX, etc.)
Smart Thermostats (Nest, Ecobee)
Security Cameras
Media Players (Smart TVs, Speakers)
Smart Plugs and Switches

Setup Process

Enable device discovery in Settings
Connect to your home network
Authorize devices
Create device groups (optional)

🤖 Automation

Features

Custom Script Creation
Scheduled Tasks
Event-Based Triggers
App Integration
Voice Command Macros

Example Automation

# Morning Routine Automation
@automation.schedule("07:00")
def morning_routine():
    # Turn on lights gradually
    smart_lights.fade_in(duration=300)
    # Set temperature
    thermostat.set_temperature(22)
    # Start coffee maker
    smart_plug.turn_on("coffee_maker")

📥 Installation Guide

Prerequisites

Python 3.8 or higher
Git
Microphone (for voice features)
Camera (optional, for image analysis)

Step 1: Clone the Repository

git clone https://github.com/moego0/ai-assistant.git
cd ai-assistant

Step 2: Set Up Virtual Environment

# Windows
python -m venv venv
venv\Scripts\activate

# Linux/macOS
python3 -m venv venv
source venv/bin/activate

Step 3: Install Dependencies

pip install -r requirements.txt

Step 4: Get API Keys

Picovoice (Wake Word Detection)
- Visit Picovoice Console
- Create a free account
- Get your API key
- Free tier includes:
  - Default wake words
  - Basic usage limits
Google Gemini (AI Features)
- Visit Google AI Studio
- Sign in with your Google account
- Get your API key
- Free tier includes:
  - Access to Gemini Pro
  - Generous usage limits

Step 5: Configuration

Create a .env file in the project root:

PICOVOICE_API_KEY=your_picovoice_key_here
GEMINI_API_KEY=your_gemini_key_here

(Optional) Customize settings in editor_config.json

🚀 Usage

Starting the Assistant

python AI_Assistant.py

Voice Commands

Say "computer" to activate
Wait for the activation sound
Speak your command
Examples:
- "What's the weather like?"
- "Generate some Python code"
- "Analyze this image"

Camera Features

Click the camera icon to start
Use "Analyze" for real-time analysis
Upload images for detailed analysis

Code Generation

Request code in natural language
Use the code editor for modifications
Save generated code to files

🤝 Contributing

Fork the repository
Create your feature branch:

git checkout -b feature/AmazingFeature

Commit your changes:

git commit -m 'Add some AmazingFeature'

Push to the branch:

git push origin feature/AmazingFeature

Open a Pull Request

🐛 Troubleshooting

Common Issues

Microphone not working
- Check microphone permissions
- Select correct input device in settings
Camera issues
- Ensure camera permissions are granted
- Check camera connection
API Key errors
- Get a valid API key from Picovoice and Gemini AI
- Check API key format
- Ensure free tier limits not exceeded

Getting Help

Open an issue on GitHub
Check existing issues
Include error messages and system info

🎯 How to Use

Wake Word

To activate the assistant, simply say: "Computer"

Available Commands

Command	Description	Example
`Open [application]`	Launch applications	"Open Chrome", "Open Spotify"
`Search for [query]`	Search the web	"Search for gold preices today"
`What's the time?`	Get current time/date	"What's the time?"
`Type [text]`	Type text via voice	"Type Hello World"
`Genrate code`	Genrate code	"genrate code for python calculator"
`control devices`	Open and close lights	"open red light"

Command Tips

Speak clearly and at a normal pace
Wait for the wake word acknowledgment before giving a command
Commands are case-insensitive
For application names, use common names (e.g., "chrome" instead of "google chrome")
Don't forget to initialize apps and devices before using them
Use the Genrate code command to generate code
Use the control devices command to open and close lights
Use the Open [application] command to launch applications
You can integrate this app with Arduino or Home assistant to control you hame and devices

Made with ❤️ by Mohamed Abdelraouf

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs		docs
.gitignore		.gitignore
AI_Assistant.py		AI_Assistant.py
LICENSE		LICENSE
README.md		README.md
editor_config.json		editor_config.json
preview.gif		preview.gif
requirements.txt		requirements.txt

License

moego0/ai-assistant

Folders and files

Latest commit

History

Repository files navigation

🤖 AI Assistant

🎥 Live Demo

✨ Key Features Demonstrated:

🌟 Quick Preview

🌟 Features Overview

Key Features

🎤 Voice Control

📸 Camera Features

💻 Code Generation

⚙️ Settings Management

📱 Device Management

🔗 App Integration

🔑 Quick API Setup

🚀 Quick Start

Prerequisites

Installation

🎯 Usage Guide

Voice Interaction Flow

🎤 Voice Commands

📸 Image Analysis

💻 Code Generation

⚙️ Configuration

Settings Panel

Default Configuration

🛠️ Development

Requirements

Project Structure

🤝 Contributing

📄 License

🙏 Acknowledgments

🏠 Device Control

Supported Devices

Setup Process

🤖 Automation

Features

Example Automation

📥 Installation Guide

Prerequisites

Step 1: Clone the Repository

Step 2: Set Up Virtual Environment

Step 3: Install Dependencies

Step 4: Get API Keys

Step 5: Configuration

🚀 Usage

Starting the Assistant

Voice Commands

Camera Features

Code Generation

🤝 Contributing

🐛 Troubleshooting

Common Issues

Getting Help

🎯 How to Use

Wake Word

Available Commands

Command Tips

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Languages