AI Development Lab with MCP Server for secure, auditable AI tool interactions and RAG evaluation gates.
- Install dependencies: `pip install -r requirements.txt`
- Start the MCP server: `.venv/bin/python -m mcp_server.simple_server`
- Run tests: `pytest`
Run the evaluation locally:

```bash
# Run the full evaluation
python eval/run.py --dataset eval/data/lab/lab_dev.jsonl --output eval/runs/$(date +%Y%m%d-%H%M%S)

# Check the gates
python scripts/ci/parse_metrics.py eval/runs/*/metrics.json
```
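For orientation, here is a minimal sketch of the kind of check `scripts/ci/parse_metrics.py` performs against `metrics.json`; the metric names and thresholds below are assumptions for illustration, not the repository's actual gates:

```python
# gate_check_sketch.py -- illustrative only; the real gates live in scripts/ci/parse_metrics.py
import json
import sys

# Hypothetical gate thresholds; actual metric names and cutoffs are assumptions.
GATES = {
    "faithfulness": 0.80,
    "answer_relevancy": 0.75,
}

def check_gates(metrics_path: str) -> bool:
    """Return True only if every gated metric meets its threshold."""
    with open(metrics_path) as f:
        metrics = json.load(f)
    passed = True
    for name, threshold in GATES.items():
        value = metrics.get(name)
        if value is None or value < threshold:
            print(f"FAIL {name}: {value} < {threshold}")
            passed = False
        else:
            print(f"PASS {name}: {value} >= {threshold}")
    return passed

if __name__ == "__main__":
    # Exit nonzero on failure so CI can block the merge.
    sys.exit(0 if check_gates(sys.argv[1]) else 1)
```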
The MCP server provides the following tools:

- `run_command`: Execute terminal commands safely, with a timeout
- `check_file`: Check whether files exist and get their metadata
- `read_file`: Safely read files, with line limits
- `list_directory`: List directory contents, with limits
- `run_eval`: Run a RAG evaluation safely
- `check_gates`: Check whether evaluation gates pass
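To show how a tool like `run_command` can enforce its timeout, here is a minimal FastAPI endpoint sketch; the request model and route shape are assumptions for illustration, not the actual `mcp_server.simple_server` code:

```python
# Illustrative sketch only; not the actual mcp_server.simple_server implementation.
import subprocess

from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class RunCommandRequest(BaseModel):
    command: str
    timeout: int = 10  # seconds; caps how long the subprocess may run

@app.post("/tools/run_command")
def run_command(req: RunCommandRequest):
    """Run a shell command, killing it if it exceeds the timeout."""
    try:
        result = subprocess.run(
            req.command, shell=True, capture_output=True,
            text=True, timeout=req.timeout,
        )
        return {
            "exit_code": result.returncode,
            "stdout": result.stdout,
            "stderr": result.stderr,
        }
    except subprocess.TimeoutExpired:
        return {"error": f"command timed out after {req.timeout}s"}
```

The curl examples below exercise these endpoints over HTTP.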
```bash
# Test the MCP server
curl -X POST http://localhost:8000/tools/run_command \
  -H "Content-Type: application/json" \
  -d '{"command": "ls -la", "timeout": 10}'

# Check file existence
curl -X POST http://localhost:8000/tools/check_file \
  -H "Content-Type: application/json" \
  -d '{"filepath": "eval/run.py"}'

# Run an evaluation
curl -X POST http://localhost:8000/tools/run_eval \
  -H "Content-Type: application/json" \
  -d '{"dataset": "eval/data/lab/lab_dev.jsonl", "output_dir": "eval/runs/test"}'
```
Key components:

- MCP Server: FastAPI-based server exposing AI tools via the MCP protocol
- Security: Guardian-based access control and PII redaction
- Audit: Comprehensive logging of all tool interactions
- Evaluation: Automated testing and metrics for AI models
- RAG Gates: Evaluation framework with automated CI integration
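To make the security layer concrete, here is a minimal sketch of regex-based PII redaction; the patterns and function name are illustrative assumptions, not the repository's Guardian implementation:

```python
# redact_sketch.py -- illustrative PII redaction; not the actual Guardian code
import re

# Hypothetical patterns; a real redactor would cover many more PII categories.
PII_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def redact(text: str) -> str:
    """Replace each detected PII span with a labeled placeholder."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[REDACTED_{label.upper()}]", text)
    return text

if __name__ == "__main__":
    print(redact("Contact jane@example.com, SSN 123-45-6789"))
    # -> Contact [REDACTED_EMAIL], SSN [REDACTED_SSN]
```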
Repository layout:

- `lab/`: Research and development experiments
- `eval/`: Evaluation framework and gates
- `mcp_server/`: MCP server implementation
- `evidence/`: Evaluation evidence and reports
See `docs/cursor-usage.md` for Cursor IDE setup and usage.