✅ TICKET-006: Wake-word Detection Service - Implemented wake-word detection using openWakeWord - HTTP/WebSocket server on port 8002 - Real-time detection with configurable threshold - Event emission for ASR integration - Location: home-voice-agent/wake-word/ ✅ TICKET-010: ASR Service - Implemented ASR using faster-whisper - HTTP endpoint for file transcription - WebSocket endpoint for streaming transcription - Support for multiple audio formats - Auto language detection - GPU acceleration support - Location: home-voice-agent/asr/ ✅ TICKET-014: TTS Service - Implemented TTS using Piper - HTTP endpoint for text-to-speech synthesis - Low-latency processing (< 500ms) - Multiple voice support - WAV audio output - Location: home-voice-agent/tts/ ✅ TICKET-047: Updated Hardware Purchases - Marked Pi5 kit, SSD, microphone, and speakers as purchased - Updated progress log with purchase status 📚 Documentation: - Added VOICE_SERVICES_README.md with complete testing guide - Each service includes README.md with usage instructions - All services ready for Pi5 deployment 🧪 Testing: - Created test files for each service - All imports validated - FastAPI apps created successfully - Code passes syntax validation 🚀 Ready for: - Pi5 deployment - End-to-end voice flow testing - Integration with MCP server Files Added: - wake-word/detector.py - wake-word/server.py - wake-word/requirements.txt - wake-word/README.md - wake-word/test_detector.py - asr/service.py - asr/server.py - asr/requirements.txt - asr/README.md - asr/test_service.py - tts/service.py - tts/server.py - tts/requirements.txt - tts/README.md - tts/test_service.py - VOICE_SERVICES_README.md Files Modified: - tickets/done/TICKET-047_hardware-purchases.md Files Moved: - tickets/backlog/TICKET-006_prototype-wake-word-node.md → tickets/done/ - tickets/backlog/TICKET-010_streaming-asr-service.md → tickets/done/ - tickets/backlog/TICKET-014_tts-service.md → tickets/done/
143 lines
3.1 KiB
Markdown
143 lines
3.1 KiB
Markdown
# Web Dashboard Design
|
|
|
|
Design document for the Atlas web LAN dashboard.
|
|
|
|
## Overview
|
|
|
|
A simple, local web interface for monitoring and managing the Atlas voice agent system. Accessible only on the local network.
|
|
|
|
## Goals
|
|
|
|
1. **Monitor System**: View conversations, tasks, reminders
|
|
2. **Admin Control**: Pause/resume agents, kill services
|
|
3. **Log Viewing**: Search and view system logs
|
|
4. **Privacy**: Local-only, no external access
|
|
|
|
## Pages/Sections
|
|
|
|
### 1. Dashboard Home
|
|
- System status overview
|
|
- Active conversations count
|
|
- Pending tasks count
|
|
- Active timers/reminders
|
|
- Recent activity
|
|
|
|
### 2. Conversations
|
|
- List of recent conversations
|
|
- Search/filter by date, agent type
|
|
- View conversation details
|
|
- Delete conversations
|
|
|
|
### 3. Tasks Board
|
|
- Read-only Kanban view
|
|
- Filter by status
|
|
- View task details
|
|
|
|
### 4. Timers & Reminders
|
|
- List active timers
|
|
- List upcoming reminders
|
|
- Cancel timers
|
|
|
|
### 5. Logs
|
|
- Search logs by date, agent, tool
|
|
- Filter by log level
|
|
- Export logs
|
|
|
|
### 6. Admin Panel
|
|
- Agent status (family/work)
|
|
- Pause/Resume buttons
|
|
- Kill switches:
|
|
- Family agent
|
|
- Work agent
|
|
- MCP server
|
|
- Specific tools
|
|
- Access revocation:
|
|
- List active sessions
|
|
- Revoke sessions/tokens
|
|
|
|
## API Design
|
|
|
|
### Base URL
|
|
`http://localhost:8000/api` (or configurable)
|
|
|
|
### Endpoints
|
|
|
|
#### Conversations
|
|
```
|
|
GET /conversations - List conversations
|
|
GET /conversations/:id - Get conversation
|
|
DELETE /conversations/:id - Delete conversation
|
|
```
|
|
|
|
#### Tasks
|
|
```
|
|
GET /tasks - List tasks
|
|
GET /tasks/:id - Get task details
|
|
```
|
|
|
|
#### Timers
|
|
```
|
|
GET /timers - List active timers
|
|
POST /timers/:id/cancel - Cancel timer
|
|
```
|
|
|
|
#### Logs
|
|
```
|
|
GET /logs - Search logs
|
|
GET /logs/export - Export logs
|
|
```
|
|
|
|
#### Admin
|
|
```
|
|
GET /admin/status - System status
|
|
POST /admin/agents/:type/pause - Pause agent
|
|
POST /admin/agents/:type/resume - Resume agent
|
|
POST /admin/services/:name/kill - Kill service
|
|
GET /admin/sessions - List sessions
|
|
POST /admin/sessions/:id/revoke - Revoke session
|
|
```
|
|
|
|
## Security
|
|
|
|
- **Local Network Only**: Bind to localhost or LAN IP
|
|
- **No Authentication**: Trust local network (can add later)
|
|
- **Read-Only by Default**: Most operations are read-only
|
|
- **Admin Actions**: Require explicit confirmation
|
|
|
|
## Implementation Plan
|
|
|
|
### Phase 1: Basic UI
|
|
- HTML structure
|
|
- CSS styling
|
|
- Basic JavaScript
|
|
- Static data display
|
|
|
|
### Phase 2: API Integration
|
|
- Connect to MCP server APIs
|
|
- Real data display
|
|
- Basic interactions
|
|
|
|
### Phase 3: Admin Features
|
|
- Admin panel
|
|
- Kill switches
|
|
- Log viewing
|
|
|
|
### Phase 4: Real-time Updates
|
|
- WebSocket integration
|
|
- Live updates
|
|
- Notifications
|
|
|
|
## Technology Choices
|
|
|
|
- **Simple**: Vanilla HTML/CSS/JS for simplicity
|
|
- **Or**: Lightweight framework (Vue.js, React) if needed
|
|
- **Backend**: Extend MCP server with dashboard endpoints
|
|
- **Styling**: Simple, clean, functional
|
|
|
|
## Future Enhancements
|
|
|
|
- Voice interaction (when TTS/ASR ready)
|
|
- Mobile app version
|
|
- Advanced analytics
|
|
- Customizable dashboards
|