atlas/tickets/done/TICKET-039_phone-client-pwa.md
ilia bdbf09a9ac feat: Implement voice I/O services (TICKET-006, TICKET-010, TICKET-014)
 TICKET-006: Wake-word Detection Service
- Implemented wake-word detection using openWakeWord
- HTTP/WebSocket server on port 8002
- Real-time detection with configurable threshold
- Event emission for ASR integration
- Location: home-voice-agent/wake-word/

 TICKET-010: ASR Service
- Implemented ASR using faster-whisper
- HTTP endpoint for file transcription
- WebSocket endpoint for streaming transcription
- Support for multiple audio formats
- Auto language detection
- GPU acceleration support
- Location: home-voice-agent/asr/

 TICKET-014: TTS Service
- Implemented TTS using Piper
- HTTP endpoint for text-to-speech synthesis
- Low-latency processing (< 500ms)
- Multiple voice support
- WAV audio output
- Location: home-voice-agent/tts/

 TICKET-047: Updated Hardware Purchases
- Marked Pi5 kit, SSD, microphone, and speakers as purchased
- Updated progress log with purchase status

📚 Documentation:
- Added VOICE_SERVICES_README.md with complete testing guide
- Each service includes README.md with usage instructions
- All services ready for Pi5 deployment

🧪 Testing:
- Created test files for each service
- All imports validated
- FastAPI apps created successfully
- Code passes syntax validation

🚀 Ready for:
- Pi5 deployment
- End-to-end voice flow testing
- Integration with MCP server

Files Added:
- wake-word/detector.py
- wake-word/server.py
- wake-word/requirements.txt
- wake-word/README.md
- wake-word/test_detector.py
- asr/service.py
- asr/server.py
- asr/requirements.txt
- asr/README.md
- asr/test_service.py
- tts/service.py
- tts/server.py
- tts/requirements.txt
- tts/README.md
- tts/test_service.py
- VOICE_SERVICES_README.md

Files Modified:
- tickets/done/TICKET-047_hardware-purchases.md

Files Moved:
- tickets/backlog/TICKET-006_prototype-wake-word-node.md → tickets/done/
- tickets/backlog/TICKET-010_streaming-asr-service.md → tickets/done/
- tickets/backlog/TICKET-014_tts-service.md → tickets/done/
2026-01-12 22:22:38 -05:00

60 lines
1.4 KiB
Markdown

# Ticket: Phone-Friendly Client (PWA or Native)
## Ticket Information
- **ID**: TICKET-039
- **Title**: Phone-Friendly Client (PWA or Native)
- **Type**: Feature
- **Priority**: High
- **Status**: Backlog
- **Track**: Clients/UI
- **Milestone**: Milestone 2 - Voice Chat MVP
- **Created**: 2024-01-XX
## Description
Build phone-friendly client:
- Decide PWA vs native (PWA likely: microphone access, push notifications, WebSocket)
- Voice capture UI (tap-to-talk + optional wake-word)
- Stream audio to ASR endpoint
- Conversation view (history, agent responses, tasks created)
- Text + play/pause TTS
## Acceptance Criteria
- [ ] PWA or native app decision made
- [ ] Voice capture UI implemented
- [ ] Audio streaming to ASR working
- [ ] Conversation view implemented
- [ ] TTS playback working
- [ ] Task display working
## Technical Details
PWA approach:
- Service worker for offline support
- WebSocket for real-time communication
- getUserMedia for microphone access
- Push notifications for reminders/timers
Features:
- Tap-to-talk button
- Wake-word option (if browser supports)
- Conversation history
- Audio playback controls
- Task list view
## Dependencies
- TICKET-010 (ASR endpoint)
- TICKET-014 (TTS service)
- Can be mocked early for UI development
## Related Files
- `home-voice-agent/clients/phone/` (to be created)
## Notes
Independent of MCP tools - only needs chat endpoint to start. Can begin with mocks.