- Added .cursorrules for project guidelines and context - Created README.md for project overview and goals - Established ARCHITECTURE.md for architectural documentation - Set up tickets directory with initial ticket management files - Included .gitignore to manage ignored files and directories This commit lays the foundation for the Atlas project, ensuring a clear structure for development and collaboration.
50 lines
1.4 KiB
Markdown
50 lines
1.4 KiB
Markdown
# Ticket: Select ASR Engine and Target Hardware
|
|
|
|
## Ticket Information
|
|
|
|
- **ID**: TICKET-009
|
|
- **Title**: Select ASR Engine and Target Hardware
|
|
- **Type**: Research
|
|
- **Priority**: High
|
|
- **Status**: Backlog
|
|
- **Track**: Voice I/O
|
|
- **Milestone**: Milestone 1 - Survey & Architecture
|
|
- **Created**: 2024-01-XX
|
|
|
|
## Description
|
|
|
|
Decide on ASR (Automatic Speech Recognition) engine and deployment:
|
|
- Evaluate options: faster-whisper, Whisper.cpp, etc.
|
|
- Decide deployment: faster-whisper on 4080, CPU-only on small box, or shared
|
|
- Consider model size vs latency trade-offs
|
|
- Document hardware requirements
|
|
|
|
## Acceptance Criteria
|
|
|
|
- [ ] ASR engine selected (faster-whisper recommended)
|
|
- [ ] Target hardware decided (4080 vs CPU box)
|
|
- [ ] Model size selected (medium vs small)
|
|
- [ ] Latency requirements documented
|
|
- [ ] Decision recorded in architecture docs
|
|
|
|
## Technical Details
|
|
|
|
Considerations:
|
|
- faster-whisper on 4080: Lower latency, higher quality
|
|
- CPU-only on small box: Lower cost, higher latency
|
|
- Shared deployment: Resource contention considerations
|
|
- Model sizes: tiny/small/medium/base for latency/quality trade-off
|
|
|
|
## Dependencies
|
|
|
|
- TICKET-004 (architecture) - helpful context
|
|
|
|
## Related Files
|
|
|
|
- `docs/ASR_EVALUATION.md` (to be created)
|
|
- `ARCHITECTURE.md`
|
|
|
|
## Notes
|
|
|
|
Can run in parallel with TTS and LLM work. Needs wake-word event flow defined for when to start/stop capture.
|