Evaluate TTS Options #2

Merged
ilia merged 2 commits from vk/45ad-evaluate-tts-opt into master 2026-01-05 21:30:15 -05:00
Owner

Ticket: Evaluate TTS Options

Ticket Information

  • ID: TICKET-013
  • Title: Evaluate TTS Options
  • Type: Research
  • Priority: High
  • Status: Backlog
  • Track: Voice I/O
  • Milestone: Milestone 1 - Survey & Architecture
  • Created: 2024-01-XX

Description

Evaluate text-to-speech options:

  • Compare open source options (Piper, Mimic 3, etc.)
  • Evaluate local neural TTS solutions
  • Select 1-2 voices for family agent
  • Consider latency, quality, and resource usage

Acceptance Criteria

  • TTS options compared
  • Selected TTS engine documented
  • Voice samples selected
  • Resource requirements documented
  • Decision recorded in architecture docs

Technical Details

Options to evaluate:

  • Piper (lightweight, fast)
  • Mimic 3 (high quality)
  • Coqui TTS (neural, customizable)
  • Other open-source solutions

Considerations:

  • Latency for interactive use
  • Voice quality and naturalness
  • Resource usage
  • Customization options

Dependencies

  • TICKET-004 (architecture) - helpful context
  • docs/TTS_EVALUATION.md (to be created)
  • ARCHITECTURE.md

Notes

Independent of LLM logic. Can be developed in parallel with other voice work.

# Ticket: Evaluate TTS Options ## Ticket Information - **ID**: TICKET-013 - **Title**: Evaluate TTS Options - **Type**: Research - **Priority**: High - **Status**: Backlog - **Track**: Voice I/O - **Milestone**: Milestone 1 - Survey & Architecture - **Created**: 2024-01-XX ## Description Evaluate text-to-speech options: - Compare open source options (Piper, Mimic 3, etc.) - Evaluate local neural TTS solutions - Select 1-2 voices for family agent - Consider latency, quality, and resource usage ## Acceptance Criteria - [ ] TTS options compared - [ ] Selected TTS engine documented - [ ] Voice samples selected - [ ] Resource requirements documented - [ ] Decision recorded in architecture docs ## Technical Details Options to evaluate: - Piper (lightweight, fast) - Mimic 3 (high quality) - Coqui TTS (neural, customizable) - Other open-source solutions Considerations: - Latency for interactive use - Voice quality and naturalness - Resource usage - Customization options ## Dependencies - TICKET-004 (architecture) - helpful context ## Related Files - `docs/TTS_EVALUATION.md` (to be created) - `ARCHITECTURE.md` ## Notes Independent of LLM logic. Can be developed in parallel with other voice work.
ilia added 2 commits 2026-01-05 21:26:27 -05:00
This commit completes the evaluation of Text-to-Speech (TTS) options
as described in TICKET-013.

- Creates a detailed  document comparing Piper,
  Mimic 3, and Coqui TTS.
- Recommends Piper for initial development due to its performance and
  low resource usage.
- Updates  to reflect the decision and points to the
  new evaluation document.
- Moves TICKET-013 to the 'done' column.
ilia merged commit 4a0bfa773f into master 2026-01-05 21:30:15 -05:00
Sign in to join this conversation.
No Reviewers
No Label
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: ilia/atlas#2
No description provided.