- Enhanced `ARCHITECTURE.md` with details on LLM models for work (Llama 3.1 70B Q4) and family agents (Phi-3 Mini 3.8B Q4). - Introduced new documents: - `ASR_EVALUATION.md` for ASR engine evaluation and selection. - `HARDWARE.md` outlining hardware requirements and purchase plans. - `IMPLEMENTATION_GUIDE.md` for Milestone 2 implementation steps. - `LLM_CAPACITY.md` assessing VRAM and context window limits. - `LLM_MODEL_SURVEY.md` surveying open-weight LLM models. - `LLM_USAGE_AND_COSTS.md` detailing LLM usage and operational costs. - `MCP_ARCHITECTURE.md` describing the Model Context Protocol architecture. - `MCP_IMPLEMENTATION_SUMMARY.md` summarizing MCP implementation status. These updates provide comprehensive guidance for the next phases of development and ensure clarity in project documentation.
8.0 KiB
Hardware Requirements and Purchase Plan
Overview
This document outlines hardware requirements for the Atlas voice agent system, based on completed technology evaluations and model selections.
Hardware Status
Already Available
- ✅ RTX 4080 (16GB VRAM) - Work agent LLM + ASR
- ✅ RTX 1050 (4GB VRAM) - Family agent LLM
- ✅ Servers - Hosting for 4080 and 1050
Required Hardware
Must-Have / Critical for MVP
1. Microphones (Priority: High)
Requirements:
- High-quality USB microphones or array mic
- For living room/office wake-word detection and voice capture
- Good noise cancellation for home environment
- Multiple locations may be needed
Options:
Option A: USB Microphones (Recommended)
- Blue Yeti or Audio-Technica ATR2100x-USB
- Cost: $50-150 each
- Quantity: 1-2 (living room + office)
- Pros: Good quality, easy setup, USB plug-and-play
- Cons: Requires USB connection to always-on node
Option B: Array Microphone
- ReSpeaker 4-Mic Array or similar
- Cost: $30-50
- Quantity: 1-2
- Pros: Better directionality, designed for voice assistants
- Cons: May need additional setup/configuration
Option C: Headset (For Desk Usage)
- Logitech H390 or similar USB headset
- Cost: $30-50
- Quantity: 1
- Pros: Lower noise, good for focused work
- Cons: Not hands-free
Recommendation: Start with 1-2 USB microphones (Option A) for MVP
Purchase Priority: ⭐⭐⭐ Critical - Needed for wake-word and ASR testing
2. Always-On Node (Priority: High)
Requirements:
- Small, low-power device for wake-word detection
- Can also run ASR if using CPU deployment
- 24/7 operation capability
- Network connectivity
Options:
Option A: Raspberry Pi 4+ (Recommended)
- Specs: 4GB+ RAM, microSD card (64GB+)
- Cost: $75-100 (with case, power supply, SD card)
- Pros: Low power, well-supported, good for wake-word
- Cons: Limited CPU for ASR (would need GPU or separate ASR)
Option B: Intel NUC (Small Form Factor)
- Specs: i3 or better, 8GB+ RAM, SSD
- Cost: $200-400
- Pros: More powerful, can run ASR on CPU, better for always-on
- Cons: Higher cost, more power consumption
Option C: Old SFF PC (If Available)
- Specs: Any modern CPU, 8GB+ RAM
- Cost: $0 (if repurposing)
- Pros: Free, likely sufficient
- Cons: May be larger, noisier, higher power
Recommendation:
- If using ASR on 4080: Raspberry Pi 4+ is sufficient (wake-word only)
- If using ASR on CPU: Intel NUC or SFF PC recommended
Purchase Priority: ⭐⭐⭐ Critical - Needed for wake-word node
3. Storage (Priority: Medium)
Requirements:
- Additional storage for logs, transcripts, note archives
- SSD for logs (fast access)
- HDD for archives (cheaper, larger capacity)
Options:
Option A: External SSD
- Size: 500GB-1TB
- Cost: $50-100
- Use: Logs, active transcripts
- Pros: Fast, portable
Option B: External HDD
- Size: 2TB-4TB
- Cost: $60-120
- Use: Archives, backups
- Pros: Large capacity, cost-effective
Recommendation:
- If space available on existing drives: Can defer
- If needed: 500GB SSD for logs + 2TB HDD for archives
Purchase Priority: ⭐⭐ Medium - Can use existing storage initially
4. Network Gear (Priority: Low)
Requirements:
- Extra Ethernet runs or PoE switch (if needed)
- For connecting mic nodes and servers
Options:
Option A: PoE Switch
- Ports: 8-16 ports
- Cost: $50-150
- Use: Power and connect mic nodes
- Pros: Clean setup, single cable
Option B: Ethernet Cables
- Length: As needed
- Cost: $10-30
- Use: Direct connections
- Pros: Simple, cheap
Recommendation: Only if needed for clean setup. Can use WiFi for Pi initially.
Purchase Priority: ⭐ Low - Only if needed for deployment
Nice-to-Have (Post-MVP)
5. Dedicated Low-Power Box for 1050 (Priority: Low)
Requirements:
- If current 1050 host is noisy or power-hungry
- Small, quiet system for family agent
Options:
- Mini-ITX build with 1050
- Small form factor case
- Cost: $200-400 (if building new)
Recommendation: Only if current setup is problematic
Purchase Priority: ⭐ Low - Optional optimization
6. UPS (Uninterruptible Power Supply) (Priority: Medium)
Requirements:
- Protect 4080/1050 servers from abrupt shutdowns
- Prevent data loss during power outages
- Runtime: 10-30 minutes
Options:
- APC Back-UPS 600VA or similar
- Cost: $80-150
- Capacity: 600-1000VA
Recommendation: Good investment for data protection
Purchase Priority: ⭐⭐ Medium - Recommended but not critical for MVP
7. Dashboard Display (Priority: Low)
Requirements:
- Small tablet or wall-mounted screen
- For LAN dashboard display
Options:
- Raspberry Pi Touchscreen (7" or 10")
- Cost: $60-100
- Use: Web dashboard display
Recommendation: Nice for visibility, but web dashboard works on any device
Purchase Priority: ⭐ Low - Optional, can use phone/tablet
Purchase Plan
Phase 1: MVP Essentials (Immediate)
Total Cost: $125-250
-
USB Microphone(s): $50-150
- 1-2 microphones for wake-word and voice capture
- Priority: Critical
-
Always-On Node: $75-200
- Raspberry Pi 4+ (if ASR on 4080) or NUC (if ASR on CPU)
- Priority: Critical
Subtotal: $125-350
Phase 2: Storage & Protection (After MVP Working)
Total Cost: $140-270
-
Storage: $50-100 (SSD) + $60-120 (HDD)
- Only if existing storage insufficient
- Priority: Medium
-
UPS: $80-150
- Protect servers from power loss
- Priority: Medium
Subtotal: $190-370
Phase 3: Optional Enhancements (Future)
Total Cost: $60-400
- Network Gear: $10-150 (if needed)
- Dashboard Display: $60-100 (optional)
- Dedicated 1050 Box: $200-400 (only if needed)
Subtotal: $270-650
Total Cost Estimate
- MVP Minimum: $125-250
- MVP + Storage/UPS: $315-620
- Full Setup: $585-1270
Recommendations by Deployment Option
If ASR on RTX 4080 (Recommended)
- Always-On Node: Raspberry Pi 4+ ($75-100) - Wake-word only
- Microphones: 1-2 USB mics ($50-150)
- Total MVP: $125-250
If ASR on CPU (Alternative)
- Always-On Node: Intel NUC ($200-400) - Wake-word + ASR
- Microphones: 1-2 USB mics ($50-150)
- Total MVP: $250-550
Purchase Timeline
Week 1 (MVP Start)
- ✅ Order USB microphone(s)
- ✅ Order always-on node (Pi 4+ or NUC)
- Goal: Get wake-word and basic voice capture working
Week 2-4 (After MVP Working)
- Order storage if needed
- Order UPS for server protection
- Goal: Stable, protected setup
Month 2+ (Enhancements)
- Network gear if needed
- Dashboard display (optional)
- Goal: Polish and optimization
Hardware Specifications Summary
Always-On Node (Wake-Word + Optional ASR)
Minimum (Raspberry Pi 4):
- CPU: ARM Cortex-A72 (quad-core)
- RAM: 4GB+
- Storage: 64GB microSD
- Network: Gigabit Ethernet, WiFi
- Power: 5V USB-C, ~5W
Recommended (Intel NUC - if ASR on CPU):
- CPU: Intel i3 or better
- RAM: 8GB+
- Storage: 256GB+ SSD
- Network: Gigabit Ethernet, WiFi
- Power: 12V, ~15-25W
Microphones
USB Microphone:
- Interface: USB 2.0+
- Sample Rate: 48kHz
- Bit Depth: 16-bit+
- Directionality: Cardioid or omnidirectional
Array Microphone:
- Channels: 4+ microphones
- Interface: USB or I2S
- Beamforming: Preferred
- Noise Cancellation: Preferred
Next Steps
- ✅ Hardware requirements documented
- ✅ Purchase plan created
- Action: Order MVP essentials (microphones + always-on node)
- Action: Set up always-on node for wake-word testing
- Action: Test microphone setup with wake-word detection
References
- Wake-Word Evaluation:
docs/WAKE_WORD_EVALUATION.md(when created) - ASR Evaluation:
docs/ASR_EVALUATION.md - Architecture:
ARCHITECTURE.md
Last Updated: 2024-01-XX Status: Requirements Complete - Ready for Purchase