- Added .cursorrules for project guidelines and context - Created README.md for project overview and goals - Established ARCHITECTURE.md for architectural documentation - Set up tickets directory with initial ticket management files - Included .gitignore to manage ignored files and directories This commit lays the foundation for the Atlas project, ensuring a clear structure for development and collaboration.
1.1 KiB
1.1 KiB
Ticket: Select Family Agent Model (1050)
Ticket Information
- ID: TICKET-020
- Title: Select Family Agent Model for 1050
- Type: Research
- Priority: High
- Status: Backlog
- Track: LLM Infra
- Milestone: Milestone 1 - Survey & Architecture
- Created: 2024-01-XX
Description
Select the LLM model for family agent on 1050:
- Small, instruction-tuned model
- Latency-optimized for 24/7 operation
- Suitable for 4GB VRAM
- Good instruction-following
Acceptance Criteria
- Family agent model selected
- Quantization level chosen
- Rationale documented
- Model file location specified
- Latency characteristics documented
Technical Details
Selection criteria:
- Small model size (1B-3B parameters)
- Instruction-tuned
- Low latency (< 1s response time)
- Function calling support
- Quantization: Q4 or Q5 for 4GB VRAM
Dependencies
- TICKET-017 (model survey)
- TICKET-018 (capacity assessment)
Related Files
docs/MODEL_SELECTION.md(to be created)
Notes
Optimized for always-on, low-latency family interactions. Separate from work agent.