- Added .cursorrules for project guidelines and context - Created README.md for project overview and goals - Established ARCHITECTURE.md for architectural documentation - Set up tickets directory with initial ticket management files - Included .gitignore to manage ignored files and directories This commit lays the foundation for the Atlas project, ensuring a clear structure for development and collaboration.
51 lines
1.1 KiB
Markdown
51 lines
1.1 KiB
Markdown
# Ticket: Select Family Agent Model (1050)
|
|
|
|
## Ticket Information
|
|
|
|
- **ID**: TICKET-020
|
|
- **Title**: Select Family Agent Model for 1050
|
|
- **Type**: Research
|
|
- **Priority**: High
|
|
- **Status**: Backlog
|
|
- **Track**: LLM Infra
|
|
- **Milestone**: Milestone 1 - Survey & Architecture
|
|
- **Created**: 2024-01-XX
|
|
|
|
## Description
|
|
|
|
Select the LLM model for family agent on 1050:
|
|
- Small, instruction-tuned model
|
|
- Latency-optimized for 24/7 operation
|
|
- Suitable for 4GB VRAM
|
|
- Good instruction-following
|
|
|
|
## Acceptance Criteria
|
|
|
|
- [ ] Family agent model selected
|
|
- [ ] Quantization level chosen
|
|
- [ ] Rationale documented
|
|
- [ ] Model file location specified
|
|
- [ ] Latency characteristics documented
|
|
|
|
## Technical Details
|
|
|
|
Selection criteria:
|
|
- Small model size (1B-3B parameters)
|
|
- Instruction-tuned
|
|
- Low latency (< 1s response time)
|
|
- Function calling support
|
|
- Quantization: Q4 or Q5 for 4GB VRAM
|
|
|
|
## Dependencies
|
|
|
|
- TICKET-017 (model survey)
|
|
- TICKET-018 (capacity assessment)
|
|
|
|
## Related Files
|
|
|
|
- `docs/MODEL_SELECTION.md` (to be created)
|
|
|
|
## Notes
|
|
|
|
Optimized for always-on, low-latency family interactions. Separate from work agent.
|