Ticket: Select Work Agent Model (4080)
Ticket Information
- ID: TICKET-019
- Title: Select Work Agent Model for 4080
- Type: Research
- Priority: High
- Status: Backlog
- Track: LLM Infra
- Milestone: Milestone 1 - Survey & Architecture
- Created: 2024-01-XX
Description
Select the LLM for the work agent that runs on the 4080. Requirements:
- Coding/research-optimized model
- Not used by family agent
- Suitable for 16GB VRAM with quantization
- Good function-calling support
Acceptance Criteria
- Work agent model selected
- Quantization level chosen
- Rationale documented
- Model file location specified
- Performance characteristics documented
Technical Details
Selection criteria:
- Coding capabilities (CodeLlama, DeepSeek Coder, etc.)
- Research/analysis capabilities
- Function-calling support
- Context window size
- Quantization: Q4-Q6 for 16GB VRAM
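
As a rough aid for the quantization criterion, the following sketch estimates whether a model of a given parameter count fits in 16GB of VRAM at Q4 through Q6. The bits-per-weight values, the 3 GiB reserve for KV cache and runtime buffers, and the function names `weights_gib` and `fits_in_vram` are all illustrative assumptions, not measurements. Real usage depends on the runtime, the context length, and the exact quantization format, so results should be confirmed against TICKET-018.

```python
GIB = 1024 ** 3

# Rough effective bits per weight for each quantization level.
# Assumed values: real formats differ because they store scales with the weights.
BITS_PER_WEIGHT = {"Q4": 4.5, "Q5": 5.5, "Q6": 6.5}


def weights_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / GIB


def fits_in_vram(
    params_billion: float,
    quant: str,
    vram_gib: float = 16.0,
    reserve_gib: float = 3.0,  # assumed headroom for KV cache and buffers
) -> bool:
    """Return True if the weights plus the fixed reserve fit in VRAM."""
    needed = weights_gib(params_billion, BITS_PER_WEIGHT[quant]) + reserve_gib
    return needed <= vram_gib


if __name__ == "__main__":
    # Hypothetical model sizes (billions of parameters), for illustration only.
    for size in (7, 13, 34):
        for quant, bits in BITS_PER_WEIGHT.items():
            verdict = "fits" if fits_in_vram(size, quant) else "too large"
            print(f"{size}B {quant}: ~{weights_gib(size, bits):.1f} GiB weights, {verdict}")
```

Under these assumptions, models in the 7B to 13B range fit at any of the three levels, while a 34B model exceeds 16GB even at Q4. A longer context window grows the KV cache, so the reserve should be raised accordingly when comparing candidates.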
Dependencies
- TICKET-017 (model survey)
- TICKET-018 (capacity assessment)
Related Files
docs/MODEL_SELECTION.md (to be created)
Notes
The work agent model is separate from the family agent model, so it can be selected independently.