- Added .cursorrules for project guidelines and context
- Created README.md for project overview and goals
- Established ARCHITECTURE.md for architectural documentation
- Set up tickets directory with initial ticket management files
- Included .gitignore to manage ignored files and directories

This commit lays the foundation for the Atlas project, ensuring a clear structure for development and collaboration.

# Ticket: Implement LLM Routing Layer

## Ticket Information

- **ID**: TICKET-023
- **Title**: Implement LLM Routing Layer
- **Type**: Feature
- **Priority**: High
- **Status**: Backlog
- **Track**: LLM Infra
- **Milestone**: Milestone 2 - Voice Chat MVP
- **Created**: 2024-01-XX

## Description

Build a routing layer for LLM requests:
- Simple gateway deciding family vs work agent
- Route based on origin/identity
- Load balancing if needed
- Request logging and metrics

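The gateway described above can be sketched roughly as follows. This is only an illustration, not the planned implementation: the `Gateway` class, the identity names, and the choice to send unknown identities to the family agent are all assumptions made for the example. Metrics are reduced to an in-memory per-agent counter.

```python
import logging
from collections import Counter

logger = logging.getLogger("llm_router")  # hypothetical logger name


class Gateway:
    """Chooses an agent per request, logs the decision, and counts requests."""

    def __init__(self, work_identities):
        # Hypothetical identity list; a real one would come from config or auth.
        self.work_identities = set(work_identities)
        # Basic metrics: number of routed requests per agent.
        self.request_counts = Counter()

    def choose_agent(self, identity: str) -> str:
        # Assumption: anything not known as a work identity goes to the family agent.
        agent = "work" if identity in self.work_identities else "family"
        self.request_counts[agent] += 1
        logger.info("routed identity=%s agent=%s", identity, agent)
        return agent


gateway = Gateway({"work-laptop"})
print(gateway.choose_agent("work-laptop"))      # work
print(gateway.choose_agent("kitchen-speaker"))  # family
print(dict(gateway.request_counts))
```

Keeping the counter on the gateway object makes it easy to swap in a real metrics backend later without touching the routing decision.
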
## Acceptance Criteria

- [ ] Routing gateway implemented
- [ ] Family vs work agent routing working
- [ ] Identity-based routing functional
- [ ] Request logging implemented
- [ ] Basic metrics collection

## Technical Details

Routing logic:
- Check request origin (client, IP, token)
- Route to the 1050 server (TICKET-022) for the family agent
- Route to the 4080 server (TICKET-021) for the work agent
- Fallback and error handling
- Rate limiting per agent

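One possible shape for the routing logic above, as a hedged sketch. The ticket does not specify how origins map to agents, what "fallback" means, or which rate-limiting scheme to use, so the following are assumptions: the lookup tables (`WORK_TOKENS`, `WORK_CLIENTS`, `WORK_IP_PREFIXES`) are hypothetical, unrecognized origins fall back to the family agent, an unavailable backend raises an error, and rate limiting uses a simple fixed window per agent. Backends are represented only by the labels `"1050"` and `"4080"`.

```python
import time
from dataclasses import dataclass
from typing import Optional

BACKENDS = {"family": "1050", "work": "4080"}  # labels from the ticket

# Hypothetical lookup tables, for illustration only.
WORK_TOKENS = {"work-token"}
WORK_CLIENTS = {"work-laptop"}
WORK_IP_PREFIXES = ("10.0.2.",)


@dataclass
class Origin:
    client: Optional[str] = None
    ip: Optional[str] = None
    token: Optional[str] = None


class RateLimited(Exception):
    """The agent has used up its request budget for the current window."""


class BackendUnavailable(Exception):
    """The backend chosen for the agent is not currently healthy."""


class FixedWindowLimiter:
    """Allows at most max_requests per agent in each window."""

    def __init__(self, max_requests, window_seconds=60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock
        self._windows = {}  # agent -> (window_start, count)

    def allow(self, agent: str) -> bool:
        now = self.clock()
        start, count = self._windows.get(agent, (now, 0))
        if now - start >= self.window_seconds:
            start, count = now, 0  # window expired, start a new one
        if count >= self.max_requests:
            self._windows[agent] = (start, count)
            return False
        self._windows[agent] = (start, count + 1)
        return True


def classify(origin: Origin) -> str:
    """Check token, then client, then IP; fall back to the family agent."""
    if origin.token in WORK_TOKENS or origin.client in WORK_CLIENTS:
        return "work"
    if origin.ip and origin.ip.startswith(WORK_IP_PREFIXES):
        return "work"
    return "family"


def route(origin: Origin, limiter: FixedWindowLimiter,
          healthy=frozenset(BACKENDS.values())) -> str:
    """Return the backend label for a request, or raise on limit or outage."""
    agent = classify(origin)
    if not limiter.allow(agent):
        raise RateLimited(agent)
    backend = BACKENDS[agent]
    if backend not in healthy:
        raise BackendUnavailable(backend)
    return backend
```

A caller would build one `FixedWindowLimiter` at startup and pass it to `route` for every request, for example `route(Origin(token="work-token"), limiter)`. The injectable `clock` exists only to make the limiter easy to test.
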
## Dependencies

- TICKET-021 (4080 server)
- TICKET-022 (1050 server)

## Related Files

- `home-voice-agent/routing/` (to be created)

## Notes

Start with a simple gateway. It can be enhanced later with more sophisticated routing.