- Added .cursorrules for project guidelines and context - Created README.md for project overview and goals - Established ARCHITECTURE.md for architectural documentation - Set up tickets directory with initial ticket management files - Included .gitignore to manage ignored files and directories This commit lays the foundation for the Atlas project, ensuring a clear structure for development and collaboration.
50 lines
1.1 KiB
Markdown
50 lines
1.1 KiB
Markdown
# Ticket: LLM Logging & Metrics
|
|
|
|
## Ticket Information
|
|
|
|
- **ID**: TICKET-024
|
|
- **Title**: LLM Logging & Metrics
|
|
- **Type**: Feature
|
|
- **Priority**: Medium
|
|
- **Status**: Backlog
|
|
- **Track**: LLM Infra
|
|
- **Milestone**: Milestone 2 - Voice Chat MVP
|
|
- **Created**: 2024-01-XX
|
|
|
|
## Description
|
|
|
|
Implement logging and metrics for LLM services:
|
|
- Structured logs (prompt, tools called, latency, cost estimates)
|
|
- GPU usage dashboards
|
|
- Request/response logging
|
|
- Error tracking
|
|
|
|
## Acceptance Criteria
|
|
|
|
- [ ] Structured logging implemented
|
|
- [ ] Log format includes: prompt, tools, latency
|
|
- [ ] GPU usage monitoring
|
|
- [ ] Basic metrics dashboard
|
|
- [ ] Error logging and alerting
|
|
|
|
## Technical Details
|
|
|
|
Logging format:
|
|
- JSON structured logs
|
|
- Fields: timestamp, agent, prompt, tools, latency, tokens, errors
|
|
- GPU metrics: utilization, memory, temperature
|
|
- Storage: local files or time-series DB
|
|
|
|
## Dependencies
|
|
|
|
- TICKET-021 (4080 server)
|
|
- TICKET-022 (1050 server)
|
|
|
|
## Related Files
|
|
|
|
- `home-voice-agent/monitoring/` (to be created)
|
|
|
|
## Notes
|
|
|
|
Important for debugging and optimization. Can be enhanced over time.
|