5 Commits

Author SHA256 Message Date
tlg
cf7c77b3b5 Add llmux implementation plan (30 tasks)
Covers project scaffolding, config, auth, VRAM manager, all four
backends, API routes, Dockerfile, deployment scripts, and four
phases of testing (integration, functional, VRAM, performance).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-03 22:43:37 +02:00
tlg
45947e80a4 Update manual steps: DNS done, Open WebUI config automated
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-03 22:25:51 +02:00
tlg
7187c58c5e Add llmux product requirements in StrictDoc format
42 requirements covering architecture, runtimes, models, VRAM
management, API, authentication, configuration, integration,
and four-phase testing plan.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-03 21:11:05 +02:00
tlg
bd0ed74d32 Clarify VRAM eviction rule for cross-priority edge case
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-03 13:20:53 +02:00
tlg
e6be9dcb85 Add llmux design specification
Covers architecture, model registry, VRAM management, API endpoints,
container setup, Open WebUI integration, Traefik routing, and
four-phase testing plan.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-03 13:15:46 +02:00