DesTEngSsv006_swd

tlg d7a091df8c feat: VRAM manager with priority-based model eviction

Tracks GPU VRAM usage (16GB) and handles model loading/unloading with
priority-based eviction: LLM (lowest) -> TTS -> ASR (highest, protected).
Uses asyncio Lock for concurrency safety.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-04-04 09:14:41 +02:00
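
The commit message above describes the eviction policy only in outline, so here is a minimal sketch of how such a manager might look. It is not the code from this commit: the names `VRAMManager`, `Priority`, `LoadedModel`, and `load`, the choice to raise `MemoryError` when a model cannot fit, and the rule that any non-ASR model may be evicted regardless of the requester's priority are all assumptions made for illustration. Only the 16 GB budget, the LLM -> TTS -> ASR ordering, ASR being protected, and the use of an asyncio Lock come from the commit message.

```python
from __future__ import annotations

import asyncio
from dataclasses import dataclass
from enum import IntEnum


class Priority(IntEnum):
    """Eviction order taken from the commit message: lower value is evicted first."""
    LLM = 0  # lowest priority, evicted first
    TTS = 1
    ASR = 2  # highest priority, protected (never evicted in this sketch)


@dataclass
class LoadedModel:
    name: str
    priority: Priority
    size_gb: float


class VRAMManager:
    """Hypothetical manager; bookkeeping only, no real GPU calls."""

    def __init__(self, total_gb: float = 16.0) -> None:
        self.total_gb = total_gb
        self.loaded: dict[str, LoadedModel] = {}
        # Serializes load/evict decisions across concurrent coroutines.
        self._lock = asyncio.Lock()

    @property
    def used_gb(self) -> float:
        return sum(m.size_gb for m in self.loaded.values())

    async def load(self, name: str, priority: Priority, size_gb: float) -> list[str]:
        """Register a model, evicting lower-priority models if needed.

        Returns the names of evicted models. Raises MemoryError (an
        assumption of this sketch) if the model cannot fit even after
        evicting every unprotected model; in that case nothing is evicted.
        """
        async with self._lock:
            if name in self.loaded:
                return []
            free = self.total_gb - self.used_gb
            # Unprotected models, lowest priority first.
            candidates = sorted(
                (m for m in self.loaded.values() if m.priority is not Priority.ASR),
                key=lambda m: m.priority,
            )
            to_evict: list[LoadedModel] = []
            for model in candidates:
                if free >= size_gb:
                    break
                to_evict.append(model)
                free += model.size_gb
            if free < size_gb:
                raise MemoryError(f"cannot fit {name} ({size_gb} GB)")
            for model in to_evict:
                del self.loaded[model.name]
            self.loaded[name] = LoadedModel(name, priority, size_gb)
            return [m.name for m in to_evict]


async def main() -> None:
    manager = VRAMManager(total_gb=16.0)
    await manager.load("asr", Priority.ASR, 4.0)
    await manager.load("llm", Priority.LLM, 8.0)
    await manager.load("tts", Priority.TTS, 3.0)
    # Only 1 GB is free here, so the LLM is evicted to make room.
    evicted = await manager.load("llm2", Priority.LLM, 6.0)
    print("evicted:", evicted, "used:", manager.used_gb)


if __name__ == "__main__":
    asyncio.run(main())
```

The eviction plan is computed before any model is removed, so a request that cannot be satisfied leaves the loaded set unchanged. A real implementation would also have to unload the model weights from the GPU, which this sketch leaves out.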

config

feat: project scaffolding with config files and test fixtures

2026-04-04 07:23:14 +02:00

docs/superpowers

Add llmux implementation plan (30 tasks)

2026-04-03 22:43:37 +02:00

llmux

feat: VRAM manager with priority-based model eviction

2026-04-04 09:14:41 +02:00

tests

feat: VRAM manager with priority-based model eviction

2026-04-04 09:14:41 +02:00

requirements.txt

feat: project scaffolding with config files and test fixtures

2026-04-04 07:23:14 +02:00