Upload files to "/"

This commit is contained in:
2026-04-05 11:47:39 +00:00
parent 09997ffd64
commit dafaa490be
4 changed files with 48 additions and 30 deletions
+10 -8
View File
@@ -24,7 +24,7 @@ An intelligent prompt classification and routing pipeline for [Open WebUI](https
| reasoning (FI) | gpt-oss:120b | gpt-oss:20b | Analysis, comparison, strategy (Finnish) |
| reasoning (EN) | gpt-oss:120b | gpt-oss:20b | Analysis, comparison, strategy (English) |
| image generation | gpt-oss:120b + SDXL | gpt-oss:20b + SDXL | "generate an image", "luo kuva" |
| uncensored image | Juggernaut XL v9 | Juggernaut XL v9 | Prompt starts with `uncen` |
| uncensored image | dolphin-mistral:7b + Juggernaut XL v9 | dolphin-mistral:7b + Juggernaut XL v9 | Prompt starts with `uncen` |
| vision | llama3.2-vision:11b | llama3.2-vision:11b | User uploads an image |
| general | gpt-oss:120b | gpt-oss:20b | Everything else |
@@ -43,6 +43,7 @@ Two pipeline variants are provided:
ollama pull qwen2.5-coder:14b
ollama pull gpt-oss:120b # or gpt-oss:20b for the lighter variant
ollama pull llama3.2-vision:11b
ollama pull dolphin-mistral:7b # uncensored model for image prompt refinement
```
- **Brave Search API key** (free tier: https://brave.com/search/api/)
@@ -221,7 +222,7 @@ uncen a beautiful sunset over the ocean
uncen portrait of a warrior in golden armor
```
The `uncen` prefix is stripped and the user's text is sent directly to Stable Diffusion with quality tags appended — **no LLM refinement** (to avoid model refusal). The pipeline switches the SD checkpoint via the API automatically.
The `uncen` prefix is stripped and the prompt is refined by **dolphin-mistral:7b** (an uncensored LLM that won't refuse any content) instead of gpt-oss. The pipeline switches the SD checkpoint to Juggernaut XL v9 automatically. If dolphin-mistral is unavailable, it falls back to sending the user's text directly with quality tags appended.
### How it works
@@ -233,11 +234,12 @@ The `uncen` prefix is stripped and the user's text is sent directly to Stable Di
5. SD checkpoint is unloaded from VRAM and page cache is dropped
**Uncensored mode:**
1. `uncen` prefix is stripped, quality tags appended directly (no LLM call)
2. Ollama models are unloaded from VRAM
3. SD checkpoint is switched to Juggernaut XL v9
4. Image is generated, compressed PNG→JPEG, and streamed in 4KB chunks
5. SD checkpoint is unloaded from VRAM and page cache is dropped
1. `uncen` prefix is stripped
2. dolphin-mistral:7b refines the prompt into optimized SD tags (no refusal)
3. Ollama models are unloaded from VRAM
4. SD checkpoint is switched to Juggernaut XL v9
5. Image is generated, compressed PNG→JPEG, and streamed in 4KB chunks
6. SD checkpoint is unloaded from VRAM and page cache is dropped
## VRAM Management
@@ -258,7 +260,7 @@ sudo sh -c 'sync; echo 3 > /proc/sys/vm/drop_caches'
```
User Message
├─ "uncen" prefix? ─────────────── → Juggernaut XL v9 (direct, no search)
├─ "uncen" prefix? ─────────────── → dolphin-mistral:7b (refine) → Juggernaut XL v9
├─ Image uploaded? ──────────────── → llama3.2-vision:11b