Upload files to "/"

2026-04-05 11:47:39 +00:00
parent 09997ffd64
commit dafaa490be
4 changed files with 48 additions and 30 deletions
@@ -24,7 +24,7 @@ An intelligent prompt classification and routing pipeline for [Open WebUI](https
 | reasoning (FI) | gpt-oss:120b | gpt-oss:20b | Analysis, comparison, strategy (Finnish) |
 | reasoning (EN) | gpt-oss:120b | gpt-oss:20b | Analysis, comparison, strategy (English) |
 | image generation | gpt-oss:120b + SDXL | gpt-oss:20b + SDXL | "generate an image", "luo kuva" |
-| uncensored image | Juggernaut XL v9 | Juggernaut XL v9 | Prompt starts with `uncen` |
+| uncensored image | dolphin-mistral:7b + Juggernaut XL v9 | dolphin-mistral:7b + Juggernaut XL v9 | Prompt starts with `uncen` |
 | vision | llama3.2-vision:11b | llama3.2-vision:11b | User uploads an image |
 | general | gpt-oss:120b | gpt-oss:20b | Everything else |

@@ -43,6 +43,7 @@ Two pipeline variants are provided:
  ollama pull qwen2.5-coder:14b
  ollama pull gpt-oss:120b    # or gpt-oss:20b for the lighter variant
  ollama pull llama3.2-vision:11b
+  ollama pull dolphin-mistral:7b   # uncensored model for image prompt refinement
  ```
 - **Brave Search API key** (free tier: https://brave.com/search/api/)

@@ -221,7 +222,7 @@ uncen a beautiful sunset over the ocean
 uncen portrait of a warrior in golden armor
 ```

-The `uncen` prefix is stripped and the user's text is sent directly to Stable Diffusion with quality tags appended — **no LLM refinement** (to avoid model refusal). The pipeline switches the SD checkpoint via the API automatically.
+The `uncen` prefix is stripped and the prompt is refined by **dolphin-mistral:7b** (an uncensored LLM that won't refuse any content) instead of gpt-oss. The pipeline switches the SD checkpoint to Juggernaut XL v9 automatically. If dolphin-mistral is unavailable, it falls back to sending the user's text directly with quality tags appended.

 ### How it works

@@ -233,11 +234,12 @@ The `uncen` prefix is stripped and the user's text is sent directly to Stable Di
 5. SD checkpoint is unloaded from VRAM and page cache is dropped

 **Uncensored mode:**
-1. `uncen` prefix is stripped, quality tags appended directly (no LLM call)
-2. Ollama models are unloaded from VRAM
-3. SD checkpoint is switched to Juggernaut XL v9
-4. Image is generated, compressed PNG→JPEG, and streamed in 4KB chunks
-5. SD checkpoint is unloaded from VRAM and page cache is dropped
+1. `uncen` prefix is stripped
+2. dolphin-mistral:7b refines the prompt into optimized SD tags (no refusal)
+3. Ollama models are unloaded from VRAM
+4. SD checkpoint is switched to Juggernaut XL v9
+5. Image is generated, compressed PNG→JPEG, and streamed in 4KB chunks
+6. SD checkpoint is unloaded from VRAM and page cache is dropped

 ## VRAM Management

@@ -258,7 +260,7 @@ sudo sh -c 'sync; echo 3 > /proc/sys/vm/drop_caches'
 ```
 User Message
    │
-    ├─ "uncen" prefix? ─────────────── → Juggernaut XL v9 (direct, no search)
+    ├─ "uncen" prefix? ─────────────── → dolphin-mistral:7b (refine) → Juggernaut XL v9
    │
    ├─ Image uploaded? ──────────────── → llama3.2-vision:11b
    │