Upload files to "/"
This commit is contained in:
@@ -24,7 +24,7 @@ An intelligent prompt classification and routing pipeline for [Open WebUI](https
|
||||
| reasoning (FI) | gpt-oss:120b | gpt-oss:20b | Analysis, comparison, strategy (Finnish) |
|
||||
| reasoning (EN) | gpt-oss:120b | gpt-oss:20b | Analysis, comparison, strategy (English) |
|
||||
| image generation | gpt-oss:120b + SDXL | gpt-oss:20b + SDXL | "generate an image", "luo kuva" |
|
||||
| uncensored image | Juggernaut XL v9 | Juggernaut XL v9 | Prompt starts with `uncen` |
|
||||
| uncensored image | dolphin-mistral:7b + Juggernaut XL v9 | dolphin-mistral:7b + Juggernaut XL v9 | Prompt starts with `uncen` |
|
||||
| vision | llama3.2-vision:11b | llama3.2-vision:11b | User uploads an image |
|
||||
| general | gpt-oss:120b | gpt-oss:20b | Everything else |
|
||||
|
||||
@@ -43,6 +43,7 @@ Two pipeline variants are provided:
|
||||
ollama pull qwen2.5-coder:14b
|
||||
ollama pull gpt-oss:120b # or gpt-oss:20b for the lighter variant
|
||||
ollama pull llama3.2-vision:11b
|
||||
ollama pull dolphin-mistral:7b # uncensored model for image prompt refinement
|
||||
```
|
||||
- **Brave Search API key** (free tier: https://brave.com/search/api/)
|
||||
|
||||
@@ -221,7 +222,7 @@ uncen a beautiful sunset over the ocean
|
||||
uncen portrait of a warrior in golden armor
|
||||
```
|
||||
|
||||
The `uncen` prefix is stripped and the user's text is sent directly to Stable Diffusion with quality tags appended — **no LLM refinement** (to avoid model refusal). The pipeline switches the SD checkpoint via the API automatically.
|
||||
The `uncen` prefix is stripped and the prompt is refined by **dolphin-mistral:7b** (an uncensored LLM that won't refuse any content) instead of gpt-oss. The pipeline switches the SD checkpoint to Juggernaut XL v9 automatically. If dolphin-mistral is unavailable, it falls back to sending the user's text directly with quality tags appended.
|
||||
|
||||
### How it works
|
||||
|
||||
@@ -233,11 +234,12 @@ The `uncen` prefix is stripped and the user's text is sent directly to Stable Di
|
||||
5. SD checkpoint is unloaded from VRAM and page cache is dropped
|
||||
|
||||
**Uncensored mode:**
|
||||
1. `uncen` prefix is stripped, quality tags appended directly (no LLM call)
|
||||
2. Ollama models are unloaded from VRAM
|
||||
3. SD checkpoint is switched to Juggernaut XL v9
|
||||
4. Image is generated, compressed PNG→JPEG, and streamed in 4KB chunks
|
||||
5. SD checkpoint is unloaded from VRAM and page cache is dropped
|
||||
1. `uncen` prefix is stripped
|
||||
2. dolphin-mistral:7b refines the prompt into optimized SD tags (no refusal)
|
||||
3. Ollama models are unloaded from VRAM
|
||||
4. SD checkpoint is switched to Juggernaut XL v9
|
||||
5. Image is generated, compressed PNG→JPEG, and streamed in 4KB chunks
|
||||
6. SD checkpoint is unloaded from VRAM and page cache is dropped
|
||||
|
||||
## VRAM Management
|
||||
|
||||
@@ -258,7 +260,7 @@ sudo sh -c 'sync; echo 3 > /proc/sys/vm/drop_caches'
|
||||
```
|
||||
User Message
|
||||
│
|
||||
├─ "uncen" prefix? ─────────────── → Juggernaut XL v9 (direct, no search)
|
||||
├─ "uncen" prefix? ─────────────── → dolphin-mistral:7b (refine) → Juggernaut XL v9
|
||||
│
|
||||
├─ Image uploaded? ──────────────── → llama3.2-vision:11b
|
||||
│
|
||||
|
||||
Reference in New Issue
Block a user