Trim README quick start

2026-06-01 15:07:41 +09:00
parent 30594b4e0b
commit ead7c01822
1 changed files with 79 additions and 185 deletions
--- a/README.md
+++ b/README.md
@@ -22,7 +22,10 @@ A self-hosted AI workspace -- meant to be the self-hosted version of the UI expe
  - **Extras** -- more to explore, happy if you give it a go!<br>　<sub>image editor · theme editor · file uploads (vision + PDF) · web search · presets · sessions · 2FA</sub>
 ## Demo
-A full, hover-to-play tour lives on the landing page (`docs/index.html`). A few looks:
+A full, hover-to-play tour lives on the landing page (`docs/index.html`).
 <details>
 <summary>Screenshots / clips</summary>
 ### Chat & Agents
 ![Chat & Agents](docs/chat.gif)
@@ -35,194 +38,119 @@ A full, hover-to-play tour lives on the landing page (`docs/index.html`). A few
 ### Notes & Tasks
 ![Notes & Tasks](docs/notes.gif)
 </details>
 ## Quick Start
-Defaults work out of the box — clone, run, configure inside the app.
+Defaults work out of the box: clone, run, then configure models/search/email
-Open the **Settings** panel after first login to point Odysseus at your LLM
+inside **Settings**. Only edit `.env` for deployment-level overrides like
-server, search provider, email account, etc. Only touch `.env` if you need
+`APP_PORT`, `AUTH_ENABLED`, `DATABASE_URL`, or a pre-seeded admin password.
 to override deployment-level things like `AUTH_ENABLED`, `DATABASE_URL`,
 or pre-seed `ODYSSEUS_ADMIN_PASSWORD` (otherwise an initial password is
 generated and printed on first boot).
 Contributing? See [CONTRIBUTING.md](CONTRIBUTING.md) for setup, testing, and
 pull request guidelines.
-### Option 1: Docker (recommended)
+### Docker (recommended)
 ```bash
 git clone https://github.com/pewdiepie-archdaemon/odysseus.git
 cd odysseus
 cp .env.example .env       # optional, but recommended for explicit defaults
 docker compose up -d --build
 ```
-Compose starts Odysseus, ChromaDB, SearXNG, and ntfy. First run does a full
+Open `http://localhost:7000` when the containers are healthy. If the port is
-image build. Open `http://localhost:7000` after the containers are healthy.
+taken, set `APP_PORT=7001` in `.env` and recreate the container.
 If port `7000` is already taken, set `APP_PORT=7001` (or another free port)
 in `.env`, recreate the container, and open `http://localhost:7001`.
-> **On Apple Silicon, Docker can't use the Metal GPU** (it runs a Linux VM), so
+### Native Linux / macOS
 > Cookbook will serve models on the CPU only. For GPU-accelerated Cookbook,
 > run the app natively — see [Apple Silicon](#apple-silicon-m-series).
 Cookbook remote servers use an Odysseus-owned SSH key from `./data/ssh`
 inside Docker. In **Cookbook -> Settings -> Servers**, generate/copy the
 public key and add it to the remote server's `~/.ssh/authorized_keys`.
 After generating the key, you can also install it from the host with:
 ```bash
 ssh-copy-id -i data/ssh/id_ed25519.pub user@server
 ```
 Cookbook local downloads are stored in `./data/huggingface`, mounted as
 `~/.cache/huggingface` inside the Odysseus container. Cookbook-installed
 serve engines and Python CLIs are stored in `./data/local`, mounted as
 `~/.local`, so vLLM/llama.cpp installs survive container recreation.
 After downloading a model, open **Cookbook -> Serve**, pick the cached model,
 and launch it. When the server answers `/v1/models`, Odysseus adds it to the
 chat model picker automatically. For NVIDIA / AMD GPUs in Docker, install
 the host runtime (NVIDIA Container Toolkit or ROCm drivers) and enable the
 matching overlay via `COMPOSE_FILE` in `.env`:
 ```bash
 # NVIDIA
 COMPOSE_FILE=docker-compose.yml:docker/gpu.nvidia.yml
 # AMD ROCm
 COMPOSE_FILE=docker-compose.yml:docker/gpu.amd.yml
 ```
 Verify with `docker compose exec odysseus nvidia-smi -L` (or `rocm-smi`).
 The default Docker image is intentionally slim. For Python-based serve engines,
 use **Cookbook -> Dependencies** to install vLLM, SGLang, llama-cpp-python, or
 diffusers into the persisted `./data/local` mount. Native CUDA builds inside the
 container also require CUDA toolkit binaries such as `nvcc`; if those are not
 installed in the container, use prebuilt Python wheels or serve from a remote
 GPU host that already has the toolkit.
 Useful checks:
 ```bash
 docker compose ps
 docker compose logs --tail=120 odysseus
 docker compose logs odysseus | grep -E 'ChromaDB|MemoryVectorStore|DEGRADED'
 docker compose exec odysseus python -c "from services.hwfit.models import get_models; print(len(get_models()))"
 ```
 Expected vector-memory startup lines in Docker:
 ```text
 ChromaDB connected: chromadb:8000
 MemoryVectorStore initialized
 ```
 The Cookbook model catalog check should print a non-zero count. If it prints
 `0`, rebuild the Odysseus image with `docker compose build --no-cache odysseus`.
 ### Option 2: Manual install — Linux / macOS
 **Requirements:** Python 3.11+. Cookbook also requires `tmux` for background
 model downloads and serves.
 > **On macOS (Apple Silicon)?** Skip the manual steps below — run
 > `./start-macos.sh` for a one-command setup. See
 > [Apple Silicon](#apple-silicon-m-series).
 Install system packages first:
 ```bash
 # Debian/Ubuntu
 sudo apt install tmux
 # Arch
 sudo pacman -S tmux
 # Fedora
 sudo dnf install tmux
 # macOS (Homebrew). macOS ships no recent Python by default — install 3.11+
 # (skip the python line if you already have Python 3.11 or newer):
 brew install python@3.11 tmux
 ```
 Then install Odysseus:
 ```bash
 git clone https://github.com/pewdiepie-archdaemon/odysseus.git
 cd odysseus
-python3 -m venv venv          # on macOS use: python3.11 -m venv venv
+python3 -m venv venv
 source venv/bin/activate
 pip install -r requirements.txt
-python setup.py            # creates data dirs and prints an initial admin password
+python setup.py
 python -m uvicorn app:app --host 0.0.0.0 --port 7000
 ```
 Requirements: Python 3.11+. Cookbook also needs `tmux` for background model
 downloads and serves.
-#### Apple Silicon (M-series)
+### Apple Silicon
 Docker on macOS cannot use the Metal GPU. For GPU-accelerated Cookbook on an
 M-series Mac, run Odysseus natively:
 > **On a Mac, run Odysseus natively (not in Docker) so Cookbook can use the
 > Metal GPU.** Cookbook serves models on whatever machine Odysseus runs on, and
 > Docker on macOS is a Linux VM with **no access to the GPU** — in a container
 > your Mac looks like a CPU-only Linux box.
 **Quick start — one command.** From a fresh clone:
 ```bash
 git clone https://github.com/pewdiepie-archdaemon/odysseus.git
 cd odysseus
 ./start-macos.sh
 ```
 That installs what's needed via Homebrew (Python 3.11+, `tmux`, and a prebuilt
 Metal `llama-server`), sets everything up, and launches Odysseus at
 **http://127.0.0.1:7860**. Log in with the admin password it prints, open
 **Cookbook**, and it detects your GPU (`backend: metal`) and recommends GGUF
 models that fit your Mac. (MLX models aren't supported on macOS and are hidden —
 see below.) Re-run `./start-macos.sh` any time to start it again (use another
 port with `ODYSSEUS_PORT=7900 ./start-macos.sh`).
-**Prefer a clickable app?** After your first `./start-macos.sh`, build a
+It launches at `http://127.0.0.1:7860`. To build a clickable app wrapper:
-launcher `Odysseus.app` (+ a drag-to-Applications `.dmg`) that starts the server
+
 and opens the UI in its own window:
 ```bash
-./build-macos-app.sh          # → dist/Odysseus.app and dist/Odysseus.dmg
+./build-macos-app.sh
 ```
 <details>
-<summary>What <code>start-macos.sh</code> does, serving engines, and manual steps</summary>
+<summary>Cookbook, GPU, Ollama, and troubleshooting notes</summary>
-`start-macos.sh` is just the manual steps wrapped up: Homebrew deps → a Python
+**Docker bundled services.** Compose starts Odysseus, ChromaDB, SearXNG, and
-`venv` → `pip install -r requirements.txt` → `python setup.py` → `uvicorn` on a
+ntfy. ChromaDB/SearXNG/ntfy bind host ports to `127.0.0.1` by default, so they
-non-AirPlay port. Run them by hand if you prefer (the Linux steps above, but use
+are reachable from the host but not exposed to your LAN/public internet unless
-`python3.11 -m venv` and `--port 7860`).
+you opt in.
-**Serving engines on Metal** — Cookbook only recommends models it can serve here:
+**Cookbook storage in Docker.** Downloads live in `./data/huggingface`
- **llama.cpp** — `brew install llama.cpp` (done by `start-macos.sh`) provides a
+(`~/.cache/huggingface` in the container). Cookbook-installed Python CLIs and
-  prebuilt Metal `llama-server`, no compile. Without it, Cookbook builds it from
+serve engines live in `./data/local` (`~/.local` in the container), so they
-  source on first serve, which needs `cmake` + Xcode Command Line Tools
+survive container recreation.
  (`brew install cmake && xcode-select --install`).
 - **Ollama** — `brew install ollama` is another simple Metal-accelerated option.
 - vLLM/SGLang are CUDA/ROCm-only and do **not** run on macOS.
-**MLX models are not supported on Apple Silicon.** Odysseus serves models via
+**Remote servers.** In **Cookbook -> Settings -> Servers**, generate the
-llama.cpp/Ollama (GGUF) and vLLM/SGLang (CUDA) — it has no MLX (`mlx_lm`)
+Odysseus SSH key and add the public key to the remote server's
-runtime. So MLX-only models can't be served on a Mac and are deliberately
+`~/.ssh/authorized_keys`. From the host you can also run:
 **hidden** from Cookbook's recommendations there; pick a GGUF build instead.
-**Port 7000 & AirPlay** — macOS AirPlay Receiver holds ports 7000/5000, so
+```bash
-`start-macos.sh` defaults to **7860**. To use 7000, turn AirPlay Receiver off in
+ssh-copy-id -i data/ssh/id_ed25519.pub user@server
 System Settings → General → AirDrop & Handoff.
 **Build prerequisites baked in** — the `.app` wraps this repo's `venv` (it
 doesn't bundle Python), so the path is fixed at build time — rebuild if you move
 the repo.
 </details>
 ### Option 3: Manual install — Windows (PowerShell)
 Windows support is not actively tested. Use it with caution; Docker on Linux
 or a Linux/macOS manual install is the safer path for now.
 ```powershell
 git clone https://github.com/pewdiepie-archdaemon/odysseus.git
 cd odysseus
 python -m venv venv
 venv\Scripts\Activate.ps1
 pip install -r requirements.txt
 python setup.py
 python -m uvicorn app:app --host 0.0.0.0 --port 7000
 ```
-Open `http://localhost:7000`, log in with the generated admin password,
+**NVIDIA / AMD Docker GPU overlays.** Install the host runtime first, then add
-and configure everything else inside **Settings**.
+one of these to `.env`:
 ```bash
 COMPOSE_FILE=docker-compose.yml:docker/gpu.nvidia.yml
 COMPOSE_FILE=docker-compose.yml:docker/gpu.amd.yml
 ```
 Verify with:
 ```bash
 docker compose exec odysseus nvidia-smi -L
 docker compose exec odysseus rocm-smi
 ```
 **Ollama with Docker.** If Ollama runs on the host, add this endpoint in
 Settings:
 ```text
 http://host.docker.internal:11434/v1
 ```
 Ollama must listen outside its own loopback interface:
 ```bash
 OLLAMA_HOST=0.0.0.0:11434 ollama serve
 ```
 **Useful checks.**
 ```bash
 docker compose ps
 docker compose logs --tail=120 odysseus
 docker compose logs odysseus | grep -E 'ChromaDB|MemoryVectorStore|DEGRADED'
 ```
 **macOS details.** `start-macos.sh` installs Homebrew deps, creates the venv,
 runs setup, and starts uvicorn on port `7860` because AirPlay often holds
 `7000`. It uses llama.cpp/Ollama for Metal. vLLM/SGLang are CUDA/ROCm-only and
 do not run on macOS. MLX-only models are not served by Odysseus.
 </details>
 ## Security Notes
 Odysseus is a self-hosted workspace with powerful local tools: shell access, file uploads, model downloads, web research, email/calendar integrations, and API tokens. Treat it like an admin console.
@@ -274,21 +202,6 @@ Key settings:
 | `CHROMADB_PORT` | `8100` | ChromaDB port for manual host runs. Docker overrides this to `8000`. |
 | `EMBEDDING_URL` | -- | OpenAI-compatible embeddings endpoint |
 ### Bundled services
 Docker Compose includes these by default. The bundled service ports bind to `127.0.0.1` unless you opt in to a different bind address in `.env`, so they are reachable from the host machine but not from your LAN or the public internet by default:
  - **ChromaDB** → vector store for semantic memory. In Docker, Odysseus connects to `chromadb:8000`; from the host it is exposed as `${CHROMADB_BIND:-127.0.0.1}:8100`.
  - **SearXNG** → meta search for web search. In Docker, Odysseus connects to `searxng:8080`; from the host it is exposed as `127.0.0.1:8080`.
  - **ntfy** → local notification service, exposed as `${NTFY_BIND:-127.0.0.1}:8091`.
 **Phone push notifications via ntfy:** A phone cannot subscribe to `127.0.0.1` on your server. To expose ntfy safely without opening it on every interface:
  - **Tailscale (recommended)** — set `NTFY_BIND=<tailscale-host-ip>` and `NTFY_BASE_URL=http://<tailscale-host-ip>:8091` in `.env`, recreate ntfy, then point the ntfy Android/iOS app at `http://<tailscale-host-ip>:8091/<your-topic>`.
  - **Enable ntfy auth and bind to LAN** — add `NTFY_AUTH_FILE` + `NTFY_AUTH_DEFAULT_ACCESS=deny-all` to the `ntfy` service, create a user with `docker compose exec ntfy ntfy user add ...`, then set `NTFY_BIND` to your LAN IP. See the [ntfy docs](https://docs.ntfy.sh/config/#access-control).
 ### Optional external services
  - **Ollama** → local LLM server -- [ollama.ai](https://ollama.ai)
 ### Built-in MCP servers (optional setup)
 Odysseus auto-registers a few built-in MCP servers at startup. The npx-based ones (currently the browser server, `@playwright/mcp`) only start when their npm package is already in the local npx cache. If a package isn't cached, that server is skipped with a startup log message explaining what to do, so a fresh install does not block on a multi-minute npm download or hang if Playwright system deps are missing.
@@ -301,25 +214,6 @@ npx -y @playwright/mcp@latest --version
 That installs `@playwright/mcp` plus Playwright (~300MB total). Restart Odysseus and the server will register at startup.
 ### Ollama with Docker
 If Odysseus is running in Docker and Ollama is running on the host, add the endpoint in Settings as:
 `http://host.docker.internal:11434/v1`
 The default Compose file already maps `host.docker.internal` on Linux. Ollama also needs to listen outside its own loopback interface:
 ```bash
 OLLAMA_HOST=0.0.0.0:11434 ollama serve
 ```
 For a systemd Ollama install, set that in the Ollama service override. If Odysseus can see Ollama but requests hang or fail, check that your host firewall allows Docker bridge traffic to port `11434`.
 First-token latency is usually Ollama/model/hardware, not Odysseus. To compare, test Ollama directly:
 ```bash
 curl http://127.0.0.1:11434/v1/models
 ```
 ## Architecture
 ```
 app.py                   # FastAPI entry point