odysseus

Author	SHA1	Message	Date
Wes Huber	ccc0b9ab0c	Setup: prompt for first-run admin credentials * feat(setup): prompt for admin credentials interactively on first run When setup.py runs in a terminal (TTY) without env vars set, it now asks the user to choose a username and password instead of generating a random one that scrolls off-screen. Includes confirmation prompt to catch typos. Existing behavior is preserved: - ODYSSEUS_ADMIN_USER + ODYSSEUS_ADMIN_PASSWORD env vars take priority - Non-interactive contexts (Docker, CI) still get a random password - ODYSSEUS_SKIP_ADMIN_PROMPT=1 opts out of the interactive prompt - Re-runs still skip if auth.json already exists Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(macos): use venv Python for pip install and uvicorn launch On PEP 668 systems (newer Homebrew Python), pip install outside a venv is rejected. The script creates a venv but then called the system $PY for pip and uvicorn. Switch to ./venv/bin/python for both. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Revert "fix(macos): use venv Python for pip install and uvicorn launch" This reverts commit 7a1be956659d86183da2edcde2114eb363efd3e4. --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 13:14:37 +09:00
danielxb	5268a546bc	Model picker: group models by provider Rebased on current main. Integrates with the new Recent/Favorites system — provider groups appear below Recent and Favorites in browse mode for large catalogs (>12 models). Changes: - Models grouped by canonical provider with collapsible sections - Chevron animation consistent with sidebar sections - Domino cascade on expand (only on just-opened group) - Provider display names (deepseek-ai -> DeepSeek, meta -> Llama, etc.) - Alias merging (meta + meta-llama -> one Llama group) - Search includes provider display names for filtering - Collapsed state persists in localStorage - No screenshot binary committed Co-authored-by: danielxb <5981902+danielxb@users.noreply.github.com>	2026-06-02 13:14:22 +09:00
spooky	cd4f496cb4	Fix native Cookbook quant classification	2026-06-02 13:07:20 +09:00
MohammadYusif	65b5d65059	fix(agent): extract web search sources from output key tool_execution.py returns web search results as {"output": ..., "exit_code": 0}. The sources-extraction block in stream_agent_loop only checked result.get("results") and result.get("stdout"), so _src_text was always "" for every tool-call-mode web search. Two consequences: 1. The SOURCES marker was never parsed and the web_sources SSE event was never emitted -- the sources panel never appeared after agent-mode searches. 2. The marker (a large JSON blob) was left in result["output"] and forwarded verbatim to the LLM in round 2 via format_tool_result, confusing some local models into producing no tokens. Fix: prepend result.get("output") to the lookup chain, and update the cleanup assignment so result["output"] is overwritten with the stripped text. Adds six regression tests in tests/test_agent_loop.py documenting the before/after behaviour and verifying backward compat with the legacy results/stdout paths. Co-authored-by: MohammadYusif <MohammadYusif@users.noreply.github.com>	2026-06-02 13:06:09 +09:00
Stephen Yue	d46c406bd8	Fix Cookbook fit column sorting The Fit column shared the Score column's sort key, so clicking the Fit header sorted by Score instead of by hardware fit. There was also no fit option in the hidden sort <select> and no fit branch in the client-side comparator. - Give the Fit column its own sort key (fit). - Add a fit option to the sort select (kept Score as the default so first-load ordering is unchanged). - Sort by the categorical fit_level rank (perfect > good > marginal > too_tight), tie-broken by score, honoring the ascending/descending toggle. Fixes #842 Co-authored-by: SabixMaru <285860855+SabixMaru@users.noreply.github.com>	2026-06-02 13:05:53 +09:00
Alexandre Teixeira	e129378014	Clarify private deployment hardening docs Document safer defaults and deployment guidance for network-accessible Odysseus installs. The guidance emphasizes keeping auth enabled, disabling localhost bypass outside development, using secure cookies for HTTPS/reverse-proxy deployments, and exposing only the authenticated Odysseus entrypoint through a trusted proxy or private access layer. Also clarify that bundled services, databases, vector stores, notification services, and raw model/provider APIs should remain internal-only. This is documentation and config-example only. It does not change runtime behavior.	2026-06-02 13:01:12 +09:00
Juan Pablo Jiménez	eda99360d1	Fix Cookbook dependency install completion state * Fix Cookbook dependency install completion state Mark Cookbook dependency installs as complete when the background runner exits successfully, even when HuggingFace-specific download markers are absent. * Add focused regression coverage for cookbook dependency completion. Keep the fix narrowly scoped while carrying env_path through dependency tasks and locking the completion reconciliation behavior with targeted tests.	2026-06-02 12:59:29 +09:00
Tatlatat	acfdcf346c	fix(agent): map native google_search and surface empty rounds Models (notably Gemini) emit a native 'google_search' function call, but the agent loop had no mapping for it, so the call failed to convert, the round produced 0 chars and 0 tool blocks, and generation died silently — the web client hung on 'waiting for first token' with no error (also #443). - Map google_search / google_search_retrieval / google_search_grounding to the web_search tool, and read Gemini's 'queries' array (falling back to 'query'). - In stream_agent_loop, when a round yields no response text and no tool events, emit a visible fallback message instead of leaving the user hanging. - Give the unknown-tool execution branch an explicit exit_code=1 so the failure is logged as an error rather than 'n/a'. Unknown/unconvertible tool names still return None (unchanged) so they are dropped safely rather than executed. Added tests covering the google_search mapping, the queries array, and unknown/invalid-JSON returning None.	2026-06-02 12:57:45 +09:00
Alexandre Teixeira	5607db85d4	tests: cover companion models route filtering	2026-06-02 12:57:32 +09:00
Boody	97528be0f4	Add custom web search result count * fixed confusing credentials prompt * fix(setup): return status from create_default_admin function * fix(setup): initialize admin creation status in main function * fix(setup): enhance admin creation feedback and status handling * Enhance admin user login messages with conditional feedback based on creation status * Refine admin user creation feedback messages for clarity and actionability and formatted code * Add fallback error message for admin creation failure in setup script * Add run script for Uvicorn with dotenv integration * Refactor server runner to use argparse for host and port configuration * Remove captured output print statement from server runner * Fix server runner to ensure cross-platform compatibility and improve log handling * Remove run.py script to match main repo * feat: add custom option for search result count in settings * fix: enforce minimum and maximum values for custom search result count	2026-06-02 12:55:15 +09:00
Sheikh Rahat Mahmud	e2ba068cbc	Add provider endpoint resolver tests The existing test_endpoint_resolver.py copies the pure functions to avoid import side effects, so its assertions can silently drift from the shipped src/endpoint_resolver.py (the copies already lag: no OpenRouter headers, no anthropic.com host matching). This adds a sibling module that imports the REAL resolver and locks in behavior for every provider named in ROADMAP.md's "Provider setup/probing audit" — Anthropic, Gemini, Groq, xAI, OpenRouter, OpenAI, DeepSeek — plus Ollama (local + cloud) and the Tailscale self-host fallback in resolve_url. Covers build_chat_url, build_models_url, build_headers, normalize_base, _first_chat_model, _anthropic_api_root, _ollama_api_root, and resolve_url. conftest.py already stubs the heavy deps, so the import is side-effect free. Test-only; no behavior change. 55 new tests, all passing. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 12:53:50 +09:00
ooovenenoso	1a7b90623c	Prefer Python 3.11+ in Windows launcher	2026-06-02 12:50:58 +09:00
spooky	0f3280ee05	Expose advanced llama.cpp serve controls	2026-06-02 12:46:16 +09:00
Mahdi Salmanzade	05fb48e9d5	Add admin-only companion pairing Split 3/4 of the companion bridge (#863, #871 landed 1/4 and 2/4). Adds admin-only device pairing to the companion router. - GET /api/companion/pair -- renders a form; never mints (a GET must not mint a credential: SameSite=Lax session cookies ride top-level GET navigations, so GET-minting would be CSRF-triggerable via a link/<img>) - POST /api/companion/pair -- mints a one-time chat-scoped token. Admin-cookie only; CSRF-safe because a SameSite=Lax cookie is not sent on a cross-site POST, the same protection POST /api/tokens relies on. ?format=json returns the pairing payload for an in-app screen. Minting invalidates the auth middleware's token cache so the code works on the next request with no restart. companion/pairing.py holds the mint/LAN/QR helpers; the token is shown once and stored only as a bcrypt hash + prefix (mirrors routes/api_token_routes.py). Tests (tests/test_companion_pairing.py): - a bearer/'api' caller and a non-admin user are rejected by require_admin (403); an admin passes - the token is returned once and persisted only as a hash - minting invalidates the cache (works without restart) - minting is exposed on POST, never GET (CSRF)	2026-06-02 12:43:50 +09:00
Zeus-Deus	19a4f823a4	Rename Character copy to Persona Issue #234: the "Character" tab and its "Style of response" label made it unclear that this is where a system prompt is set. Rename the user-facing labels for clarity: - "Character" tab + section heading -> "Persona" - "Style of response" -> "System prompt" - supporting strings: select placeholder, name placeholder, button/title text, toasts, confirm/notice text, the chat-bar indicator tooltip, the settings visibility toggle, and the assistant personality picker ("Characters" optgroup -> "Personas"). Used "Persona" rather than the issue's suggested "Preset" because the app already has a distinct, user-facing "Presets" concept (built-in presets like Code Analyze/Brainstorm/Reason, shown as their own group in the assistant picker). "Persona" matches what this tab actually creates -- a named persona with its own memories -- without colliding with that term. Internal identifiers (element IDs, data-chartab attributes, function names) and the character_name backend field are intentionally left unchanged so existing saved presets and JS wiring keep working.	2026-06-02 12:42:15 +09:00
Collin	c90a7a19a5	Add dialog accessibility semantics Screen readers got no signal that a dialog opened — not one modal carried role="dialog" — and several close buttons had no accessible name. - The 6 static tool windows (Brain, Theme, Prompt, Rename session, Cookbook, Settings) now carry role="dialog" + an accessible name. They are dockable, tiling windows, so they are non-modal dialogs (intentionally no aria-modal). - The four unlabelled close buttons (theme, prompt, cookbook, settings) get an aria-label so they no longer read as just "heavy multiplication x". - styledConfirm / styledPrompt ARE blocking modals: they get role="dialog" + aria-modal="true" + aria-labelledby/aria-describedby, and now manage focus — restore focus to the triggering element on close and trap Tab within the dialog (they already moved focus in on open). tests/test_dialog_aria.py pins the roles, labels, and focus management.	2026-06-02 12:41:25 +09:00
ghreprimand	77611f0491	Scope memory consolidation by owner group Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-02 12:40:28 +09:00
Mihail Filippov	3d109cbaca	Add explicit open-signup state endpoint * Refactor open registration state switching * Rename endpoint to open-signup	2026-06-02 12:35:54 +09:00
Leo	6fca7e86b7	Cookbook serve profiles and engine filter * Cookbook: Engine filter + intelligent hardware-computed serve profiles Two related Cookbook serving improvements for accurate, hardware-aware model serving (especially on consumer GPUs that can only run GGUF/llama.cpp). Engine filter - New "Engine" dropdown (All / llama.cpp / vLLM / SGLang) beside the quant picker. Pure client-side view filter over the fetched list via the same _detectBackend() the serve commands use, so what you filter to is exactly what would launch. Re-renders from cache (no refetch). Empty-state message + the instant-cache-paint path account for it too. Intelligent serve profiles (Quality / Balanced / Speed) - services/hwfit/profiles.py: compute_serve_profiles() turns detected VRAM + model size into concrete llama.cpp flags (n_gpu_layers, n_cpu_moe, cache-type, context). Encodes the by-hand tuning: a too-big MoE offloads experts to CPU instead of failing; a model that fits stays fully on GPU; quant tracks profile intent; vision models keep image-encoder headroom. Reuses models.py VRAM math so filtering and serving agree on what fits. Pure/deterministic (no t/s claims — partial-offload speed isn't reliably predictable; fit is what's computed). - /api/hwfit/profiles endpoint returns the profiles + the model's trained context limit, with loose name matching (strips org/ prefix, -GGUF suffix, quant tag) so a local GGUF folder name resolves to its catalog entry. - _buildServeCmd (llama.cpp) now emits --n-cpu-moe / --flash-attn / --cache-type-k/v when set, with llama-cpp-python fallback equivalents. It previously only set -ngl/-c, which is why it OOM'd or ran slow. - Serve panel: profile chips that fill the fields on click, plus CPU-MoE / KV Cache / Flash Attn fields. Context is clamped to the model's trained limit (and an absolute 1M sanity ceiling) on type/blur/profile-load and at launch — fixes a crash where a stale 256k/16M preset + quantized KV cache caused an amdgpu ErrorDeviceLost. 
Tests: tests/test_serve_profiles.py (7) — offload vs full-GPU fit, never exceed VRAM, context cap, launchable flags, vision headroom, no-GPU empty. Checks: py_compile + node --check pass; pytest test_serve_profiles + test_hwfit_amd green; verified live on an RDNA4 box (gfx1200) — Balanced lands ~ncm18 q4 128k, matching hand-tuning. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * Cookbook: make column-header sorting discoverable (incl. Newest) Sorting in Cookbook is via clickable column headers (pewds' design), but the headers had no visual cue that they're interactive — so sorting in general, and the Newest sort on the Model header specifically, was undiscoverable. - Style sortable headers as interactive: pointer cursor, hover underline, and the active sort column bolded/highlighted. There was no CSS for .hwfit-sortable / .hwfit-sort-active at all; this helps every existing sort, not just Newest. - The Model column header sorts by release_date (newest first), reusing the existing header-click sort wiring and the "newest" SORT_KEY. No new sort control — uses the existing column-header paradigm. Checks: node --check passes. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * Cookbook serve profiles: keep the on-disk file's quant fixed (don't propose Q6/Q2) In the Serve tab the model is a specific GGUF file already on disk, so its quant can't change — but the profiles were suggesting "Quality · Q6_K" / "Speed · Q2_K" as if you could re-quantize it. That's meaningless when serving a fixed file. - compute_serve_profiles gains serve_weights_gb / serve_quant. When set (SERVE mode), the quant is locked to the file's and profiles differ only in the real serving knobs — n_cpu_moe, KV-cache type, context. _weights_gb / _cpu_moe_for_budget use the file's actual size instead of a quant-derived estimate. DOWNLOAD mode (no override) still varies the quant to show download options. 
- /api/hwfit/profiles accepts serve_weights_gb & serve_quant. - The Serve panel parses the file's size (from m.size "20.6 GB") and quant (from the repo/file name) and passes them, so profiles match what's actually served. Result for a 20.6 GB Q4_K_M file: all three profiles stay Q4_K_M and differ by KV/ctx/offload (Quality q8 KV 128k ncm21, Balanced q4 128k ncm17, Speed q4 32k ncm15) — no nonsensical quant changes. Tests: test_serve_mode_keeps_fixed_quant. Full serve-profile suite green (9). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * Cookbook serve: Vision toggle (auto-find mmproj) + live VRAM/RAM-spillover monitor Two serve-panel additions: 1. Vision toggle. A "Vision" checkbox that serves the model with its multimodal projector so it can read images. The mmproj path is resolved at runtime (find mmproj-.gguf next to the model), so dropping an mmproj file in the model folder makes the toggle just work; `--mmproj … --image-max-tokens 1024` (native) / `--clip_model_path` (llama-cpp-python) only when on + found. 2. Live GPU-memory monitor.* A readout that polls /api/cookbook/gpus every 4s while the panel is open and shows VRAM used/total/%, free, and — crucially on a discrete card — RAM spillover (AMD gtt_used_mb), with a plain-language health hint: green/healthy, amber/tight, red/"spilled to RAM — slow (raise CPU MoE or lower context)". Surfaces gtt_used_mb from the gpus endpoint (previously read for total only and discarded for 'used'). Lets you see at a glance whether a config fits VRAM (fast) or is paging to system RAM over PCIe (slow) instead of guessing. Checks: node --check + py_compile pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-02 12:34:42 +09:00
spooky	8b3c0d8ad4	feat: select cached gguf artifacts for serve (#891)	2026-06-02 12:32:40 +09:00
Alexandre Teixeira	8455b88643	Improve Docker GPU setup diagnostics (#705) * Improve Docker GPU setup diagnostics Add a Docker GPU preflight script for NVIDIA users. The script is read-only by default, checks host NVIDIA drivers, Docker availability, and container GPU passthrough, and prints actionable next steps. Add explicit opt-in modes to print install commands, install NVIDIA Container Toolkit on Ubuntu/Debian, and enable the NVIDIA Compose overlay in .env after passthrough is verified. Document common NVIDIA Docker failure modes, ignore generated .env backups, and clarify that Cookbook can only detect GPUs exposed to the Odysseus container. * Clarify Docker GPU diagnostic limits	2026-06-02 12:30:40 +09:00
Sirsyorrz	517aa593e0	Cookbook: clearer tooltips on saved-config badge and GPU chip (#850 ) Two small polish items in the Cookbook Serve panel. Saved-config badge The little count badge next to the Save button ("3 ▾" etc.) had a generic "Saved launch configs" tooltip, so the number reads like a notification dot. Make it spell out what it is and what clicking does: "3 saved launch configs for <model> — click ▾ to load or delete" (and "No saved launch configs for <model> yet — click Save to add one" when empty). Tooltip stays in sync via _updateSavedToggleLabel so save/delete updates both the count and the hint. GPU chip on mixed-GPU boxes (#711) The chip label was `${gpuCount}x ${gpu_name}`, where gpu_name is just gpus[0].name — so a 4090 + 3060 reads as "2x RTX 4090". The backend already emits gpu_groups (identical cards grouped, used by the serve flow to pin CUDA_VISIBLE_DEVICES) and a per-card gpus[] array, so use them: - Label renders each homogeneous pool: "1× RTX 4090 + 1× RTX 3060". Homogeneous setups keep the existing "2× RTX 4090" form. - Tooltip lists each GPU with its index + VRAM, useful for picking the right device when launching. Refs #711.	2026-06-02 12:30:24 +09:00
Dustin	bd3204fe96	Diagnose vLLM device detection failure with actionable suggestion (#778 ) Adds a diagnosis pattern for the 'Failed to infer device type' error vLLM raises when no CUDA or ROCm GPU is found (e.g. systems with only integrated or Intel Xe graphics). The existing pattern only caught 'No CUDA GPUs are available' which fires later in startup; this new entry catches the earlier device-probe failure and the NVML/amdsmi library-not-found messages that precede it. Surfaces in the Cookbook serve card as: "vLLM could not find a supported GPU — switch to llama.cpp or Ollama" instead of a raw Python traceback. Co-authored-by: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>	2026-06-02 12:30:07 +09:00
IBR-41379	385c3c3cf3	fix: use sys.executable for Cookbook model cache scan on Windows (#627 ) Windows has 'App Execution Aliases' that can make shutil.which('python3') and shutil.which('python') resolve to a Microsoft Store stub instead of real Python -- even when Python is properly installed. The stub outputs: 'Python was not found; run without arguments to install from the Microsoft Store, or disable this shortcut from Settings > Apps > Advanced app settings > App execution aliases.' and exits 9009, producing empty stdout. The JSON parse of the local model cache scan then fails with 'Expecting value: line 1 column 1 (char 0)', and the Cookbook model list shows nothing. Fix: prefer sys.executable as the interpreter for the local scan. Odysseus already runs inside its own venv, so sys.executable always points to the real venv Python and bypasses PATH / Store alias lookup entirely. which_tool() is kept as a fallback. Cross-platform: sys.executable works identically on Linux and macOS (returns the real interpreter path), so this change is safe everywhere.	2026-06-02 12:29:40 +09:00
Ruben G.	25dcb1b10f	fix(macos): make Homebrew dep install idempotent and non-fatal (#754) start-macos.sh now skips Homebrew formulae that are already installed, so re-runs no longer re-hit Homebrew. tmux and llama.cpp are treated as optional: a failed install warns and continues instead of aborting the launch under set -e. Python stays required (it builds the venv).	2026-06-02 12:28:37 +09:00
Rolly Calma	32efeeb3a2	chore: use running event loop in async helpers (#821)	2026-06-02 12:28:05 +09:00
lolwuttav	c99193041a	fix(cookbook): default Ollama serve to loopback (#872)	2026-06-02 12:27:04 +09:00
Tatlatat	ffb77d7ff2	fix(auth): honor AUTH_ENABLED=false on owner-scoped endpoints (no /login loop) (#880 ) When the operator sets AUTH_ENABLED=false, three owner-scoped endpoints still returned 401 (api/models, api/research/, api/email/), so the front-end redirected the browser to /login and the app was unusable despite auth being turned off. require_user() in src/auth_helpers.py already documents and honors this contract (issue #622) via 'if _auth_disabled(): return ""', but these endpoints did their own get_current_user/is_configured check without it. Make _require_user (research), the /api/models anti-leak guard, and email_helpers._require_auth consult _auth_disabled() and let anonymous through (owner='') only when the operator explicitly disabled auth. The 401 protection is fully intact when AUTH_ENABLED=true. Verified end-to-end: with AUTH_ENABLED=false the SPA now loads instead of bouncing to /login.	2026-06-02 12:26:26 +09:00
Mahdi Salmanzade	66cd44b66d	fix(research): gate /api/research/spinoff on session ownership (#878 ) The spinoff endpoint authenticated the caller (_require_user) but never verified the research session belonged to them before reading the persisted report and seeding it into a new chat session owned by the caller. Any authenticated user who knew or guessed another user's research session ID could exfiltrate that user's full report into their own session — a cross-user data disclosure (IDOR). Every other endpoint in this router gates on _owns_in_memory / _assert_owns_research right after validating the session ID; spinoff was the lone exception. Add the same _owns_in_memory check (covers both the in-memory task and the on-disk JSON) so a non-owner gets a 404 before any data is read or a session is created. Add regression tests pinning the anonymous (401) and wrong-owner (404) cases.	2026-06-02 12:26:12 +09:00
mist	fca8d68aba	Match host, not substring, when resolving DuckDuckGo redirects (#886 ) _resolve_ddg_redirect (the DuckDuckGo /l/?uddg= redirect resolver used on every HTML-fallback result href) gated on `"duckduckgo.com" in parsed.hostname`. That substring test also matches look-alike hosts like `duckduckgo.com.evil.com` and `notduckduckgo.com`, so a result link on such a host would be silently rewritten to its embedded `uddg` target. Same substring-vs-hostname pitfall fixed for provider detection in `54ecfa3`. Match the host properly: exactly `duckduckgo.com` or a `.duckduckgo.com` subdomain. Genuine redirects (`//duckduckgo.com/l/...`, and relative `/l/...` hrefs resolved against `html.duckduckgo.com`) keep working. The resolver was a closure inside duckduckgo_search; lifted it (plus the new _is_duckduckgo_host helper) to module scope so it can be unit-tested directly. Adds tests/test_ddg_redirect_resolution.py (red on the look-alike case before this change, green after).	2026-06-02 12:25:56 +09:00
Mahdi Salmanzade	f691537472	fix(security): stop leaking the vault master password via process argv (#879 ) The /api/vault/unlock handler ran `bw` as `_run_bw(["unlock", req.master_password, "--raw"])`. _run_bw launches it with `asyncio.create_subprocess_exec(bw_path, *args)`, so the master password became a process argument — readable by any local user through `ps` and `/proc/<pid>/cmdline` for the lifetime of the unlock subprocess. The Bitwarden master password decrypts the entire vault, so this is a serious credential exposure on any multi-user / shared host (CWE-214). The sibling /login handler already avoids this by feeding the password on stdin; unlock was the outlier. Hand the password to `bw` through the environment instead (`--passwordenv BW_PASSWORD`), mirroring how BW_SESSION is already passed — `/proc/<pid>/environ` is readable only by the process owner, not other local users. Add regression tests pinning that the secret reaches the subprocess env and never appears in argv.	2026-06-02 12:25:43 +09:00
Alexandre Teixeira	90878c380e	Add resolve_endpoint fallback chain regressions (#890)	2026-06-02 12:24:50 +09:00
Alexandre Teixeira	d1d047dd11	Add Ollama port path detection regressions (#883)	2026-06-02 12:24:18 +09:00
Juan Pablo Jiménez	e58e4a185d	Expose Cookbook user-install CLIs in Docker (#887) Ensure pip --user console scripts like vLLM are visible to Docker runtime and dependency probes by adding the user install bin directory to PATH.	2026-06-02 12:23:29 +09:00
Tatlatat	9a1893760d	fix(cookbook): skip pip --user fallback inside virtualenvs (#388 ) (#889 ) The dependency-install fallback chain unconditionally ran 'pip install --user', which fails inside a virtualenv (and as root in LXC/containers) with 'Can not perform a --user install. User site-packages are not visible in this virtualenv.' — even though the function's docstring already noted --user is invalid in venvs. Guard the --user fallback with a venv check so it only runs outside a venv (where --user is actually valid for PEP-668 system Pythons). Derive the venv probe interpreter from the install command (python for 'pip', python3 for 'pip3'/'python3 -m pip') so the check runs in pip's own environment. System PEP-668 installs keep the --user fallback; venv/LXC-root installs no longer hit the --user error. Updated the unit test for the new chain. Closes #388	2026-06-02 12:23:20 +09:00
pewdiepie-archdaemon	966b53df77	Improve Cookbook serve diagnostics and recommendations	2026-06-02 12:15:47 +09:00
Prakhya	bdc99d746a	fix: add Browser MCP connection diagnostics (#662)	2026-06-02 11:50:17 +09:00
NovaUnboundAi	3319310942	Allow longer deep research extraction timeouts (#651) Co-authored-by: NovaUnboundAi <NovaUnboundAi@users.noreply.github.com>	2026-06-02 11:50:03 +09:00
Achilleas90	247df16e82	Fix ordered list rendering in markdown preview (#645)	2026-06-02 11:49:44 +09:00
Rasmus	1882ad68ea	fix: open #document deep-links on refresh and surface load errors (#631) Add a hashchange handler for #document-<id> so refresh / URL-bar nav opens the document, and replace the silent console.error in loadDocument with a user-facing toast. Closes #560	2026-06-02 11:48:54 +09:00
Christopher Milian	35ba56fa0c	fix: remove ollama backend filter conflict (#613)	2026-06-02 11:48:35 +09:00
nsgds	5645cce6d0	Support vLLM 0.20.2 / NIM reasoning-parser output end-to-end (surface + agent context + render) (#602 ) * fix(stream): read 'reasoning' SSE field for vLLM 0.20.2 / NIM vLLM 0.20.2 / NVIDIA NIM emit reasoning-parser output in the `reasoning` delta field; older builds use `reasoning_content`. stream_llm() read only the latter, so reasoning from models like Nemotron-3-Nano (--reasoning-parser) was silently dropped and never rendered. Accept either field. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(agent): keep reasoning_content only on the latest assistant turn The agent loop echoed each round's reasoning back as `reasoning_content` on every assistant turn, assuming vendors ignore it. Nemotron's chat template re-injects ALL prior reasoning_content as <think> blocks, and the loop is trimmed only once (before it starts) — so reasoning accumulated unbounded across rounds, bloating context and feeding the model its own prior reasoning, which reinforced repetition/looping. Strip reasoning_content from earlier assistant turns so only the most recent round carries it (still satisfies DeepSeek's thinking-mode follow-up requirement). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * fix(agent-ui): wrap each round's reasoning in its own <think> block The streamed think-tag wrapper gated on whole-message substring checks (accumulated.includes('<think>')), which only ever wrapped ONE reasoning block per message. A multi-round agent response has a reasoning phase per round, so once round 1 closed its <think>...</think>, rounds 2+ reasoning was emitted unwrapped and leaked into the visible answer. Replace the substring checks with a stateful open/close flag that toggles per think/answer cycle, so each round's reasoning gets its own collapsible block. Single-turn chat is unchanged (one open, one close). 
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(stream): reasoning/reasoning_content delta surfaces as thinking chunk Covers @pewdiepie-archdaemon's requested regression: a streamed {reasoning: ...} delta emits a thinking chunk while {content: ...} streams as normal content; plus the older reasoning_content field for backward compat. Mirrors the #591 scenario. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-02 11:48:17 +09:00
nsgds	a857d2016d	fix: don't bill self-hosted models reached by a container/service hostname (#596 ) * fix(cost): treat dotless container hostnames as local (free) getModelCost() substring-matches model names against a cloud price table, so a self-hosted 'nemotron'/'llama' model was billed at cloud rates. isLocalEndpoint() only recognized IPs / localhost / .local, not bare Docker service names (nim-nano, llamaswap), so the local-is-free guard missed them. A single-label hostname (no dot) can never be a public API -> treat as local. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test(cost): isLocalEndpoint classifies service names local, cloud FQDNs billable Covers @pewdiepie-archdaemon's requested cases: llamaswap/nim-nano + localhost/private-IPs/.local => local (free); api.openai.com/openrouter.ai/etc => not local. Drives the real function via node --input-type=module (same approach as test_reply_recipients_js.py), skips when node is absent. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-02 11:47:58 +09:00
william-napitupulu	649cacfa05	Importing files bug (#582 ) * Update Styles.css Small update to a style issue that bothered me: in the calendar window/modal, when editing a day, the time icons had a mask that overlapped the icon. I simply added a 'background-image: none' property to it. * Importing files bug I found a bug that prevented uploading files in the Documents tab of the library window. When a user selected a file, the code grabbed a reference to fileInput.files and immediately cleared the input value (fileInput.value = '') to allow re-uploading the same file later. However, because fileInput.files is a live FileList tied directly to the DOM element, clearing the input also emptied our saved variable, so the file data was lost. Note that this error might be browser-specific: it worked fine on Zen/Firefox but failed on Edge and Chrome. Fix: use Array.from, which copies the files into a new array instead of keeping a live reference.	2026-06-02 11:47:25 +09:00
Sirsyorrz	cb3d86608c	Cookbook: pick the correct vLLM tool-call-parser for Qwen2.5 (#580 ) The model-name detector treated every Qwen model as a Qwen3, falling into the qwen3_xml parser: if (n.includes('qwen3') && n.includes('coder')) return 'qwen3_coder'; if (n.includes('qwen')) return 'qwen3_xml'; // catches qwen2.5 too qwen3_xml is the parser for Qwen3 reasoning/instruct models. Qwen2.5 (and Qwen2, Qwen1.5) ship with hermes-style tool calling, so the qwen3_xml parser never recognises their tool calls — they leak through as plain text in the assistant reply and the agent silently fails to execute anything. Reproduces with: vllm serve Qwen/Qwen2.5-Coder-14B-Instruct-AWQ ... \ --enable-auto-tool-choice --tool-call-parser qwen3_xml → ask the agent to call any tool → JSON shows up in chat, no call runs. Fix the ordering: qwen3 + coder → qwen3_coder qwen3 → qwen3_xml qwen → hermes (Qwen2.5 / Qwen2 / Qwen1.5) Verified against the model matrix: Qwen2.5-Coder-14B-Instruct-AWQ → hermes Qwen2.5-7B-Instruct → hermes Qwen3-8B → qwen3_xml Qwen3-32B → qwen3_xml Qwen3-Coder-30B-A3B → qwen3_coder Qwen2-72B-Instruct → hermes Qwen1.5-7B-Chat → hermes	2026-06-02 11:47:15 +09:00
Rasmus	e73f3edc06	fix: scope chat active-document lookup to the session owner (#569 )	2026-06-02 11:46:40 +09:00
mist	f13d897093	Fix AttributeError on bullet lines in extract_memory_from_chat (#873 ) The fallback memory extractor (used by routes/memory_routes.py when the LLM extractor fails) matched list items with `r'^[-•]|\d+\.\s(.)'`. Operator precedence makes that `(^[-•]) | (\d+\.\s(.))`, so the capture group only exists on the numbered-list branch. A bullet line ("- foo") matches the first branch, so `group(1)` is None and `text_match.group(1).strip()` raises AttributeError, crashing extraction for any assistant message that contains a bullet list (i.e. most of them). Numbered lists happened to work. Group both markers, `r'^(?:[-•]|\d+\.)\s(.*)'`, so the capture applies to bullets and numbers alike. Adds tests/test_memory_bullet_extraction.py (red before, green after).	2026-06-02 11:46:06 +09:00
Kenny Van de Maele	2b39412355	Expand ~ in read_file and write_file paths (#781 ) read_file/write_file passed the raw path to open(), so a tilde path like ~/notes.txt failed ("not found") — the shell's ~ expansion never happened because there's no shell. Agents then fell back to bash to reach home-dir files. Expand ~ (and ~user) with os.path.expanduser before opening. Checks: python -m py_compile src/tool_execution.py.	2026-06-02 11:45:21 +09:00
Ernest Hysa	7669696bb0	fix(scheduler): push next_run forward on startup to stop restart double-fire (#708 ) TaskScheduler.start() aborts stale TaskRun rows but never advanced ScheduledTask.next_run. Across a restart the in-process _executing set is empty, so the first post-restart _check_due_tasks() call dispatches every task whose next_run is still in the past — and so does every subsequent poll, until the task's regular _execute_task path finally runs compute_next_run and pushes it forward. start() now queries active tasks with next_run < now and pushes each one to now + 60s. The first poll after restart sees them as not-yet-due, the task runs once normally, and compute_next_run puts the schedule back on its real cadence. Paused and not-yet-due tasks are left alone. The validator test was rewritten as a regression test asserting the opposite of the bug it originally demonstrated, plus two narrower cases to lock down the filter (only active+overdue is touched).	2026-06-02 11:43:30 +09:00
ooovenenoso	15c7cb58e7	fix(cookbook): retry 0% HF download stalls sooner (#691 ) Co-authored-by: Kevin <120500656+oooindefatigable@users.noreply.github.com>	2026-06-02 11:42:59 +09:00
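
Commit f13d897093 in the list above turns on regex alternation precedence. The following is a minimal Python sketch of the before/after behaviour, not the repository's code. It assumes the capture body is `(.*)` in both patterns (the log shows `(.)` for the old one) and uses a hypothetical helper, `extract_item`, that guards against the missing group instead of crashing.

```python
import re

# Pattern before the fix, per the commit message (capture body assumed to be .*).
# Alternation binds loosest, so this parses as (^[-•]) | (\d+\.\s(.*)).
BUGGY = re.compile(r'^[-•]|\d+\.\s(.*)')

# Pattern after the fix: both markers share one non-capturing group,
# so the capture group takes part in every successful match.
FIXED = re.compile(r'^(?:[-•]|\d+\.)\s(.*)')


def extract_item(pattern, line):
    """Hypothetical helper: return the list-item text, or None when the line
    is not a list item or the capture group did not participate."""
    match = pattern.match(line)
    if match is None:
        return None
    text = match.group(1)  # None on the bullet branch of BUGGY
    return text.strip() if text is not None else None


if __name__ == "__main__":
    for line in ["- remember the milk", "• likes tea", "2. uses vim", "plain sentence"]:
        print(repr(line), extract_item(BUGGY, line), extract_item(FIXED, line))
```

With the old pattern a bullet line matches but `group(1)` is None, which is what made the unguarded `.strip()` call raise AttributeError; the numbered-list branch was unaffected.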

302 Commits
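
The parser ordering described in commit cb3d86608c can be sketched in Python as follows. This is an illustration only: the original detector is JavaScript, `pick_tool_call_parser` is a hypothetical name, and returning None for non-Qwen models is an assumption, since the log does not show the other branches.

```python
def pick_tool_call_parser(model_name):
    """Hypothetical port of the Qwen branch of the model-name detector.

    The most specific checks come first, so a Qwen2.5 model no longer
    falls into the qwen3_xml branch.
    """
    n = model_name.lower()
    if "qwen3" in n and "coder" in n:
        return "qwen3_coder"
    if "qwen3" in n:
        return "qwen3_xml"
    if "qwen" in n:
        # Qwen2.5 / Qwen2 / Qwen1.5 use hermes-style tool calling.
        return "hermes"
    return None  # assumption: other model families are handled elsewhere


if __name__ == "__main__":
    # Model matrix taken from the commit message.
    for name in [
        "Qwen2.5-Coder-14B-Instruct-AWQ",
        "Qwen2.5-7B-Instruct",
        "Qwen3-8B",
        "Qwen3-32B",
        "Qwen3-Coder-30B-A3B",
        "Qwen2-72B-Instruct",
        "Qwen1.5-7B-Chat",
    ]:
        print(name, "->", pick_tool_call_parser(name))
```

The order of the checks carries the fix: testing the generic `qwen` substring before `qwen3` would reintroduce the original misclassification.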