odysseus

Author	SHA1	Message	Date
spooky	f9e1d38cc2	fix: diagnose vllm serve runtime issues (#1198 )	2026-06-05 11:03:04 +01:00
Kenny Van de Maele	0a2adc9c96	Add ask_user tool: agent-posed multiple-choice questions (#2111 ) Let the agent pause and ask the user a multiple-choice question when a task is genuinely ambiguous and the answer changes what it does next — choosing between approaches, confirming an assumption, picking a target — instead of guessing. Modeled on the existing `ui_control` marker pattern: the `ask_user` tool returns an `ask_user` payload that the agent loop emits as an SSE event and then ends the turn. The frontend renders the question with clickable option buttons, a free-text "Other" input, and an x to dismiss; the user's choice is sent as the next message and the agent resumes with it in context. - src/tool_execution.py: `ask_user` handler — pure UI marker, no I/O. Validates a non-empty question + 2..6 options, normalizes string/object options, returns the payload. - src/agent_loop.py: emit the `ask_user` event and break the round loop so the turn ends and waits for the user's selection. Stream the question as assistant text so it persists/replays (prevents a re-ask loop). - Registration: TOOL_TAGS, ALWAYS_AVAILABLE, BUILTIN_TOOL_DESCRIPTIONS, FUNCTION_TOOL_SCHEMAS, the system-prompt blurb. Not admin-gated (any user can be asked); the structured args serialize via the default json.dumps path. - routes/chat_routes.py: relay the `ask_user` event to the client. - static/js/chat.js + static/style.css: render the question card (options + free-text Other + dismiss x; removed once answered). Reuses CSS vars and the .modal-close button; emoji go through the monochrome-SVG pipeline. Bump chat.js cache pin. - tests/test_ask_user_tool.py: payload, multi flag, string options, option cap, validation errors, serializer round-trip, registration.	2026-06-05 11:49:11 +02:00
Lucas Daniel	f5d834b0c5	fix(cookbook): surface backend diagnosis when serve fails in background (#1636 ) * refactor(cookbook): move _diagnose_serve_output to module level in cookbook_helpers Extracts the nested _diagnose_serve_output function from inside setup_cookbook_routes() and moves it to module level in cookbook_helpers.py, alongside the other helper functions it logically belongs with. No behaviour change — the function is now importable directly for testing and by other callers without going through the route factory closure. * fix(cookbook): surface backend diagnosis when serve fails in background The background poll (_pollBackgroundStatus) already received `diagnosis` and `cmd` from /api/cookbook/tasks/status but discarded both. When a serve job died while the Cookbook modal was closed, reopening it showed only a red error badge with no context. - Persist live.diagnosis into task._backendDiagnosis in localStorage so it survives modal close/reopen and page refresh - Persist live.cmd into task.payload._cmd for agent-spawned tasks so the crash report includes the actual command - After _renderRunningTab(), walk rendered cards and call _showDiagnosis() for any that have a stored _backendDiagnosis but no panel yet - In _renderTaskCard(), use _backendDiagnosis as a fallback when the client-side _terminalServeDiagnosis() finds nothing * test(cookbook): add coverage for _diagnose_serve_output error patterns 10 tests verifying the 16 serve-failure patterns: - CUDA OOM, port-in-use, vLLM missing, gated model - Traceback fallback fires without startup success marker - Traceback suppressed when server actually started - Clean/empty output returns None - trust-remote-code and no-GGUF patterns	2026-06-05 09:52:07 +01:00
Kenny Van de Maele	367858a587	Merge branch 'main' into dev Bring main's maintainer-curated work (cookbook scheduler, calendar rendering/sync, settings polish, agent debug loop) into dev so dev is a superset of main (resolves the dev/main drift, #2543).	2026-06-05 10:50:51 +02:00
Vykos	b19e5693af	Constrain embedding model cache paths (#2849 )	2026-06-05 10:46:48 +02:00
Vykos	d4d168f972	Harden emoji SVG proxy responses (#2842 )	2026-06-05 10:31:58 +02:00
Vykos	194985b5e1	Constrain gallery filenames to image root (#2828 )	2026-06-05 10:29:11 +02:00
Vykos	2cae5a681d	Sanitize calendar export filenames (#2840 )	2026-06-05 10:18:09 +02:00
ghreprimand	e0097c9c48	Strip tz in _parse_dt dateutil fallback (naive-datetime contract) (#2557 ) _parse_dt documents that it returns naive datetimes (CalendarEvent.dtstart is naive) and every return path strips tz — except the last-resort dateutil fallback, which returned dateutil's value verbatim. An offset-bearing non-ISO input (e.g. RFC-2822 'Mon, 05 Jan 2026 14:00:00 +0900', which fromisoformat rejects but dateutil parses) leaked a tz-aware datetime into the naive dtstart column via create_event/update_event -> _parse_dt_pair. On read-back, _expand_rrule compares ev.dtstart against naive window bounds and raised 'can't compare offset-naive and offset-aware datetimes' (500 / no events). Normalize the fallback to UTC-naive, mirroring the fromisoformat branch. Naive inputs are unchanged. (cherry picked from commit b03b6b91df21c1a3ad3c447f23f35b8b19e6d1b1) Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-05 08:18:26 +01:00
pewdiepie-archdaemon	e2f449f4ef	Cookbook scheduler + serve: schedule via Tasks, Stop verifies kill, Ollama auto port-pick - Schedule cookbook serves through the existing ScheduledTask system: the serve preset gets a ^ button next to Launch that opens a daily/hourly/ weekly form mirroring the admin-switch style; the schedule action runs action_cookbook_serve, which delegates to /api/model/serve and stamps the resulting task with _scheduledStopAtMs. A background cookbook_serve_lifecycle loop ticks every 60s and kills any serve whose window has ended, also dropping the auto-registered endpoint so the model picker doesn't keep pointing at a dead server. - Stop and remove on a Running serve now awaits the SSH/tmux kill, re-checks tmux has-session, and surfaces an error toast (leaving the row) when the kill failed. Previously fire-and-forget, so a failed SSH/tmux call silently left the live serve running while the row vanished from the UI. - Cookbook tasks/status orphan-adoption sweep no longer requires the serve-/cookbook- session-id prefix; any tmux session whose pane is running a known model-server process gets auto-pulled into Running. Without this loosening, a cookbook-launched serve whose tmux id fell back to a bare number was invisible — you couldn't see it, let alone stop it. - Ollama serve always launches a fresh process under cookbook's tmux (no more monitor-mode reattach to a systemd/Docker ollama Stop can't reach). The handler pre-picks a free port by probing the target host over SSH and mutates req.cmd's OLLAMA_HOST so the runner script AND the auto-registered endpoint agree on the same bind port. - Auto-register uses host.docker.internal (when running inside Docker) instead of localhost, matching the URL /setup adds for Ollama by hand. Local cookbook serves now produce a chat-reachable endpoint on first launch. - Cascade-delete: removing a scheduled cookbook task also deletes any linked calendar event (cookbook_task_id marker in the description). - Tasks list groups cookbook_serve under a "Cookbook" category that sorts above the rest, so scheduler-launched serves are easy to find.	2026-06-05 14:41:43 +09:00
pewdiepie-archdaemon	f8aaeab245	Merge remote-tracking branch 'origin/dev'	2026-06-05 12:14:34 +09:00
pewdiepie-archdaemon	f19ac6ed03	Merge branch 'main' of github.com:pewdiepie-archdaemon/odysseus # Conflicts: # static/js/cookbookRunning.js	2026-06-05 11:23:15 +09:00
Abylaikhan Zulbukharov	1d80bf5e65	feat(mcp): add Streamable HTTP transport with OAuth 2.0 (#1033 ) * feat(mcp): add Streamable HTTP transport with OAuth 2.0 Odysseus could only reach MCP servers over stdio and SSE, so modern remote servers like https://mcp.higgsfield.ai/mcp (Streamable HTTP, gated behind OAuth) could not be connected. Add an `http` transport that connects via the SDK's streamablehttp_client and authenticates with the SDK's OAuthClientProvider: RFC 9728 protected-resource discovery, RFC 8414 authorization-server metadata, Dynamic Client Registration, authorization-code + PKCE, and token refresh. A small bridge (src/mcp_oauth.py) connects the SDK's blocking callback to the existing web callback route via an asyncio.Future keyed by the OAuth `state`, and the dynamic client registration plus tokens persist per-server in a new encrypted `oauth_tokens` column. The connect runs as a bounded background task so the "Add server" request returns immediately; redirect_handler publishes needs_auth + auth_url to connection state as soon as discovery/DCR completes (which can exceed the bounded wait), and the UI polls until connected. Remote users finish via the existing paste-back flow. The Google OAuth path is left unchanged. - core/database.py: encrypted oauth_tokens column + migration - src/mcp_oauth.py: OAuth provider, DB-backed TokenStorage, state registry - src/mcp_manager.py: http dispatch, background connect, _connect_http - routes/mcp_routes.py: http validation, needs_auth/auth_url, callback bridge - static/js/settings.js: Streamable HTTP option + OAuth flow with polling - tests: 5 new unit tests (transport dispatch, registry, token storage) Verified against the live Higgsfield server: discovery, DCR (client_id issued), loopback redirect accepted, and a PKCE authorization URL with needs_auth status. No regressions (full suite delta is only the 5 added passing tests). * fix(mcp): address PR #1033 review feedback - mcp_oauth: derive redirect URI from OAUTH_REDIRECT_BASE_URL/APP_PUBLIC_URL (default http://localhost:7000) instead of hardcoding the port - mcp_oauth: leave OAuth scope unset so the SDK derives it from the server's WWW-Authenticate/protected-resource metadata; hardcoding an OIDC scope broke non-OpenID MCP servers (verified: Higgsfield still gets its server-derived scope) - mcp_oauth: prune abandoned OAuth flows (_prune_stale + _pending_ts) so the module-level registries can't grow unbounded - mcp_oauth: persist tokens/client-info in a single DB session/commit (_update) instead of a load+save double round-trip - mcp_manager: cancel and drop the background connect task in disconnect_server so a deleted server stops publishing status - database: document why the oauth_tokens migration uses TEXT while the model declares EncryptedText (encryption is applied at the Python layer) - settings.js: surface persistent OAuth-poll failures and an explicit timeout message instead of silently swallowing errors - tests: cover the stale-flow pruning * static/js/settings.js now shows an in-flight loading state on the buttons that fire requests:	2026-06-05 02:40:52 +02:00
anduimagui	f9c81f3c8d	fix(email): scope AI caches by owner (#2695 )	2026-06-05 02:21:50 +02:00
Kenny Van de Maele	2be3779e6e	feat: Add workspace: confine agent tools to a folder (#1103 ) * feat: Add workspace: confine agent tools to a folder Pick a server folder as the agent's workspace so its file/shell tools work there and don't touch files outside it. File tools are hard-confined; bash/ python run with cwd set to the folder. Includes a slash command: `/workspace` (alias `/ws`) — show / `set <path>` / `clear` / `pick` (open the directory browser). - routes/workspace_routes.py: GET /api/workspace/browse (admin-only). - src/tool_execution.py: hard path confinement for read_file/write_file; bash/python cwd. Threaded route → stream_agent_loop → execute_tool_block. - src/agent_loop.py: workspace note prepended to the system prompt. - static/: overflow menu item, input-bar pill, directory-browser modal, and the /workspace slash command. - tests/test_workspace_confine.py. * Wire workspace confinement into tools that landed after this PR edit_file (#1239) and grep/glob/ls (#1670) merged after workspace-confine was written, so they bypassed the workspace boundary. Thread the workspace through: - edit_file: _do_edit_file resolves via _resolve_tool_path_in_workspace - grep/glob/ls: _resolve_search_root confines to the workspace (root + paths) - bash/python/bg cwd: workspace or _AGENT_WORKDIR (keep the #2586 data-dir default when no workspace is set) Tests cover edit_file + grep/ls confinement (inside ok, outside rejected). * Workspace picker: editable path bar + modal style cohesion + cross-platform hardening - Make the current-folder strip an editable address bar: type/paste a full path and press Enter to navigate (also reaches other Windows drives and hidden dirs the up-only browser cannot). - Reuse shared modal CSS: drop bespoke .workspace-modal-content/.workspace-btn* in favour of base .modal-content/.modal-body and the .confirm-btn button family; separators/hover use var(--border). Net -31 CSS lines. - Fix the path field overflowing the modal right edge (flex stretch + margin vs an overflow:auto scrollbar-feedback loop): full-bleed, no h-margin. - Cross-platform confinement: normcase the workspace commonpath check so containment holds on case-insensitive filesystems (Windows/macOS). - Make tests OS-portable: sibling temp dirs instead of /etc, python os.getcwd() instead of pwd. 5 pass.	2026-06-05 00:06:37 +02:00
pewdiepie-archdaemon	a260e0abd4	Revert calendar-based cookbook scheduler Reverts `b98ee04` + `4ed48ba` + `a19b6d2`. Calendar events turned out to be the wrong abstraction for scheduling model serve windows. Pivoting to the existing ScheduledTask infrastructure (cron / daily / weekly recurrence, next_run tracking, edit-from-Tasks-tab UI) in a follow-up commit. The ScheduledTask path: - reuses dispatch logic the rest of the app already understands - drops the calendar dependency entirely (no auto-created "Cookbook" calendar, no calendar.js hook) - shows up in the Tasks UI that already exists for everything else What this revert removes: - src/cookbook_scheduler.py — calendar reconciler - routes/cookbook_schedule_routes.py — /api/cookbook/schedule/* endpoints - static/js/cookbookSchedule.js — Schedule modal / settings card - cookbook_scheduler_enabled + cookbook_schedule_calendar_href settings keys - The window.cookbookOpenScheduleForm hook in calendar.js - The Schedule button + paired-button CSS in cookbookServe.js + style.css	2026-06-05 06:57:21 +09:00
Kenny Van de Maele	64d65b73c1	feat: round-limit handling — Continue affordance at the cap + configurable cap (#1999 ) * feat: round-limit handling — Continue affordance at the cap + configurable cap When the agent loop runs out of rounds (per-message step cap, default 20) while still actively using tools, it stopped silently mid-task. Now: 1. The loop emits a `rounds_exhausted` SSE event at the cap, and the UI shows a "Continue" pill at the bottom of the chat that resumes the task from where it left off. Repeated cap-hits each get a fresh Continue (multiple continues in a row). 2. The cap is configurable in Settings → Agent ("Max steps per message"), validated on the client, at the save endpoint, and at the read site. - src/agent_loop.py: track `_exhausted_rounds` (set only when a full tool-executing round completes on the last allowed round — i.e. the agent wanted to keep going); emit `{"type":"rounds_exhausted","rounds":N}` (logged). - routes/chat_routes.py: read `agent_max_rounds` (clamped 1..200), pass as `max_rounds`; forward the new event through the SSE relay. - routes/auth_routes.py: validate numeric settings on save (int + clamp; agent_max_rounds 1..200, agent_max_tool_calls 0..1000; 400 on non-int). - src/settings.py: default `agent_max_rounds = 20`. - static/: Settings input + client-side clamp; the Continue pill (reuses the existing .stopped-indicator / .continue-btn classes and theme vars --border/--fg/--bg/--accent); appended to the chat container so it survives the message re-render at stream finalize. chat.js cache version bumped. * test: cover rounds_exhausted emission (cap-hit vs normal finish) Drives the real stream_agent_loop with mocked LLM stream / tool exec / settings: a tool block every round exhausts the cap and must emit rounds_exhausted; a plain answer hits the done-break and must not. Guards the for/else logic.	2026-06-04 22:36:05 +02:00
Ocean Bennett	e69298888b	fix(history): block compact during active runs (#2635 )	2026-06-04 21:50:16 +02:00
Kenny Van de Maele	1cd0aa2b8c	feat(provider): add GitHub Copilot provider with device-flow auth (#1480 ) * feat(provider): add GitHub Copilot provider with device-flow auth Adds GitHub Copilot as a model provider, so Copilot models (gpt-4o/4.1/5, Claude, Gemini, …) work through the normal chat + agent loop, incl. native tool calling and vision. Auth is one-click via the GitHub OAuth device flow; the access token is stored as the endpoint's (encrypted) api_key and sent directly as `Authorization: Bearer` (no Copilot-token exchange, no refresh — matching how editors talk to the Copilot API). Copilot is a normal ModelEndpoint detected by host; the only provider-specific behaviour is a small set of required request headers, injected centrally. Sign-in is available from Settings → model endpoints ("Connect GitHub Copilot") and from chat via `/setup copilot`. - src/copilot.py (new), routes/copilot_routes.py (new): constants, header builders, device-flow start/poll, model discovery, owner-scoped endpoint provisioning. - src/llm_core.py, src/endpoint_resolver.py: detect `copilot`, inject headers, per-request x-initiator/vision. - src/agent_loop.py: allowlist api.githubcopilot.com for native tool schemas. - src/model_context.py: known context windows for Copilot (no unauthenticated /models probe). - static/, README, tests/test_copilot.py. Tidy copilot_routes: clarify supports_tools, note _PENDING is per-process	2026-06-04 21:13:14 +02:00
Ocean Bennett	ca32b43b38	fix(history): tolerate tool-call turns during compact (#2626 )	2026-06-04 20:59:41 +02:00
Afonso Coutinho	ed933ac232	fix: renaming a user leaves their API tokens resolving to the old owner (#1932 ) * fix: renaming a user leaves their API tokens resolving to the old owner * Drive rename token-cache test through the real auth resolver instead of patching a closure	2026-06-04 20:37:59 +02:00
pewdiepie-archdaemon	b98ee04e2f	Cookbook scheduler: reuse the standard calendar event card + auto-create Cookbook calendar Drop the custom Schedule modal in favor of opening the calendar's existing event-creation form pre-filled with the model's name + cookbook YAML in the description. The user lands in the same event editor they already know from regular calendar use, just pointed at the auto-created "Cookbook" calendar. Backend: - POST /api/cookbook/schedule/ensure-calendar — idempotent: creates a calendar named "Cookbook" if one doesn't exist for the current user, saves its href into cookbook_schedule_calendar_href, flips cookbook_scheduler_enabled on. Verifies the saved href against /api/calendar/calendars on every call so a manually-deleted calendar self-heals. Frontend: - calendar.js: expose window.cookbookOpenScheduleForm(draft) which opens the calendar modal (if not open), calls _showEventForm, then pre-fills summary / description / rrule / calendar dropdown. Force-expands the "Add details" section so the user can see which calendar it's heading into. - cookbookSchedule.js: Schedule-button click now calls ensure-calendar, builds the cookbook: YAML block, and routes to window.cookbookOpenScheduleForm instead of openModal(). The legacy custom modal stays as a fallback for the case where calendar.js hasn't loaded. UX tweak: - cookbookServe.js: replace the standalone "Schedule…" text button with a small icon-only button (clock SVG) glued to the right edge of Launch. The pair forms one visual unit — Launch on the left, schedule-now on the right — sharing a thin divider. CSS handles the rounded corners + divider.	2026-06-05 02:52:07 +09:00
Afonso Coutinho	abe04436a0	fix: merge-last-assistant deletes tool/system rows from the DB (history desync) (#1929 )	2026-06-04 19:47:08 +02:00
Giuseppe	bc83479f94	fix: bool('false') is True coerces endpoint toggles incorrectly (#2361 ) Python's bool('false') returns True because the string is non-empty. A JS client serialising a boolean as the string 'false' would have supports_tools or is_enabled silently flipped to True — so 'disable tool support' would actually enable it. Use an explicit lookup dict for supports_tools and a case-insensitive string check for is_enabled so both string and native bool inputs are handled correctly. Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-04 19:43:38 +02:00
pewdiepie-archdaemon	a19b6d2d4d	Cookbook scheduler: calendar events drive model serve windows (experimental, feature-flagged) Add a calendar-driven scheduler so a user can pick a model in Cookbook, click "Schedule…" instead of "Launch", choose time windows + days of the week + (optional) end date, and have Odysseus auto-launch the serve when the window starts and hard-kill it when the window ends. The calendar IS the source of truth — events on a designated calendar are interpreted as serve schedules, so editing the event in the calendar UI immediately changes the schedule. Whole feature is gated by setting `cookbook_scheduler_enabled` (default False). Disabling the setting silences the reconciler and the API refuses requests; setting + three new files = entire surface, easy to revert. New files: - src/cookbook_scheduler.py — background reconciler: ticks every 60s, reads next ±90s of calendar events on the designated calendar, launches/kills serves to match. Honors "refuse if GPUs busy" (skips with reason, no retry). Adopts pre-existing manual serves matching the event's model so window-end cleanup still applies. Tags scheduler-owned tasks with `_scheduledBy: <event_uid>` so it never kills serves it doesn't own. - routes/cookbook_schedule_routes.py — POST /api/cookbook/schedule/from-cookbook builds RRULE+ICS events from the modal's input (model, slots[], days[], until). GET /upcoming returns the next 24h with per-event status (scheduled / running / adopted / skipped / failed / ended) for the UI. POST /reconcile-now manually kicks the reconciler. - static/js/cookbookSchedule.js — Schedule button click handler + modal. Daily/hourly time slot picker, multi-slot ("+ add another time slot"), weekday chips with Weekdays/Weekend/Every-day quicksets, optional Until date. Calls /from-cookbook on save. Whole module is a single IIFE; deleting the file plus its <script> tag removes the UI surface. Existing files touched (minimal): - app.py: register the new router + add the reconcile loop as a startup task (~10 lines, all in one block). Reconcile loop checks the feature flag on every tick, so leaving it running with the flag off costs ~one settings lookup per minute. - static/index.html: one new <script> tag for cookbookSchedule.js. - static/js/cookbookServe.js: add a "Schedule…" button next to the existing Launch button. Hidden by default; cookbookSchedule.js reveals it after confirming the feature flag is on. - static/style.css: ~80 lines for the modal styles (mobile-aware via @media). User choices baked in: - Calendar events are the source of truth. - Refuse to launch if GPUs busy (skip + log reason in scheduler.events[uid].reason). - Hard kill at event end. - No retry on a skipped event within the window. - Multi-slot per day supported (one calendar event per slot, shared RRULE). - Pre-existing manual serves get adopted at window start so they're killed at end. Known follow-ups (not in this commit): - Settings UI to pick the schedule calendar + toggle the feature flag. - Calendar event color/badge for status (running/skipped/failed). - "Lazy launch on first request" — currently launches at event start. Replacing _launch_serve with a proxy that defers vllm until the first chat request is a contained future change.	2026-06-05 02:35:23 +09:00
RaresKeY	c12c2aa233	fix: normalize Gemma 4 thought-channel output (#2224 )	2026-06-04 19:26:58 +02:00
WasserEsser	20cc23c9bd	fix(models): make pinned models visible in chat UI (#2481 ) Two bugs prevented pinned models from appearing in the chat model picker: 1. _fetch_models() only used _cached_model_ids(), ignoring pinned_models. Since Fireworks AI doesn't list kimi-k2p6-turbo in /v1/models, the cached list was empty, so the endpoint showed as offline with no models. 2. _curate_models() filtered unknown pinned IDs into models_extra, but the chat UI only reads models (primary list). Pinned models stayed invisible. Fix: use _visible_models() to merge cached + pinned, then promote pinned IDs from models_extra to models so they appear in the dropdown. Closes #1521 follow-up	2026-06-04 19:17:37 +02:00
nubs	050283c145	fix(mcp): confine oauth file paths (#2272 )	2026-06-04 19:10:23 +02:00
Kenny Van de Maele	a7e60ca7ec	Merge pull request #2214 from vdmkenny/chore/rm-unused-upload-dir-import chore: remove unused UPLOAD_DIR imports in document_routes	2026-06-04 17:11:15 +02:00
pewdiepie-archdaemon	9112861d8e	cookbook agent debug loop: persistent log files, auto-adopt orphan tmux, Codex/Claude skill parity Three converging fixes so the chat agent + external Codex/Claude skills can actually debug a crashed serve instead of staring at a post-crash neofetch banner: * Serves now `tee` to /tmp/odysseus-tmux/SESSION.log on the host running them. Runner saves fds 3/4 before the tee and restores them right before `exec ${SHELL}`, so the post-crash interactive zsh banner does NOT pollute the log file. * `tail_serve_output` (chat agent) and `/api/codex/cookbook/output/{sid}` (Codex+Claude skills) both prefer the persistent log file over the tmux pane. Pane is fallback for sessions predating the tee runner. Default tail bumped 150 -> 400. * `list_served_models` "recent log" snippet seeks to the Traceback line instead of showing the last 6 lines (which was always the bash prompt). Cookbook auto-adoption sweep on `/api/cookbook/tasks/status`: every 20s (rate-limited) the cookbook SSHes each configured server, finds `serve-` / `cookbook-` tmux sessions running an actual model process (vllm/python/llama-server/etc., filtered via `pane_current_command`), and writes them into state.tasks. So when the agent falls back to raw ssh+tmux, the session appears in the Cookbook UI on the next poll. `serve_model` error path now reads `data["detail"]` in addition to `data["error"]` so the FastAPI HTTPException message ("Invalid characters in cmd") actually reaches the agent instead of being swallowed as a generic "Serve failed". Tool description updated to warn against `cd …`/`source …`/`&&` prefixes. Intent-without-action supervisor in agent_loop: when the model writes "Let me tail the output" / "I'll check the logs" / "Let me investigate" and ends the turn without emitting a tool call, the loop injects a sharp system nudge ("You said you would X — DO IT NOW") and continues. Capped at 2 nudges per chat so a model that genuinely cannot use the tool does not pin the loop. Codex/Claude skill parity: adds `/cookbook/cached`, `/cookbook/presets`, `/cookbook/preset/{name}`, `/cookbook/adopt` so external agents have the same surface as the chat agent. SKILL.md docs + odysseus_api.py wrapper updated for both bundles. `adopt_served_model` promoted to the always-on tool set so the agent has a documented fallback when serve_model rejects a cmd. Also various cookbook UI tweaks accumulated alongside the above (cookbook.js, cookbookRunning.js, cookbookServe.js, cookbook-diagnosis.js, settings.js, style.css).	2026-06-04 23:27:18 +09:00
Massab K.	594775dc4b	Fix issue 135 chat context bleed (#281 ) * Fix issue 135 chat context bleed * Guard task delivery metadata access	2026-06-04 13:27:46 +01:00
Alexander Kenley	7b45a94b6d	Fix calendar routing and user-local time context (#408 ) * fix(chat): add user-local time context * fix(chat): route calendar follow-up phrasing * refactor(chat): log tool intent routing reasons * test(chat): align user time prompt shim --------- Co-authored-by: Alex Kenley <Alex.Kenley@threatvectorsecurity.com>	2026-06-04 13:20:04 +01:00
tanmayraut45	f59edee611	Support extra CA bundle for private-CA LLM providers (#769 ) Adding GigaChat (Sber) or an on-premise enterprise LLM gateway as a model endpoint fails on first probe with CERTIFICATE_VERIFY_FAILED: self-signed certificate in certificate chain (_ssl.c:1000) because their TLS chain is signed by a private root CA (Russian Trusted Root CA for GigaChat; corporate CA for on-prem) that isn't part of the default system / certifi trust store. The endpoint shows offline in the picker even though the URL and API key are correct (issue #722). The right fix is to extend the trust store, not to weaken verification. This change: - src/tls_overrides.py: new module that resolves an opt-in env var LLM_CA_BUNDLE at import time, builds a shared SSLContext via ssl.create_default_context() (so the system / certifi bundle is loaded first) and layers the operator's PEM on top with load_verify_locations(). Exposes llm_verify() returning a value suitable for httpx `verify=`. Defaults to True (httpx built-in trust) when the env var is unset, when the file is missing, or when the PEM fails to load — verification is never silently disabled, the warning is logged and we fall back to the safe path. - src/llm_core.py: thread llm_verify() into the shared AsyncClient used by stream_llm / streaming completions. - routes/model_routes.py: thread llm_verify() into the five httpx.get call sites in _probe_endpoint / _ping_endpoint so adding a private-CA endpoint goes green on the very first probe and the picker stops showing it offline. - .env.example: document LLM_CA_BUNDLE with the GigaChat case as the concrete example. Deliberately NOT included: a verify=False knob (global or per-host). Disabling verification exposes the affected endpoint to MITM, and the operator-supplied bundle is the correct fix for legitimate private-CA providers — so the only switch in this PR is the safe one. Closes #722.	2026-06-04 13:18:50 +01:00
ghreprimand	28c43121d7	Fix session export 500 on multimodal/None message content (#1984 ) txt/html/md export joined and string-munged message.content directly, so a multimodal turn (content is a list of blocks) crashed export with a TypeError on join (txt) / AttributeError on .replace (html), and None content (tool-only assistant turns) rendered as the literal 'None'. Add a _content_to_text helper that flattens string/list/None to plain text and apply it at the three export sites. JSON export is unchanged (it serializes structured content correctly). Plain-string content is returned unchanged, so existing exports are identical. Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-04 12:53:44 +01:00
Giuseppe	f6a5f6592f	fix: log warnings on silently swallowed agent and endpoint failures (#2367 ) get_builtin_overrides() was swallowing all exceptions with a bare `except Exception: pass`, so misconfigured tool-description overrides would silently produce wrong agent behaviour with no log trace. The background endpoint refresh loop had the same pattern: any probe failure was silently ignored, giving operators no signal that the refresh was broken. Also removes a circular self-import (`from src.agent_loop import _build_base_prompt`) inside _build_system_prompt; the function is already in scope and the import created a latent circular reference risk. Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-06-04 12:29:31 +01:00
Kenny Van de Maele	e9dfd1a747	Remove unused UPLOAD_DIR imports in document_routes routes/document_routes.py imports UPLOAD_DIR from src.constants in 8 separate function bodies but never uses it (pyflakes: 'imported but unused' ×8). Drop the dead imports — no behaviour change.	2026-06-04 13:17:21 +02:00
Kenny Van de Maele	d25052e43f	chore: remove unused imports in calendar_routes (#2221 ) routes/calendar_routes.py imports several names it never uses (pyflakes): typing.Tuple, dateutil.rrule.{rruleset,DAILY,WEEKLY,MONTHLY,YEARLY}, and auth_helpers.get_current_user. Drop them (the whole DAILY/WEEKLY/MONTHLY/ YEARLY line goes; rrulestr and require_user are kept). No behaviour change.	2026-06-04 12:13:18 +01:00
Yuri	a2e691da2b	fix(models): stabilize proxy endpoint refresh behavior * fix: support large proxy model endpoint refresh Large OpenAI-compatible proxy endpoints can expose hundreds of models and make /v1/models slow. Treating those endpoints like local model servers caused model picker opens and background probes to repeatedly hit /models, producing timeouts and making otherwise usable endpoints appear offline. Make model endpoint discovery cached-first for normal UI usage, add explicit proxy/API classification and refresh policy fields, exclude proxy/API endpoints from aggressive local probing, and preserve cached models when refresh fails. Manual Test/Add/Refresh actions still fetch the full model list with longer timeouts so users can intentionally import large proxy model lists without blocking normal model picker usage. * fix: preserve endpoint ping status semantics	2026-06-04 04:56:11 +01:00
Afonso Coutinho	09fe308720	fix(auth): revoke API tokens when deleting users * fix: revoke API bearer tokens when their owner is deleted * Re-run CI * Invalidate bearer-token cache on user delete so warmed cached tokens stop working	2026-06-04 04:44:34 +01:00
Rudy Wolf	1c43daa564	fix(compare): stop blind mode leaking model identities via session names (#1318 ) Blind Compare anonymized the pane headers, but each pane still created a helper chat session named "[CMP] <real-model>" and GET /api/sessions returned the session's model field. So the sidebar and the session-list API let a user map "Model A" back to its real model before voting, defeating the blind test. - Frontend (static/js/compare/index.js, panes.js): in blind mode, name helper sessions by their neutral slot ("[CMP] Model A") instead of the model, matching the existing blind pane labels. - Backend GET /api/sessions (routes/session_routes.py): blank the model field for [CMP]-prefixed helper sessions via a new _public_model helper. - Backend /api/compare/start (routes/compare_routes.py): name blind sessions by slot and withhold model_left/model_right/mapping from the blind response (revealed at /vote). - Tests: tests/test_blind_compare_redaction.py. Fixes #1285. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-04 04:39:01 +01:00
hawktuahs	3d8c364689	[Bash] Fix Windows cookbook background tasks (#676 ) * Fix Windows cookbook background tasks * Add Windows Cookbook reliability follow-ups	2026-06-04 04:30:01 +01:00
Paulo Victor Cordeiro	bd4067cf83	fix: guard remaining uid.decode() calls in auto-classify spam path (#1860 ) Two more bare uid.decode() calls at lines 889 and 897 crash with AttributeError when uid is already a string. Applies the same isinstance guard used everywhere else in this function.	2026-06-04 04:06:10 +01:00
.bulat	e340674c12	Persist user prefs atomically (#1840 )	2026-06-04 03:55:22 +01:00
Vykos	193dc2f085	fix(uploads): bound direct upload reads * Stabilize full test collection * Add bounded reads for direct uploads	2026-06-04 00:32:50 +01:00
pewdiepie-archdaemon	48f5182286	Merge branch 'codex-on-main'	2026-06-04 08:27:41 +09:00
pewdiepie-archdaemon	089246614d	feat: Claude Agent integration + cookbook reconnect + UI polish - Claude Agent integration: AGENT_CONFIGS.claude, INTG_TYPES.claude, setup_claude_routes + integrations/claude/ skill bundle. Wired in app.py alongside the existing Codex integration; same scope-gated /api/codex/* backend; agent form has new description so users know it's setup for an external CLI, not an agent streamed inside Odysseus. - Remove mark_email_boundaries action: not good enough yet. Stripped from task UI, scheduler defaults, registry, tool schema, clear-cache route. Added to RETIRED_HOUSEKEEPING_ACTIONS so existing rows + their task_runs auto-purge on startup. - Cookbook download reliability: "Reconnect" fix button in the crash diagnosis runs _reconnectTask after probing has-session. 30s confirm window before marking a download "done" — kills the Finished/Downloading flicker when tmux briefly drops between captures. - Mobile UX: tap anywhere on a note card body opens the editor; Update button morphs to Archive when no text was edited; bell icon accent-colored; chip-trashing notif pills fade so only the icon rotates into the trash zone. - Settings integrations: SVG-per-provider in email + API preset dropdowns, custom drop-up-aware menus, accent sub-header icons (IMAP/SMTP), consistent card styling between list + edit, contacts Edit/Delete icons, agent form description copy.	2026-06-04 08:27:26 +09:00
Mahdi Salmanzade	271489a10c	fix(research): owner-scope endpoint resolution POST /api/research/start (require_privilege "can_use_research" — a normal user, not admin) resolves an endpoint two ways and feeds the row's decrypted api_key + base_url into research_handler.start_research(llm_endpoint=, llm_headers=): 1. body.endpoint_id -> query(ModelEndpoint).filter(id == endpoint_id, is_enabled == True).first() 2. no endpoint + nothing configured -> query(ModelEndpoint).filter( is_enabled == True).first() Neither was owner-scoped. ModelEndpoint is a per-user resource (core/database.py: non-null owner = private, "the model picker only shows the endpoint to that user"). So a research-privileged user (or a chat-scoped token) could pass another user's PRIVATE endpoint_id — or fall through to their first-enabled row — and run research against that owner's endpoint: spending their API key / quota and reaching whatever internal base_url they configured (SSRF). This is the same multi-tenant owner-scoping class already fixed for companion/models, the /api/v1/chat session gate (#870), and the /api/v1/chat first-enabled fallback (#1045, _first_enabled_endpoint). These two sinks on the research path were missed. Extract `_owned_enabled_endpoint(db, owner, endpoint_id=None)` which scopes via the shared owner_filter helper (own rows + legacy null-owner shared rows), matching webhook_routes._first_enabled_endpoint and session_routes._owned_endpoint. Used for both sinks. A scoped miss on the explicit-id path returns the existing 404 ("Endpoint not found or disabled"), so endpoint existence isn't revealed. A null/empty owner stays a no-op (single-user / legacy mode). Add regression tests pinning both lookups (cross-owner rejected, own-row allowed, legacy shared-row allowed, disabled-skipped, fallback never borrows, null-owner no-op).	2026-06-03 23:19:28 +01:00
Mahdi Salmanzade	729a30a10e	fix(compare): owner-scope endpoint key lookup POST /api/compare/start (a normal-user route — no admin gate) creates two caller-owned [CMP] sessions from caller-supplied endpoint URLs (endpoint_a / endpoint_b), then copies a ModelEndpoint's decrypted api_key into each session's headers by matching on URL: ep = db.query(ModelEndpoint).filter(ModelEndpoint.base_url == base).first() The match was not owner-scoped. ModelEndpoint is per-user (core/database.py: non-null owner = private, "the model picker only shows the endpoint to that user"). So a user could pass another user's endpoint base_url, have that owner's api_key copied into a [CMP] session they own, then drive /api/chat_stream on that session — spending the victim's API key / quota and reaching whatever base_url they configured. Same multi-tenant owner-scoping class already fixed for companion/models, /api/v1/chat (#870, #1045), session create/switch-model (#1093), and /api/research/start (#1099). Extract `_owned_endpoint_by_url(db, base_url, owner)` which scopes the match via the shared owner_filter helper (own rows + legacy null-owner shared rows), mirroring session_routes._owned_endpoint. A scoped miss copies no key (the comparison session simply carries no borrowed credential). A null/empty owner stays a no-op (single-user / legacy mode). Add regression tests pinning the scoped match (cross-owner rejected, own-row allowed, legacy shared-row allowed, no-match None, null-owner no-op).	2026-06-03 23:17:12 +01:00
Afonso Coutinho	b6607d219d	fix(memory): owner-scope memory route session access	2026-06-03 23:13:56 +01:00
Sushanth Reddy	c58cb067f2	fix(calendar): avoid double-encrypting CalDAV password cfg is loaded from prefs and already holds the existing, already-encrypted password. When the edit form was re-submitted without re-typing the password, the elif branch called encrypt() on that stored ciphertext, compounding the encryption on every save and eventually breaking sync with a decrypt error. Drop the elif branch: the stored value is preserved as-is, and we only encrypt when a new password is actually supplied. Fixes #1915 Co-authored-by: EkaTantra Dev <dev@ekatantra.com>	2026-06-03 22:59:40 +01:00

1 2 3 4 5

240 Commits