odysseus

Author	SHA1	Message	Date
Paulo Victor Cordeiro	cf60a14d74	fix: capture download exit code before test consumes it (#1497) The shell pattern 'if [ $? -eq 0 ]; ... else ... echo DOWNLOAD_FAILED (exit $?)' always reports 'exit 1' because $? inside the else branch is the exit code of the [ test command, not the download. Capture into _ec first.	2026-06-03 08:12:54 +09:00
Isharak	70103d8719	fix(email): no-op IMAP connection leak in _auto_summarize_pass_single on exception (#1423 ) `_auto_summarize_pass_single` in `routes/email_pollers.py` opens a long-lived IMAP connection at line 172 and then performs ~700 lines of work — IMAP `select`/`FETCH`/`SEARCH`, network POSTs to the LLM endpoint, SQLite writes, and per-uid awaits. The only `conn.logout()` calls were on three safe paths (early `"No recent emails"`, early `"No model configured"`, and the happy path at the very end). If any exception fired between `conn` being created and the final happy path, the outer `except` block at line 921 caught it, logged, and returned — without ever calling `conn.logout()`. The IMAP socket leaked until the server's idle timeout killed it. This is the same shape as the just-merged upstream fixes #1325 (`_imap_move` in `routes/email_helpers.py`) and #1330 (`_list_emails_sync` in `routes/email_routes.py`), but in the background poller path — `_auto_summarize_poller` invokes it every 30 min, so the leak accumulates on every crashed pass instead of being a transient request-path leak. The fix is the exact try/finally pattern from #1330: 1. initialize `conn = None` before the try 2. let the try-block assign `conn = _imap_connect(...)` 3. drop the three explicit `conn.logout()` calls on safe paths 4. add a `finally:` block that calls `conn.logout()` if `conn` was set Tests in `tests/test_email_polly_imap_leak.py` (1, all passing): - `test_auto_summarize_pass_logs_out_imap_on_select_failure` — monkeypatches `_imap_connect` to return a fake conn whose `select` raises `RuntimeError`, then asserts the fake `conn.logout` was called exactly once and the function returned an `Error: ...` string. Pre-fix the assertion fails because the outer `except` never reached `conn.logout`; post-fix the `finally` block guarantees it on every exit path. 
Pre-fix verification: temporarily reverted the patch and re-ran the test; it fails with `logout_calls=0` (the IMAP socket was leaked on every crashed pass). Post-fix: `logout_calls=1`. Uniqueness: - `git log --all --oneline -S 'conn.logout' -- routes/email_pollers.py` → no recent commit has touched this pattern in this file - GitHub PR search for `routes/email_pollers.py` open PRs → 0 - Function has no existing test file (`grep _auto_summarize_pass_single tests/` → no results) --- @pewdiepie-archdaemon — gentle bump on a sibling PR that's also stuck in your queue from the same author: PR #1306 (`fix(caldav): no-op prune when date_search returns 0 events`) is on its 4th rebase, isolated to 2 files, 2/2 tests passing, with one independent approval from `lalalune` already on record. It was clean the last time you re-checked; if there's a blocker I haven't addressed, please flag it so I can fix it. Otherwise, both #1306 and this one are ready to merge. Co-authored-by: isharak7m <192635824+isharak7m@users.noreply.github.com>	2026-06-03 04:13:52 +09:00
lekt8	0ec8415f0e	Fix multi-file uploads tripping the per-IP concurrency guard (#1346 ) (#1362 ) * Stop multi-file uploads from tripping the per-IP concurrency guard The /api/upload concurrency check summed its condition over `files`, but the condition didn't reference the loop variable — so it collapsed to len(files) whenever the IP had any recent upload. A single multi-file batch sent right after another upload therefore counted itself as N concurrent uploads and hit max_concurrent_uploads (3), returning 429. The browser swallows the 429 (no `files` in the body) and sends the chat with no attachments, so the model "doesn't even see" them (issue #1346). Count genuine recent upload events instead, via a pure count_recent_uploads() helper, independent of the current batch's file count. save_upload still enforces the per-minute sliding-window rate limit per file, so throttling is preserved. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * Also reconcile the per-minute upload rate limit with the batch cap Follow-up within #1346: even after the concurrency-guard fix, a 6+ file batch still failed because save_upload() counts each file against upload_rate_limit (was 5/min) while the composer allows MAX_FILES=10 per batch — the reporter saw "5 attachments work, 6 fail". Raise the per-minute file cap to 60 so a single full batch (and a few of them) isn't self-rejected; burst abuse stays bounded by max_concurrent_uploads. Add a real 6-file regression + a config guard that the cap exceeds the frontend MAX_FILES. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 04:04:19 +09:00
Shaw	66c9349ee3	fix(skills): markdown save must not rename the skill, so delete keeps working (#1333 ) (#1365 ) POST /api/skills/{id}/markdown set sk.name = slugify(sk.name or match['name']), taking the name parsed from the edited markdown frontmatter. A changed name makes update_skill() move the skill directory on disk and re-key its usage sidecar, orphaning the original id. The UI still holds that original id, so the next DELETE /api/skills/{id} fails the name/id lookup and 404s — 'can't delete them now'. The audit save path (_apply_skill_md) already guards against exactly this with sk.name = name and an explicit 'must NEVER rename the skill' comment. Apply the same pin here: keep the stored name on markdown save (content edits still take effect; only the rename is suppressed). Drops the now-unused slugify import. Adds tests/test_skill_save_no_rename.py: saving markdown whose frontmatter renames the skill keeps the original name and applies the edit, and a subsequent delete-by-original-id succeeds. Pure unit test — calls the route handlers directly with a mock Request (no server/network), like test_skills_delete_owner.py. Co-authored-by: lalalune <shawgotbags@gmail.com>	2026-06-03 03:16:11 +09:00
Vykos	b2291fad49	Harden CalDAV credentials and URLs (#1310)	2026-06-03 02:50:02 +09:00
Paulo Victor Cordeiro	54a221b367	fix: IMAP connection leak in _list_emails_sync on exception (#1330) If any exception occurred after conn was created but before the explicit conn.logout() call, the IMAP connection leaked. Use try/finally to guarantee cleanup on all exit paths.	2026-06-03 02:44:23 +09:00
Vykos	4771d80eb2	Harden session endpoint owner scope (#1308)	2026-06-03 02:40:22 +09:00
Paulo Victor Cordeiro	4019283eba	fix: IMAP connection leak in _imap_move on store/expunge failure (#1325) If c.store() or c.expunge() raised an exception, the connection was never logged out. Use try/finally to ensure c.logout() is always called regardless of how the function exits.	2026-06-03 02:35:36 +09:00
Paulo Victor Cordeiro	97f855b40d	fix: pass owner to start_research in chat stream path (#1265) * fix: pass owner to start_research in chat stream path Research launched from the chat stream omits the owner parameter, causing those research sessions to never appear in the user's research library (which filters by owner). All other start_research call sites in this file already pass owner=_user. * test: assert all start_research calls in chat_routes pass owner Uses AST inspection to verify every start_research() call site includes the owner= keyword argument, preventing regressions where new call sites forget to scope research by user.	2026-06-03 02:32:38 +09:00
Vykos	5ee30cc144	Scope skills usage by owner (#1312)	2026-06-03 02:27:43 +09:00
Vykos	1adf21a7e5	Scope email account workflows by owner (#1309)	2026-06-03 02:21:02 +09:00
Vykos	e73545f64f	Keep Bitwarden unlock password off argv (#1311)	2026-06-03 02:13:51 +09:00
Michael Gerber	e392be0d65	fix: Cookbook local GGUF serving inside Docker (#1264) * fix: Cookbook local GGUF serving inside Docker Cookbook’s in-container GGUF serve flow had multiple Docker-specific breakages that made local llama.cpp models fail or register against the wrong endpoint. Fixes included here: - use the scanned model cache root when generating GGUF serve commands instead of hardcoding $HOME/.cache/huggingface/hub - fix malformed llama.cpp preflight build lines that generated invalid bash in serve runner scripts - preserve loopback model URLs inside Docker when the target port is already reachable from the Odysseus container, instead of rewriting them unconditionally to host.docker.internal. Before this change, Docker local serves could fail in several ways: - Cookbook pointed llama.cpp at the wrong GGUF path - generated serve runner scripts crashed before launch with a shell syntax error - successfully started in-container model servers were auto-registered as host.docker.internal: instead of localhost/127.0.0.1. This makes the Docker Cookbook path work as expected for: downloaded GGUF -> local llama.cpp serve -> endpoint registration * test: add test for docker-local endpoint rewrites	2026-06-03 02:08:09 +09:00
lekt8	adde94e430	fix: closed document stays active & leaks into new chats (#1160 ) (#1238 ) * fix: closed document no longer stays active and leaks into new chats (#1160) Closing a document tab calls _detachDocFromSession: a doc with content is PATCHed to session_id="" (unlinked, session_id -> NULL, is_active stays True), an empty one is DELETEd. But the in-memory active-document pointer (tool_implementations._active_document_id) was never cleared on either path. The chat doc-injection last-resort looks up that pointer by id and injects it when `not cand.session_id or cand.session_id == session`. An unlinked doc has session_id NULL, so the stale pointer re-surfaced a closed document in later, unrelated chats — the agent kept reading/suggesting edits to a doc the user had closed. Fix: add clear_active_document(doc_id) and call it when a document is unlinked (PATCH session_id="") or deleted, so the pointer no longer resurrects a closed document. clear_active_document only clears when the id matches (or no id), so a different active doc is left untouched. Covered by tests/test_active_document_clear.py (4 cases). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test: add route-level regression for #1160 (detach/delete clears active doc) Per review: prove the actual API path, not just the helper. Drives PATCH /api/document/{id} (session_id="") and DELETE /api/document/{id} through TestClient against a temp SQLite DB under real owner routing, and asserts get_active_document() is cleared (and untouched when a different document is closed). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test: make #1160 route regression hang-proof and dev-DB-independent The route test could hang in other environments: it set DATABASE_URL at import time, which is ignored if core.database was already imported, so it fell back to the real dev DB and could contend for its locks (maintainer saw it hang, exit 124). 
Rebind to a DEDICATED temporary SQLite engine (NullPool) and patch the document route module's SessionLocal to it via an autouse fixture — so the test never touches the dev DB and is independent of import order. Runs in ~0.3s. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test: drive #1160 route regression without TestClient (fixes local hang) The route test used Starlette TestClient (middleware app + threadpool), which hung in the maintainer's environment. Rework it to call the async route handlers directly — extracted from the router — with a minimal fake request against a temp-SQLite-patched SessionLocal. Same real coverage (handler + DB + owner routing), but it completes reliably (~0.3s) with no TestClient/threadpool. Verified the maintainer's exact batch now passes: pytest tests/test_document_close_clears_active_route.py \ tests/test_active_document_clear.py \ tests/test_document_tool_owner_scope.py -> 14 passed Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 01:47:13 +09:00
lekt8	1507d140b8	feat: CalDAV write-back — push local event create/update/delete to the remote (#800 ) (#1282 ) * feat: CalDAV write-back — push local event create/update/delete to the remote (#800) CalDAV sync was pull-only (src/caldav_sync.py), so events created, edited, or deleted in Odysseus on a CalDAV-backed calendar only changed local SQLite and never reached the server — they silently vanished on the next pull and never appeared on the user's phone (iCloud, etc.). This adds the missing write half: - src/caldav_writeback.py builds the VEVENT, re-discovers the remote calendar by the same URL-hash the local id was derived from (the remote URL isn't stored), and PUTs/DELETEs the event by UID via the caldav lib. The pure pieces (build_event_ical, find_remote_calendar, push_event) take inputs by argument so they unit-test against a fake client with no network. - create/update/delete event handlers (routes/calendar_routes.py) call it best-effort for caldav-sourced calendars only: the local DB stays the source of truth, a remote failure is logged, never fatal, and local calendars are untouched. Tests: tests/test_caldav_writeback.py (9, pure logic incl. iCal serialization, hash discovery, create/update/delete orchestration) and tests/test_caldav_writeback_route.py (3, route-level: a caldav calendar pushes, a local one does not, delete pushes a delete). 12 passed. Note: write-back re-discovers the remote calendar per write (the URL isn't persisted locally); a follow-up could cache it. Live-iCloud verification needs a real account — flagging for a maintainer pass. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test: drive #800 route regression without TestClient (fixes local hang) Same fix as the document route test: the CalDAV write-back route regression used Starlette TestClient (middleware app + threadpool) which hung in the maintainer's environment. 
Rework it to call the async create/delete calendar handlers directly — extracted from the router — with a minimal fake request, temp-SQLite-patched SessionLocal, and writeback_event stubbed to record calls. Same coverage (a caldav calendar pushes, a local one does not, delete pushes a delete), completes in ~0.3s with no TestClient. Verified the maintainer's exact batch: pytest tests/test_caldav_writeback.py tests/test_caldav_writeback_route.py -> 12 passed Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 01:44:02 +09:00
Paulo Victor Cordeiro	44e0259163	fix: fire-reminder endpoint crashes with NameError on _gcu (#1250) dispatch_reminder call on line 699 references _gcu(request) which is never defined. The local helper wrapping get_current_user is _owner. Every POST to /api/notes/fire-reminder raises NameError and returns 500.	2026-06-03 01:02:25 +09:00
red person	a901992d03	Ignore non-object vault config (#1258)	2026-06-03 00:55:04 +09:00
ghreprimand	77320b617f	Fix owner-scoped skill updates (#1240) Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-03 00:42:56 +09:00
Afonso Coutinho	35fa022e2e	fix: email pre-retrieval ignores contacts (reads non-existent email/phone keys) (#1241) * fix: match known email senders against the contact 'emails' list * fix: build contact-match snippets from emails/phones lists	2026-06-03 00:39:31 +09:00
ghreprimand	1fda906407	Fix Cookbook container-local model endpoints (#1223) Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-03 00:09:48 +09:00
lekt8	87babb58d5	fix: SSRF hardening for the custom embedding endpoint URL (#132 ) (#1206 ) POST /api/embeddings/endpoint takes a user-supplied URL and immediately makes an outbound httpx request to it with no validation. The admin gate added earlier (PR #80) closed the unauthenticated-access part of #132; this addresses the remaining request: validate the URL before fetching it. Odysseus is local-first, so pointing the embedding endpoint at a loopback or LAN server (local vLLM / llama.cpp / Ollama) is a normal setup — a blanket private-IP block would break the primary use case. So the guard: - always rejects non-HTTP(S) schemes (file://, gopher://, ftp:// …), - always rejects the link-local range (169.254.0.0/16, incl. the cloud instance-metadata 169.254.169.254 exfil vector) plus multicast / reserved / unspecified, and IPv4-mapped-IPv6 forms of the above, - keeps loopback/LAN allowed by default, and - adds EMBEDDING_BLOCK_PRIVATE_IPS=true for full SSRF lockdown on exposed multi-tenant deployments. Logic lives in src/url_safety.py (stdlib only, resolver injectable) so it is unit-testable without real DNS; the route calls it before the health-check request. Covered by tests/test_url_safety.py (8 cases). Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-02 23:46:33 +09:00
red person	42ae905df7	fix(models): clear deleted endpoint fallback refs (#1207)	2026-06-02 23:41:04 +09:00
red person	76a7685105	fix(models): clear stale speech endpoint settings (#1196)	2026-06-02 23:32:01 +09:00
spooky	f667667da3	fix: distinguish external cookbook runtimes (#1188)	2026-06-02 23:20:00 +09:00
PrabinDevkota	6b7dd4ea28	fix(auth): case-insensitive owner migration on username rename (#1183) Use func.lower() when updating SQL owner columns, match prefs keys case-insensitively, and normalize session usernames before comparing during rename. Prevents silently skipping legacy mixed-case owner data. Fixes #1165	2026-06-02 23:18:15 +09:00
Shaw	16f7feee0a	fix(hwfit): honor manual "metal" backend in the hardware simulator (#1090 ) The Cookbook's manual hardware simulator ("what if I had this setup") let users pick a backend, but _apply_manual_hardware only accepted cuda/rocm/cpu_x86/ cpu_arm and silently coerced anything else to cuda. So selecting Apple/Metal simulated a CUDA box instead — and ranked safetensors-only repos a Mac can't serve, even though the rest of hwfit (services.hwfit.fit, the serve-command generation) already supports Metal as GGUF-only via llama.cpp/Ollama. Add "metal" to the accepted backends (now a named _MANUAL_BACKENDS set, kept a subset of what fit.py understands) and set unified_memory=True for it — Apple Silicon shares one memory pool with the GPU — while clearing that flag for the discrete (cuda/rocm) and CPU backends. _apply_manual_hardware is lifted to module scope so it is directly unit-testable; both route call sites are unchanged. Adds tests/test_hwfit_manual_backend.py, including an end-to-end check that a simulated Metal box only recommends GGUF-servable models. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 23:12:34 +09:00
red person	c7ddfd7dd2	Use shared IMAP timeout for account tests (#1088)	2026-06-02 23:11:04 +09:00
Jordan Urbs	c0c1ceb36d	Treat Venice as a tool-capable SOTA cloud provider (#1173 ) Follow-up to the Venice provider PR. Wire api.venice.ai into the three host allowlists so Venice behaves like the other paid OpenAI-compatible clouds: - agent_loop: add api.venice.ai to _API_HOSTS so the agent sends native OpenAI tool-call schemas (Venice supports function calling) instead of degrading to fenced-block parsing. - teacher_escalation: add api.venice.ai to _SOTA_HOSTS so the escalation loop stays OFF for Venice (it's a paid top-tier API; no need to add teacher-model latency). - webhook_routes: add venice to KNOWN_PROVIDERS so the sync chat webhook can auto-resolve base_url from provider=venice. Tests: tests/test_venice_hosts.py pins tool-host matching + SOTA classification for Venice; py_compile on touched modules. Co-authored-by: Cursor <cursoragent@cursor.com>	2026-06-02 23:03:46 +09:00
Mayank Ukey	3799dc102f	fix: ICS export — escape X-WR-CALNAME and honour is_utc on DTSTART/DTEND (#1174 ) Two bugs in the export_ics path: 1. X-WR-CALNAME was written raw: calendar names containing commas, semicolons or backslashes produced invalid ICS (RFC 5545 §3.3.11 requires those characters to be escaped as \, \; and \\). Fix: wrap cal.name in the existing _ics_escape() helper, which is already used for SUMMARY, DESCRIPTION, and LOCATION on the lines immediately below. 2. DTSTART and DTEND on non-all-day events always emitted the naive ISO string (e.g. 20260602T100000) regardless of CalendarEvent.is_utc. Consumers treat a naive datetime as floating/local time, so UTC events imported into Google Calendar or Apple Calendar shifted by the user's timezone offset. Fix: append 'Z' when is_utc is True, matching the pattern already used by the serialise_event() helper at line 408.	2026-06-02 23:02:28 +09:00
Shaw	4e769d537c	fix(cookbook): detect llama-cpp-python via its real distribution name (#1020 ) (#1167 ) The Cookbook → Dependencies tab reported llama-cpp-python[server] as "not installed" even when it was installed and usable for serving. The local check looked up distribution metadata as pkg["name"].replace("_", "-") — for the import name `llama_cpp` that yields "llama-cpp", but the module ships in the `llama-cpp-python` distribution. importlib.metadata.version("llama-cpp") then raised PackageNotFoundError and the package was marked missing (the import itself succeeds, which is why serving still worked). Derive the distribution name from the package's declared pip spec instead (stripping [extras] and version markers), falling back to the munged import name only when no pip spec is declared. New _pip_dist_name() helper. Adds tests/test_cookbook_package_detection.py covering the llama_cpp mapping, extras/marker stripping, plain names, the no-pip-spec fallback, and that the route wires the helper in (guarding against the exact regression).	2026-06-02 22:52:37 +09:00
pewdiepie-archdaemon	ff93a6c63b	Polish email and cookbook flows	2026-06-02 22:42:07 +09:00
red person	028a39b42c	Fix local Cookbook dependency installs in venvs (#1082)	2026-06-02 22:39:02 +09:00
Afonso Coutinho	5b12bf3f55	fix: ICS export doesn't escape commas/semicolons in event fields (#1161) * fix: escape SUMMARY/LOCATION per RFC 5545 in ICS export * fix: escape commas/semicolons in ICS DESCRIPTION, not just newlines * test: ICS export escapes commas, semicolons, backslashes, newlines	2026-06-02 22:36:12 +09:00
red person	fd89d098a1	Chat: use cached endpoint model ids before probing	2026-06-02 21:00:58 +09:00
ooovenenoso	bd2fa82c1e	Cookbook: prefer ROCm for native llama.cpp bootstrap Co-authored-by: Kevin <120500656+oooindefatigable@users.noreply.github.com>	2026-06-02 20:59:44 +09:00
Robin Fröhlich	3c6ae3713e	Models: add Z.AI coding endpoint and GLM vision detection	2026-06-02 20:59:17 +09:00
SurprisedDuck	934bca9e48	Providers: omit temperature for OpenAI reasoning models * fix: omit temperature for OpenAI reasoning models (o1/o3/o4/gpt-5) These models only accept the default temperature; sending any explicit value (even 0.0) returns HTTP 400 "Only the default (1) value is supported". This broke two paths: - Endpoint probing in _probe_single_model hardcodes temperature: 0.0, so a perfectly valid o3/gpt-5 endpoint is reported as failing in the Model Endpoints health check. - Chat/stream payloads send temperature unconditionally, so a non-default temperature preset 400s on these models. The code already special-cases the same model family for max_completion_tokens, so this adds a sibling _restricts_temperature() helper and omits the field for those models, letting the API use its required default. gpt-4.5 is intentionally excluded (not a reasoning model; accepts temperature normally). Adds tests/test_llm_core_temperature.py covering the predicate and the synchronous payload builder. * fix: also omit temperature for reasoning models on the direct-POST paths The first commit only covered llm_call/llm_call_async/stream_llm and the endpoint probe. Email auto-summary, urgency-less spam classification, the email reply-summary endpoint, and gallery vision tagging build their OpenAI payloads inline and POST them directly (requests/httpx), bypassing llm_core — so a reasoning model configured there would still 400 on the temperature field. These sites already branch on _uses_max_completion_tokens, so they're the same class; added the matching _restricts_temperature guard. gallery_routes also gains the max_completion_tokens branch it was missing, so gpt-5 vision tagging works end to end. Note: email_pollers urgency scoring goes through llm_call_async and was already covered.	2026-06-02 20:58:33 +09:00
Tushar-Projects	c3228f8b59	Background tasks: respect active session model fallback	2026-06-02 20:57:42 +09:00
Georgiy	34c81e5b16	Auth: use require_user for remaining guarded routes	2026-06-02 20:55:50 +09:00
Leo	6c15dc7d33	Chat metrics: surface backend generation speed * Chat metrics: show backend's true generation t/s, not tokens÷wall-clock The per-message tokens/sec read low and felt wrong because it was computed as output_tokens / total_duration, where total_duration is wall-clock including prefill, tool calls, and network — not pure decode time. llama.cpp already reports the correct gen speed in its stream (timings.predicted_per_second), but it was being dropped. - llm_core.py: when parsing the OpenAI-compatible usage chunk, also read the sibling `timings` block llama.cpp includes — pass predicted_per_second through as gen_tps and prompt_per_second as prefill_tps on the usage event. - agent_loop.py: capture backend_gen_tps/backend_prefill_tps from usage events; in _compute_final_metrics prefer backend_gen_tps over the wall-clock division when present (fall back to computed for cloud APIs that omit timings). Tag the result with tps_source ("backend" vs "computed") and surface prefill_tps. Result: the displayed t/s now matches the model's real decode speed and is stable regardless of prompt length (a long prefill no longer deflates it). Checks: py_compile passes; verified extraction against a real llama.cpp final chunk (gen 79 t/s surfaced vs the deflated wall-clock figure shown before). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * Chat metrics: surface true t/s on the direct-chat path too Follow-up to the gen-tps work: the non-agent direct-chat stream path in chat_routes turned the raw `usage` event straight into a metrics event but only copied token counts — it never set tokens_per_second or response_time. So simple (non-tool) replies showed "Speed: n/a" / "Time: undefineds" and the chip fell back to a bare token count ("27 tok") instead of t/s. 
Map the usage event's gen_tps (llama.cpp timings.predicted_per_second, added in the prior commit) into tokens_per_second here too, tag tps_source=backend, and set response_time from wall-clock for the stats popup. Checks: py_compile passes; verified llama.cpp emits usage+timings on the final stream chunk (gen ~90 t/s) that this path consumes. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * Tests: backend gen/prefill t/s passthrough and preference Cover the two pieces of the true-t/s metric so it can be reviewed on its own: - stream_llm surfaces llama.cpp's timings.predicted_per_second / prompt_per_second as gen_tps / prefill_tps on the usage event (captured llama.cpp final-chunk fixture), and omits them when the backend reports no timings. - _compute_final_metrics prefers backend_gen_tps over output/wall-clock, tags tps_source ("backend" vs "computed"), and surfaces prefill_tps. Reuses the fake-client stream harness from test_llm_core_streaming.py. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-02 20:52:08 +09:00
ghreprimand	4cec31d988	Chat: route image sessions only to matching image endpoints Co-authored-by: ghreprimand <203024559+ghreprimand@users.noreply.github.com>	2026-06-02 20:52:03 +09:00
Shaw	db10c8d95b	Sessions: allow deleting memory-only ghost sessions A session that exists only in the in-memory SessionManager — never persisted, or whose DB row was removed out-of-band — was listed by GET /api/sessions (the list is built from the in-memory manager) but 404'd on every per-session operation, so it could never be deleted. Two causes, both fixed: 1. _verify_session_owner() only consulted the DB and raised 404 when no row existed. It now falls back to the in-memory session's owner when (and only when) a session_manager is supplied and the caller actually owns the ghost. The DB row stays authoritative when present, and a ghost owned by another user still 404s, so the ownership/security model is unchanged. The new parameter defaults to None, preserving behavior for all other callers. 2. SessionManager.delete_session() only removed the in-memory entry when a DB row was found, so memory-only ghosts survived. It now drops the in-memory copy regardless and reports success when either the DB row or the in-memory entry was removed. Added tests/test_session_ghost_delete.py covering both layers, including the cross-owner 404, the unauthenticated 403, DB-row-wins precedence, and backward compatibility when no manager is passed. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-02 20:51:26 +09:00
Yavor Ivanov	7cc8fdb2f5	Models: avoid hidden models in default fallback Both get_default_chat and _recover_empty_session_model picked the first model from cached_models[0] without checking hidden_models. If the first cached model was hidden (e.g. minimax-m3), it was returned as the default or used to repair empty session models, even though the model list endpoints already filter hidden_models. - Add _visible_models() helper that filters cached_models by hidden_models (mirrors the filtering in list_model_endpoints) - Use _visible_models() in get_default_chat fallback (when no explicit default_model is saved) - Use _visible_models() in _recover_empty_session_model (when repairing a session whose model field is empty before chat send) - Add regression tests for hidden-model filtering in default chat resolution, and unit tests for _visible_models helper	2026-06-02 20:37:14 +09:00
Tatlatat	bd78e1d5c2	Admin: wipe gallery albums with images The /api/admin/wipe/gallery branch deleted GalleryImage rows but left every GalleryAlbum row behind (GalleryAlbum wasn't even imported). After "wipe gallery" the user is left with orphaned, empty albums whose cover_id points at now-deleted images — inconsistent with the other wipe branches, which clear both parent and child tables. Delete GalleryAlbum alongside GalleryImage and include both in the returned count. Adds tests/test_admin_wipe_gallery.py: seeds a real in-memory SQLite DB with an album + image, runs the actual wipe handler, and asserts both tables are emptied. Fails before this change (albums survive).	2026-06-02 20:35:57 +09:00
SurprisedDuck	78747b56ca	Documents: strip PDF marker without corrupting text _process_pdf prepends "\n\n[PDF content]:" to extracted text, and two call sites in document_routes.py stripped it with .lstrip("\n[PDF content]:"). str.lstrip(chars) treats its argument as a set of characters, so it keeps eating into the page text that follows the marker — e.g. a body starting with "to the board" loses its leading "to" because 't'/'o' are in the marker's character set. Replace both sites with a shared strip_pdf_content_marker() helper that uses str.removeprefix.	2026-06-02 20:35:27 +09:00
Ernest Hysa	996a2027dd	Cookbook: surface pip install failures in logs _pip_install_fallback_chain silently discarded pip stderr via 2>/dev/null on every attempt. When pip failed (network error, venv mismatch, disk full), the wrapper exited 0 and the Cookbook UI showed the download as running — the silent-failure mode from #354. Extract _pip_install_attempt() which wraps each pip invocation in a bash -c subshell that captures output to a temp file, prints tail -5 on failure, cleans up, and exits with pip's real exit code. This avoids the \| tail pipefail masking (the first blocker on #363) while surfacing the last 5 lines of pip output in the tmux log so users can see what went wrong. Both local wrapper and remote SSH runner use the same helper through _pip_install_fallback_chain, so the fix is symmetric.	2026-06-02 20:34:52 +09:00
Hayk Arzumanyan	514050d098	Models: rewrite Docker loopback endpoints to host gateway In Docker, a model-endpoint URL pointing at loopback (e.g. the LM Studio default http://localhost:1234/v1) targets the Odysseus container itself, not the host running the server, so the probe gets a connection error and the endpoint is rejected with a misleading 'No models found for that provider/key'. Rewrite loopback to host.docker.internal (which compose already maps to host-gateway) for the probe and the saved URL, mirroring the existing Ollama handling. Gated on actually being in a container with the gateway reachable, so native installs and gateway-less deploys are untouched. Fixes #25 Co-authored-by: Claude <noreply@anthropic.com>	2026-06-02 20:34:40 +09:00
Tatlatat	67517eaed1	Gallery: match image endpoint URLs with exact v1 suffix The image-edit endpoint lookup compared stored vs incoming base URLs with `.rstrip("/v1")`. `str.rstrip(chars)` treats its argument as a character set, not a suffix, so any URL ending in '/', 'v', or '1' is over-stripped (e.g. `http://host1/v1` -> `http://host`). Two endpoints that are not the same can then compare equal, or the real endpoint fails to match its own stored record, leaving `api_key` unset and sending the upstream image call unauthenticated. Use `.removesuffix("/v1")` (exact-suffix removal) with surrounding `.rstrip("/")` on both sides so only a genuine trailing `/v1` is dropped. Adds a focused test that parses the actual comparison expression out of gallery_routes.py via AST and evaluates it — it fails if the fix is reverted and uses no mocking.	2026-06-02 20:34:05 +09:00
Mahdi Salmanzade	280c29d572	Security: owner-scope v1 chat endpoint fallback The sync-chat endpoint's Case 3 fallback selected a ModelEndpoint with an unscoped `query(ModelEndpoint).filter(is_enabled == True).first()` and then used that row's decrypted `api_key` for the LLM call. ModelEndpoint is a per-user resource (owner non-null = private to that user), so a chat-scoped API token for user A that sent no session and no api_key could fall back onto user B's PRIVATE endpoint — spending B's API key/quota and reaching whatever internal base_url B configured. This is the same multi-tenant owner-scoping class already fixed for the session gate on this very endpoint (_caller_owns_session) and for companion/models. Scope the fallback to the token owner's own rows plus legacy null-owner (shared) rows via the existing owner_filter helper, matching routes/model_routes.py and companion/routes.py. A null/empty owner stays a no-op, preserving single-user/legacy behaviour. Add regression tests pinning the scoped fallback (cross-owner, shared-only, no-visible-row, disabled-owned, and the legacy null-owner no-op).	2026-06-02 20:31:35 +09:00
Refuse	323f027865	Security: sanitize export and gallery filenames Co-authored-by: RefuseOdd <refuseodd@users.noreply.github.com>	2026-06-02 20:29:56 +09:00
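
Commits 4019283eba, 54a221b367 and 70103d8719 above all describe the same cleanup: initialize the connection to None, open it inside the try block, and log out in a finally block. A minimal sketch of that shape follows. `FakeConn` and `summarize_pass` are hypothetical names used only for illustration; they are not the repository's code, and the real functions do far more work between connect and logout.

```python
class FakeConn:
    """Hypothetical stand-in for an IMAP connection; counts logout() calls."""

    def __init__(self, fail_on_select=False):
        self.fail_on_select = fail_on_select
        self.logout_calls = 0

    def select(self, mailbox):
        if self.fail_on_select:
            raise RuntimeError("select failed")
        return "OK", [b"1"]

    def logout(self):
        self.logout_calls += 1


def summarize_pass(connect):
    """Run one pass; always log out if a connection was opened.

    `connect` is an injected factory (an assumption of this sketch) so the
    cleanup path can be exercised without a real mail server.
    """
    conn = None  # set before the try so the finally block can test it
    try:
        conn = connect()
        conn.select("INBOX")
        return "ok"
    except Exception as exc:
        # Mirrors the described behavior: log/return an error string.
        return f"Error: {exc}"
    finally:
        # Runs on every exit path: early return, happy path, or exception.
        if conn is not None:
            conn.logout()


failing = FakeConn(fail_on_select=True)
result = summarize_pass(lambda: failing)
```

With the failing connection, `result` is an `Error: ...` string and `failing.logout_calls` is 1, which is the property the regression test in 70103d8719 is described as asserting.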

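Commits 3799dc102f and 5b12bf3f55 above concern ICS export: TEXT values must escape backslashes, semicolons, commas and newlines, and UTC datetimes need a trailing 'Z'. The sketch below illustrates both rules under stated assumptions. `ics_escape` and `ics_datetime` are hypothetical helpers written for this example; the repository's own `_ics_escape()` may differ in detail.

```python
from datetime import datetime


def ics_escape(value: str) -> str:
    """Escape a string for use as an iCalendar TEXT value.

    The backslash is replaced first so that the backslashes added for the
    other characters are not doubled afterwards.
    """
    return (
        value.replace("\\", "\\\\")
        .replace(";", "\\;")
        .replace(",", "\\,")
        .replace("\r\n", "\\n")
        .replace("\n", "\\n")
    )


def ics_datetime(dt: datetime, is_utc: bool) -> str:
    """Format a DTSTART/DTEND value; append 'Z' only for UTC times.

    Without the suffix, consumers read the value as floating local time.
    """
    stamp = dt.strftime("%Y%m%dT%H%M%S")
    return stamp + "Z" if is_utc else stamp


calname = "X-WR-CALNAME:" + ics_escape("Work, home; misc")
dtstart = "DTSTART:" + ics_datetime(datetime(2026, 6, 2, 10, 0, 0), is_utc=True)
```

Here `calname` becomes `X-WR-CALNAME:Work\, home\; misc` and `dtstart` becomes `DTSTART:20260602T100000Z`.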

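Commits 78747b56ca and 67517eaed1 above fix the same pitfall: `str.lstrip(chars)` and `str.rstrip(chars)` treat their argument as a set of characters, not as a prefix or suffix. The sketch below contrasts the buggy and fixed forms. The function names are hypothetical and chosen for this example; `str.removeprefix` and `str.removesuffix` require Python 3.9 or later.

```python
PDF_MARKER = "\n\n[PDF content]:"


def strip_marker_buggy(text: str) -> str:
    # Strips any leading run of characters found in the argument,
    # so it keeps eating into the text that follows the marker.
    return text.lstrip("\n[PDF content]:")


def strip_marker_fixed(text: str) -> str:
    # Removes the marker only when it is an exact prefix.
    return text.removeprefix(PDF_MARKER)


def normalize_url_buggy(url: str) -> str:
    # Strips any trailing run of '/', 'v' and '1' characters.
    return url.rstrip("/v1")


def normalize_url_fixed(url: str) -> str:
    # Drops trailing slashes, then one exact '/v1' suffix, then slashes again.
    return url.rstrip("/").removesuffix("/v1").rstrip("/")


extracted = PDF_MARKER + "to the board"
buggy_text = strip_marker_buggy(extracted)
fixed_text = strip_marker_fixed(extracted)
buggy_url = normalize_url_buggy("http://host1/v1")
fixed_url = normalize_url_fixed("http://host1/v1")
```

In this sketch `fixed_text` is `"to the board"` while `buggy_text` has lost its leading characters, and `buggy_url` collapses to `"http://host"` while `fixed_url` keeps `"http://host1"`.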
190 Commits