odysseus

Author	SHA1	Message	Date
Afonso Coutinho	f6f86c4b34	fix: research source extraction crashes on a non-dict finding (#1714 )	2026-06-03 13:34:40 +09:00
lekt8	126e91e8b9	Don't attempt the same (url, model) route twice in the fallback chains (#1733 ) The fallback helpers (llm_call_with_fallback, llm_call_async_with_fallback, stream_llm_with_fallback) build their candidate list as the primary target followed by the configured fallbacks. Callers prepend the session's live (url, model) to default_model_fallbacks, so if the user also lists their current model among the fallbacks — a common misconfiguration — the chain re-attempts the very route that just failed: a wasted round-trip (and, for the streaming path, a spurious 'fallback' notice for a switch that didn't actually happen). Add a small _dedupe_candidates() helper that filters malformed entries and drops a later repeat of an already-seen (url, model), preserving order (first wins, keeping its headers). Apply it in all three fallback chains. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 13:33:50 +09:00
lekt8	77614e9feb	Don't force-include the email toolset on every "tell me" query (#1707 ) (#1735 ) The agent tool-RAG force-includes a keyword hint's tools whenever any of its keywords appears in the query (word-boundary match). The email-intent hint listed "tell", which matches a huge fraction of requests — e.g. "visit <url> and tell me the title" — so the whole email toolset was force-included and crowded out the relevant tools. The model then saw a prompt dominated by email tools and reported it had no web search / could not visit the URL. Remove "tell" from the email keyword set. Genuine email intent still fires on email/mail/gmail/inbox/unread/message/send/reply. Test drives get_tools_for_query directly with retrieval stubbed (the keyword hints are deterministic, no embeddings needed): a "...tell me..." web query no longer pulls in email tools, a real email request still does. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 13:33:43 +09:00
Mubashir R	a8a5d6f56e	fix: RAG keyword fallback leaked owner-less documents across users (#1722 ) VectorRAG.search() filters with ChromaDB where={"owner": owner}, returning only documents whose owner equals the requesting user. The keyword fallback (_keyword_search_fallback, used when the primary query raises) guarded with `if doc_owner and doc_owner != owner: continue`, so a document with a missing/empty owner fell through and was returned to whichever user issued the query — a cross-user information leak on the fallback path. Match the primary path's strict filter: skip any doc whose owner != the requested owner, including owner-less docs. Co-authored-by: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-03 13:31:33 +09:00
Afonso Coutinho	ada30aa039	fix: evaluate_turn_regex crashes on a non-string agent_reply (#1723 )	2026-06-03 13:31:26 +09:00
Wes Huber	49885ff9e7	fix(documents): use strip_pdf_content_marker instead of lstrip for PDF auto-open (#1727 ) lstrip("\n[PDF content]:") treats the argument as a character set, not a prefix, so it chews into the following [Page N text]: marker — e.g. turning [Page 1 text]: into "age 1 text]:". The correct helper strip_pdf_content_marker (which uses removeprefix) already exists in the same file and is used by other call sites. Fixes #1663 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-03 13:30:04 +09:00
Ethan	0e538ecd29	Fix RAG remove_directory wiping the entire shared collection (#1660 ) (#1734 ) Removing one RAG directory destroyed the whole shared ChromaDB collection (all owners + base index) instead of just that directory's chunks. Shared root cause: PersonalDocsManager.remove_directory called rebuild_index() (delete_collection + recreate) then re-indexed only the remaining tracked dirs (ownerless, never personal_dir). The targeted VectorRAG.remove_directory that should have been used was itself broken (where={"source":{"$contains":dir}} selects nothing on scalar metadata and would over-delete siblings), and the dead do_manage_rag path fired a second unconditional rebuild. - VectorRAG.remove_directory: select chunks in Python by a path-boundary match on the stored absolute `source` (dir or dir+os.sep), abspath-normalized. Keys on `source` (always written), never `owner` -- no migration. - PersonalDocsManager.remove_directory: call the targeted remove instead of rebuild_index() + partial reindex. - do_manage_rag (dead code): drop the second rebuild_index() (hygiene). - rag_server.py add path: abspath so indexed `source` matches the remove. No schema change. Prevents future wipes (does not recover already-wiped vectors). Adds hermetic regression tests at three layers. Fixes #1660 Co-authored-by: Ethan <23321960+0xLeathery@users.noreply.github.com>	2026-06-03 13:29:51 +09:00
Ethan	b9c382006e	Clamp Anthropic temperature to [0.0, 1.0] in _build_anthropic_payload (#1737 ) Anthropic's Messages API rejects temperature > 1.0 with HTTP 400, but _build_anthropic_payload forwarded it verbatim. The shipped "Nietzsche" preset uses temperature 1.2 and the UI slider allows up to 2.0, so every Claude request under such a preset hard-broke. Clamp into [0.0, 1.0] in the Anthropic builder only (OpenAI keeps its wider 0.0-2.0 range). Covers all three Anthropic call paths, which build through this one function. None is passed through unchanged. Fixes #1615 Co-authored-by: Ethan <23321960+0xLeathery@users.noreply.github.com>	2026-06-03 13:29:36 +09:00
Afonso Coutinho	96a874c604	fix: a non-dict finding silently drops all raw research findings (#1739 )	2026-06-03 13:29:29 +09:00
Afonso Coutinho	063e7114e3	fix: youtube transcript formatter crashes on a non-dict segment (#1745 )	2026-06-03 13:29:08 +09:00
Afonso Coutinho	133948cc78	fix: uploads with _ or - in the extension become permanently unreadable (#1756 )	2026-06-03 13:28:45 +09:00
Afonso Coutinho	51857c9008	fix: chat memory extraction crashes on a non-dict message (#1749 )	2026-06-03 13:25:48 +09:00
clockworksquirrel	2625e97f11	Stop conversations crashing during compaction on tool-call turns (#1777 ) context_compactor.maybe_compact built its summary text with msg.get('content', '')[:2000], which raised TypeError: 'NoneType' object is not subscriptable on assistant turns whose content is None (turns that carried only native tool_calls). Once a conversation crossed the 85% compaction threshold — reached after only a few turns on small-context local models plus the large agent prompt — every subsequent message failed ("send more than three messages and it stops working"). Flatten message content to text first via a _content_as_text helper (str passthrough, multimodal list blocks joined, None -> "") and tolerate a missing role. Adds tests/test_context_compactor.py covering the helper and a >=4-message conversation that forces compaction with a None-content tool-call turn (fails before this change, passes after). Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 13:25:33 +09:00
Afonso Coutinho	2fa4d50115	fix: is_youtube_url crashes on a non-string url (#1752 )	2026-06-03 13:24:33 +09:00
Wes Huber	3abb735200	fix(security): scope send_to_session agent tool by owner (#1757 ) send_to_session was the only agent tool that didn't check session ownership — an agent acting for user A could read from and write into user B's session on a multi-user instance. Add owner parameter and reject access when the target session belongs to a different user, matching the pattern used by create_session, list_sessions, and manage_session. Fixes #1616 Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-06-03 13:24:08 +09:00
Afonso Coutinho	b3da01efd5	fix: ui_control rejects the advertised rag toggle (#1763 )	2026-06-03 13:24:00 +09:00
Lucas Daniel	68da800dcb	fix(agent): stop sending tool schemas to native Ollama endpoints (#1765 ) Models like gemma4, qwen3.5, and ministral served via Ollama's native /api/chat respond to OpenAI-style tool schemas by emitting a single native tool_call chunk and then stopping. The agent loop receives 1 token of round_response and no recognised ToolBlock, so the round ends immediately — the user sees a one-token response. Root cause: _is_api_model was True for any endpoint whose host appears in _API_HOSTS (which includes "host.docker.internal" and "localhost") OR whose model name matches a keyword like "gemma". Native Ollama endpoints were never excluded from this path. Fix: import _is_ollama_native_url from llm_core and treat native Ollama endpoints (/api/chat, port 11434) as text-only by default — falling back to the fenced-block tool path the local models are tuned for. The per-endpoint supports_tools=True toggle (Settings → Endpoints) still overrides this for users who have explicitly opted in. Fixes #1567	2026-06-03 13:23:42 +09:00
Afonso Coutinho	d25a860f71	fix: document tidy crashes on a duplicate with NULL timestamps (#1772 )	2026-06-03 13:23:01 +09:00
Afonso Coutinho	db1596f3b4	fix: signature learning never skips support@/info@/admin@ senders (#1773 )	2026-06-03 13:22:52 +09:00
pewdiepie-archdaemon	ed7956cbd3	Owner-scope RAG doc ids so identical chunks across users don't collide (#1738 , #1760 ) _generate_doc_id hashed only text. add_document / add_documents_batch early-return when the id exists, so the second owner indexing a byte-identical chunk hit the first owner's id, was silently dropped, and never stored under their owner — their owner-filtered search then quietly omitted it. Hash owner + text; empty owner reproduces the legacy id, so the unowned/base index keeps existing ids and isn't re-churned. Same-owner identical chunks still dedupe. Caught by #1738 and #1760 (independent reports of the same bug).	2026-06-03 11:36:31 +09:00
pewdiepie-archdaemon	9960d55a41	Decrypt CalDAV password before write-back (#1731 ) writeback_event read cfg["password"] (the encrypted blob) and passed it straight to DAVClient, so every local create/edit/delete authenticated with the literal ciphertext, the remote rejected it, and the change never reached the server — the exact silent-write-loss this module was built to prevent. The pull path src/caldav_sync.py already decrypts; mirror that. decrypt() is a no-op on legacy plaintext. Caught by #1731.	2026-06-03 11:36:12 +09:00
pewdiepie-archdaemon	6153c5ed68	Close app_api blocklist gap for bare /api/tokens and /api/users The blocklist prefixes had trailing slashes, so path.startswith() only matched /api/tokens/{id} but not /api/tokens itself — the bare GET (list) and POST (mint) endpoints were reachable via app_api. Same gap on /api/users (list/create/delete). Drop trailing slashes so both bare and sub-resource forms are blocked. /api/auth and /api/admin had no bare endpoints today but get the same treatment to prevent future drift. Caught by #1462.	2026-06-03 11:20:39 +09:00
Afonso Coutinho	aa5e3f6884	fix: is_markitdown_format crashes on a non-string path (#1618 )	2026-06-03 09:00:10 +09:00
Afonso Coutinho	fc220f760f	fix: inside_base_dir raises TypeError on a non-string path instead of failing closed (#1619 )	2026-06-03 09:00:04 +09:00
Afonso Coutinho	2d94e38d23	fix: document_actions title/content helpers crash on non-string input (#1621 )	2026-06-03 08:59:55 +09:00
Afonso Coutinho	03ddc5d2c4	fix: check_outbound_url crashes on a truthy non-string URL (#1623 )	2026-06-03 08:59:49 +09:00
Afonso Coutinho	3175d7ca21	fix: tool-block parsing crashes on a non-string input (#1628 )	2026-06-03 08:59:42 +09:00
Afonso Coutinho	d818117d4c	fix: _extract_skill_json crashes on a truthy non-string teacher response (#1630 )	2026-06-03 08:59:36 +09:00
Afonso Coutinho	8783f12c4c	fix: builtin_actions heuristics crash on a truthy non-string input (#1639 )	2026-06-03 08:59:16 +09:00
Afonso Coutinho	82c09dd768	fix: split_chunks emits a duplicate trailing chunk for text over size-overlap (#1573 )	2026-06-03 08:57:54 +09:00
red person	8051e25c65	Reject CalDAV writeback events without uid (#1582 )	2026-06-03 08:57:15 +09:00
red person	f39c87561b	Save only string personal doc paths (#1566 )	2026-06-03 08:37:29 +09:00
Afonso Coutinho	382d49d887	fix: validate_caldav_url crashes with TypeError on a non-string URL (#1608 )	2026-06-03 08:35:16 +09:00
lekt8	1f743970dd	Don't lose deep-research findings when synthesis times out (#1551 ) (#1562 ) Two problems made deep research report "No information could be gathered" even after it had extracted findings, on slow local models (reporter served a 20B via LM Studio): - _synthesize hard-capped its LLM call at timeout=60, while extraction uses the user's extraction_timeout (300s here) and the final report uses 180s. The slow model needed >60s to synthesize the round's findings, so synthesis timed out after 3 attempts. Raised it to 180s to match the final-report call. - When synthesis produced no report (it returns the unchanged, still-empty report on failure during round 1), the run hit `if not report: return "No information could be gathered…"` and discarded the findings it had already gathered. Now it falls back to a compiled report built from those findings (_fallback_report) so the user keeps the gathered material. Tests stub the LLM (no live model/DB), pin the synthesis timeout >= 180, that the fallback surfaces the findings rather than the give-up message, and that a failed synthesis preserves the previous report. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 08:11:44 +09:00
Afonso Coutinho	c9361262df	fix: APIKeyManager.load crashes app startup on a corrupt/wrong-shape api_keys.json (#1565 )	2026-06-03 08:11:37 +09:00
lekt8	583df3dd6a	Recognize gemma3/llama4/mistral-small3.1+/multimodal as vision models (#1430 ) is_vision_model() classified several genuinely multimodal families as text-only because their names contain neither "vision" nor "vl": Gemma 3 (4b+), Llama 4, Mistral Small 3.1/3.2, and *-multimodal models (e.g. phi-4-multimodal). For those the attached image was stripped before the request, so the model never saw it — a "can't read the image" report (issue #1274), common with Ollama tags like gemma3:4b. Add those keywords (plus a generic "multimodal"). Per the file's err-toward-True policy (#124), a rare text-only tag treated as vision is the safer failure than dropping a real image. Guard tests confirm the text-only siblings (gemma2, plain gemma, mistral-small, phi-3) are not over-matched. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 04:17:40 +09:00
lekt8	0ec8415f0e	Fix multi-file uploads tripping the per-IP concurrency guard (#1346 ) (#1362 ) * Stop multi-file uploads from tripping the per-IP concurrency guard The /api/upload concurrency check summed its condition over `files`, but the condition didn't reference the loop variable — so it collapsed to len(files) whenever the IP had any recent upload. A single multi-file batch sent right after another upload therefore counted itself as N concurrent uploads and hit max_concurrent_uploads (3), returning 429. The browser swallows the 429 (no `files` in the body) and sends the chat with no attachments, so the model "doesn't even see" them (issue #1346). Count genuine recent upload events instead, via a pure count_recent_uploads() helper, independent of the current batch's file count. save_upload still enforces the per-minute sliding-window rate limit per file, so throttling is preserved. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * Also reconcile the per-minute upload rate limit with the batch cap Follow-up within #1346: even after the concurrency-guard fix, a 6+ file batch still failed because save_upload() counts each file against upload_rate_limit (was 5/min) while the composer allows MAX_FILES=10 per batch — the reporter saw "5 attachments work, 6 fail". Raise the per-minute file cap to 60 so a single full batch (and a few of them) isn't self-rejected; burst abuse stays bounded by max_concurrent_uploads. Add a real 6-file regression + a config guard that the cap exceeds the frontend MAX_FILES. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 04:04:19 +09:00
red person	fd37ccebae	Ignore invalid personal docs state (#1401 )	2026-06-03 04:02:16 +09:00
red person	35c40bce75	Fall back from invalid settings stores (#1416 )	2026-06-03 03:53:05 +09:00
Paulo Victor Cordeiro	1f2a06facd	fix: MCP reconnect via tool passes only server_id to connect_server (#1385 ) * fix: MCP reconnect via tool passes only server_id to connect_server connect_server requires name, transport, command, args, env, and url but the reconnect path in do_manage_mcp only passed the server_id, causing a TypeError on every reconnect attempt. Mirror the pattern used in mcp_routes.py reconnect_server. * test: verify MCP reconnect passes full server config to connect_server Mocks the MCP manager and DB to assert that do_manage_mcp reconnect passes name, transport, command, args, env, and url — not just the server_id.	2026-06-03 03:46:07 +09:00
lekt8	b6843c7621	Route "read that report" to manage_research instead of the HTML render (#1375 ) After a deep-research job completes, a follow-up like "check it out" / "read that report" had the agent web_fetch the /api/research/report/{id} HTML render (and then drift into unrelated searches) instead of reading the saved report (issue #1363). The report text is already available via the manage_research tool (action read), and action list returns ids most-recent-first, so the agent can resolve "the recent report" itself. Strengthen the manage_research instructions: read a finished report via action list -> action read; do NOT web_fetch/app_api the report URL (it renders HTML, not clean text) and do NOT start a fresh web_search just to read an existing report. Annotate the app_api endpoint list to say the same. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 03:24:09 +09:00
Paulo Victor Cordeiro	c3fd969965	fix: once-schedule comparison uses local time against UTC date (#1349 ) When a timezone is configured, `now` is tz-aware local time. The comparison stripped tzinfo with `.replace(tzinfo=None)`, producing naive local time, but `scheduled_date` is stored as naive UTC. For users east of UTC this causes tasks to appear expired prematurely; for users west they linger past due time. Use `_to_utc_naive(now)` to convert to the same reference frame.	2026-06-03 03:07:00 +09:00
lekt8	ce7f5dbbdd	Inject current date into deep research planning and query prompts (#1347 ) Deep research generated search queries from the LLM's training-cutoff knowledge, so it emitted stale-year queries like "best Python tutorials 2025" when the actual year is later (issue #1341). The chat/agent path already grounds the model with "Today is ..." (src/agent_loop.py); the deep research planning and query-generation prompts had no equivalent. Add a small current_date_context() helper and prepend it at the plan and query-generation prompt sites (and the research_handler plan preview path that reuses RESEARCH_PLAN_PROMPT). System-TZ local, portable strftime. Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 03:00:52 +09:00
Vykos	b2291fad49	Harden CalDAV credentials and URLs (#1310 )	2026-06-03 02:50:02 +09:00
Aaran Lawing	56656de5bc	fix: RRULE added to schema (#1322 ) * fix: RRULE added to schema * Update tool_schemas.py	2026-06-03 02:47:14 +09:00
Vykos	4771d80eb2	Harden session endpoint owner scope (#1308 )	2026-06-03 02:40:22 +09:00
lekt8	80de69ebb0	feat: document rrule in the manage_calendar tool schema (#1320 ) (#1324 ) * feat: document rrule in the manage_calendar tool schema (#1320) The create_event handler already persists `rrule` (a single event carrying an iCalendar RRULE), but the manage_calendar tool schema didn't list it, so the agent had no documented way to make a recurring event and took a roundabout path. Add `rrule?` to the create_event field list with examples (FREQ=WEEKLY;BYDAY=MO etc.) and an explicit note to create ONE event with the rule rather than looping. Covered by tests/test_calendar_rrule.py: do_manage_calendar create_event with an rrule stores one event with that recurrence; without it, the event is single. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * test: restore SessionLocal via monkeypatch in #1320 rrule test (review) Per review: the test patched core.database.SessionLocal at module import and never restored it, which could leak the temp DB into later tests in the same process. Move the patch into an autouse monkeypatch fixture so it is restored after each test. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>	2026-06-03 02:37:45 +09:00
Vykos	5ee30cc144	Scope skills usage by owner (#1312 )	2026-06-03 02:27:43 +09:00
Vykos	e73545f64f	Keep Bitwarden unlock password off argv (#1311 )	2026-06-03 02:13:51 +09:00
Afonso Coutinho	a8395b4e4c	fix: agent_input_token_budget wrongly treated as a secret and unsettable from chat (#1294 ) * fix: don't classify agent_input_token_budget as a secret (token must be a suffix) * test: agent_input_token_budget is settable from chat	2026-06-03 01:53:47 +09:00

1 2 3 4

180 Commits