faster-whisper runs on CTranslate2, not torch, but _get_whisper() imported torch (only to check cuda availability) inside the same try as the faster-whisper import. on a torch-less machine that raised ImportError and reported the misleading 'faster-whisper not installed' even when it was installed, so local mic transcription silently failed. probe torch separately and optionally: present -> cuda, absent -> cpu. also declare faster-whisper in requirements-optional.txt (torch stays an optional extra for gpu).
26 lines
1.3 KiB
Plaintext
26 lines
1.3 KiB
Plaintext
# Optional dependencies — install only if you use the corresponding feature.
|
|
# The app handles their absence gracefully (clear error message on first use).
|
|
#
|
|
# Note: chromadb-client + fastembed moved to requirements.txt — RAG, semantic
|
|
# memory, and tool selection are core paths, so they ship by default now.
|
|
|
|
# Local speech-to-text (microphone -> text) via faster-whisper, for the
|
|
# "local" STT provider. Runs on CPU out of the box (CTranslate2 backend, no
|
|
# torch needed). Install if you want to dictate/transcribe with the mic
|
|
# without sending audio to an external endpoint.
|
|
# Optional extra: install `torch` too if you have a CUDA GPU and want
|
|
# GPU-accelerated transcription — it's auto-detected, CPU is used otherwise.
|
|
faster-whisper
|
|
|
|
# DuckDuckGo as a search provider option.
|
|
# Install if you want DDG in the search-provider dropdown.
|
|
# Alternatives: SearXNG, Brave, Tavily, Serper, Google PSE.
|
|
duckduckgo-search
|
|
|
|
# PDF form-filling feature (fillable AcroForm detection, field extraction,
|
|
# value/annotation/signature stamping, page rendering for the form overlay).
|
|
# NOTE: PyMuPDF is AGPL-3.0. Installing it brings AGPL obligations for a
|
|
# network-served app — see ACKNOWLEDGMENTS.md. The MIT core (PDF *text*
|
|
# extraction via pypdf) works without it; this only unlocks form-filling.
|
|
PyMuPDF
|