Filter out malformed, non-string research summaries explicitly instead of letting the broad exception path classify them as usable. Adds regression coverage.

* fix: drop over-broad 'cookie'/'copyright' low-quality markers
* fix: detect cookie/copyright boilerplate via phrases, not bare words
* test: keep research findings that merely mention cookies or copyright
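
The behavior described above can be sketched roughly as follows. This is an illustrative sketch, not the code from this change: the function name `is_usable_summary` and the phrase list `BOILERPLATE_PHRASES` are hypothetical, and the actual markers may differ.

```python
# Hypothetical phrase list for illustration only; the real markers may differ.
BOILERPLATE_PHRASES = (
    "we use cookies",
    "accept all cookies",
    "cookie policy",
    "all rights reserved",
)


def is_usable_summary(summary: object) -> bool:
    """Return True if the summary looks like real research content."""
    # Reject non-strings up front instead of relying on a broad
    # `except Exception` path that might treat them as usable.
    if not isinstance(summary, str):
        return False

    # Normalize case and whitespace so phrase matching is stable.
    text = " ".join(summary.lower().split())
    if not text:
        return False

    # Match whole boilerplate phrases, not bare words such as
    # "cookie" or "copyright", which also occur in legitimate findings.
    return not any(phrase in text for phrase in BOILERPLATE_PHRASES)
```

With this shape, a finding such as "Study of cookie consent banners and copyright law reform" is kept, while "We use cookies to improve your experience" is filtered.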