https://bugs.kde.org/show_bug.cgi?id=413694

--- Comment #8 from Joachim Wagner <jwag...@computing.dcu.ie> ---
Re the idea of recording the sha256 of each file, this may be problematic for
large files with only a small content area such as meta data and subtitles of a
video. Still, reading excessive amounts of data can be preferable over writing
excessive amounts of index data. A solution may be to require the content
indexer modules to support returning a content fingerprint, with the default
implementation running the normal content extraction and calculating a
fingerprint over the extracted content. File-format-specific implementations
can skip some processing steps such as decompression of a data stream and
character set conversion.

-- 
You are receiving this mail because:
You are watching all bug changes.

Reply via email to