shiyao.ma added the comment:

Hi.

JUST FYI.

When a user submits an issue, the tracker splits both the title and the 
message into words. The splitting regex is the same as the one provided by RdM.

The split words are stored in the table _words. Each _words entry has 
the form (a_split_word, a_text_id).

A _text_id is the primary key of the table _textids. A _textids entry tells us 
which issue title id or message id it is associated with.

Thus, an inverted index is formed.

When a user searches, for example through the web interface, 
roundup.cgi.templating.py re-splits the query into words, searches that 
inverted index, and returns the result.
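
To make the flow above concrete, here is a minimal in-memory sketch of it. It is 
not the tracker's code: the regex, the class name InvertedIndex, the upper-casing 
of words, and the rule that every query word must match are all assumptions made 
for illustration. The two dictionaries only mimic the roles of _words and _textids.

```python
import re
from collections import defaultdict

# Assumed splitting pattern (hypothetical; the tracker's real regex may differ).
WORD_RE = re.compile(r"\b\w{2,25}\b")


def split_words(text):
    """Split text into upper-cased words using the assumed pattern."""
    return [w.upper() for w in WORD_RE.findall(text)]


class InvertedIndex:
    """Toy stand-in for the _words and _textids tables."""

    def __init__(self):
        self.textids = {}              # text_id -> (issue_id, property name)
        self.words = defaultdict(set)  # word -> set of text_ids
        self._next_id = 1

    def add_text(self, issue_id, prop, text):
        """Register one title or message and index each of its words."""
        text_id = self._next_id
        self._next_id += 1
        self.textids[text_id] = (issue_id, prop)
        for word in split_words(text):
            self.words[word].add(text_id)
        return text_id

    def search(self, query):
        """Re-split the query and return entries that contain every word."""
        words = split_words(query)
        if not words:
            return set()
        hits = set.intersection(*(self.words.get(w, set()) for w in words))
        return {self.textids[text_id] for text_id in hits}


index = InvertedIndex()
index.add_text(515, "title", "Search fails for version 3.4")
index.add_text(516, "message", "Crash in version 2.7 parser")
print(index.search("version"))
print(index.search("3.4"))
```

With this assumed pattern, a query such as "3.4" yields no words at all, because 
"." is a word boundary and the single digits on either side are too short to be 
indexed.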


My thinking is that the regex is not powerful enough. We may be able to fix this by 
excluding "." from the word boundary.
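
As a rough sketch of that idea, the snippet below compares a plain word pattern 
with a variant that keeps dot-separated runs together. Both patterns and the 
function names are hypothetical and are not taken from the tracker's source.

```python
import re

# Assumed baseline pattern: "." acts as a word boundary.
PLAIN = re.compile(r"\b\w{2,25}\b")

# Hypothetical variant: a run of word characters joined by dots is kept as
# one token (tried first); otherwise fall back to the baseline pattern.
DOTTED = re.compile(r"\b\w+(?:\.\w+)+\b|\b\w{2,25}\b")


def split_plain(text):
    """Split with the baseline pattern."""
    return PLAIN.findall(text)


def split_dotted(text):
    """Split with the dot-aware pattern."""
    return DOTTED.findall(text)


sample = "fails on Python 3.4.1 today"
print(split_plain(sample))
print(split_dotted(sample))
```

One trade-off of this variant is that any dotted name, such as a module path, 
would also be indexed as a single token, so a search for one of its parts would 
no longer match it.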

But this may be (better) achieved by other means, like specifying the "version".

----------
nosy: +introom

_______________________________________________________
PSF Meta Tracker <metatrac...@psf.upfronthosting.co.za>
<http://psf.upfronthosting.co.za/roundup/meta/issue515>
_______________________________________________________
_______________________________________________
Tracker-discuss mailing list
Tracker-discuss@python.org
https://mail.python.org/mailman/listinfo/tracker-discuss
