The documentation at http://www.sqlite.org/fts3.html#tokenizer says this
about the tokenizer:
        eligible characters are all alphanumeric characters, the "_"
        character, and all characters with UTF codepoints greater than or
        equal to 128
This suggests to me that an underscore is treated as part of a word, just
like 'normal' characters.

However, it seems words are still split at the underscore character:

CREATE VIRTUAL TABLE simple USING fts3(tokenize=simple);
INSERT INTO simple VALUES('This is a word_with_underscores');
INSERT INTO simple VALUES('This is a wordwithoutunderscores');

-- If the underscore is not a token separator this should yield no records.
SELECT * FROM simple WHERE simple MATCH 'with';
-- The first record is returned though
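
To make the difference concrete, here is a small Python sketch of the rule as I read it from the documentation, next to the behaviour the query above suggests. This is only a model and not SQLite's code: the function name tokenize and the flag underscore_is_eligible are made up for illustration.

```python
def tokenize(text, underscore_is_eligible):
    """Split text into terms made of contiguous eligible characters.

    Hypothetical model of the documented rule, not SQLite's implementation.
    Eligible: ASCII letters and digits, any codepoint >= 128, and the
    underscore only when underscore_is_eligible is True.
    """
    def eligible(ch):
        if ord(ch) >= 128:
            return True
        if ch == "_":
            return underscore_is_eligible
        return ch.isalnum()  # ASCII letters and digits at this point

    tokens, current = [], []
    for ch in text:
        if eligible(ch):
            # Fold ASCII upper case to lower case, leave other characters alone.
            current.append(ch.lower() if ord(ch) < 128 else ch)
        elif current:
            tokens.append("".join(current))
            current = []
    if current:
        tokens.append("".join(current))
    return tokens


text = "This is a word_with_underscores"
documented = tokenize(text, underscore_is_eligible=True)
observed = tokenize(text, underscore_is_eligible=False)
print(documented)  # ['this', 'is', 'a', 'word_with_underscores']
print(observed)    # ['this', 'is', 'a', 'word', 'with', 'underscores']
```

Under the documented rule, 'with' is not a term of the first row, so the MATCH should return nothing. Under the second variant it is a term, which matches what I see.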

Tested with SQLite version 3.7.2.

BTW, there is a documentation error on
http://www.sqlite.org/fts3.html#tokenizer: several example lines such as
        SELECT * FROM simple WHERE simple MATCH 'Frustrated');
have a stray closing parenthesis that should be dropped.

_______________________________________________
sqlite-users mailing list
sqlite-users@sqlite.org
http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users