[GENERAL] Full text search tsv column approach vs concat confusion

cen Wed, 16 Nov 2016 00:55:32 -0800

Hi

I am seeking some clarification regarding full text search across multiple tables and what the best approach is. The documentation talks about two approaches when it comes to building a document: on-the-fly concatenation of columns and a dedicated tsv column. Let's say I want to perform a search for "txt1 & txt2 & txt3" on columns table1.col1, table1.col2 and table2.col1. I see the following solutions:


1. Concat all three into a document and perform a FTS.

    SELECT * FROM (
        SELECT to_tsvector(table1.col1) || to_tsvector(table1.col2) || to_tsvector(table2.col1) AS document
        FROM table1
        LEFT JOIN table2 ON table1.table2_id = table2.id
    ) subquery
    WHERE subquery.document @@ to_tsquery(unaccent(?));

2. Create a tsv column in each table, concat the tsv columns and perform FTS on that.

    SELECT * FROM table1
    LEFT JOIN table2 ON table1.table2_id = table2.id
    WHERE table1.tsv || table2.tsv @@ to_tsquery(unaccent(?));

3. Have a tsv column only in table1 and insert table2.col1 into the tsv via triggers. Works but seems very hacky.
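
For reference, here is a minimal sketch of how the tsv columns for option #2 might be set up. The table and column names follow the example above; the 'english' configuration, the trigger names and the index names are my assumptions, and the text columns are assumed to be of type text or varchar.

```sql
-- Hypothetical setup for option #2 (names other than table1/table2/col1/col2/tsv are assumptions).
ALTER TABLE table1 ADD COLUMN tsv tsvector;
ALTER TABLE table2 ADD COLUMN tsv tsvector;

-- Backfill existing rows; coalesce keeps a NULL column from nulling the whole document.
UPDATE table1
   SET tsv = to_tsvector('english', coalesce(col1, '') || ' ' || coalesce(col2, ''));
UPDATE table2
   SET tsv = to_tsvector('english', coalesce(col1, ''));

-- Keep the tsv columns current using the built-in trigger function.
CREATE TRIGGER table1_tsv_update BEFORE INSERT OR UPDATE ON table1
    FOR EACH ROW EXECUTE PROCEDURE
    tsvector_update_trigger(tsv, 'pg_catalog.english', col1, col2);

CREATE TRIGGER table2_tsv_update BEFORE INSERT OR UPDATE ON table2
    FOR EACH ROW EXECUTE PROCEDURE
    tsvector_update_trigger(tsv, 'pg_catalog.english', col1);

-- One GIN index per tsv column.
CREATE INDEX table1_tsv_idx ON table1 USING gin (tsv);
CREATE INDEX table2_tsv_idx ON table2 USING gin (tsv);
```

Whether these per-table indexes help a predicate on the concatenated tsvs is exactly the part I am unsure about.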

It seems to me that option #2 is fast and easy to implement, but I am not sure what concatenating the tsvs really means from an index usage and performance standpoint. Option #1 is the most flexible and I'd use it all the time if it were not THAT much slower than the tsv column approach. The documentation on tsv columns states: "Another advantage is that searches will be faster, since it will not be necessary to redo the to_tsvector calls to verify index matches."
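
As I understand it, that sentence contrasts a stored tsv column with an expression index, where the document is still built on the fly but indexed. A rough single-table sketch of the expression index variant, assuming the 'english' configuration and a hypothetical index name:

```sql
-- Expression index over an on-the-fly document (single table only).
-- The two-argument form of to_tsvector is required here, because only
-- the variant with an explicit configuration can be used in an index.
CREATE INDEX table1_doc_idx ON table1 USING gin (
    to_tsvector('english', coalesce(col1, '') || ' ' || coalesce(col2, ''))
);

-- The query has to repeat the indexed expression, including the
-- configuration, for the planner to consider the index.
SELECT *
FROM table1
WHERE to_tsvector('english', coalesce(col1, '') || ' ' || coalesce(col2, ''))
      @@ to_tsquery('english', 'txt1 & txt2 & txt3');
```

This does not cover table2.col1, since an index expression cannot reference another table.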

The question is, how much faster are tsv columns really? Are there any benchmarks about this? If the performance difference is negligible, I'd advocate that using tsv columns is a waste of time and space in most general cases. But since there is no information on how much faster they are, it's hard to decide.
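
Lacking published numbers, one way to compare the two variants on real data would be something like the sketch below. It assumes the unaccent extension can be installed and that the tsv columns from option #2 already exist; the search string is just the example from above.

```sql
-- unaccent() is provided by a contrib extension.
CREATE EXTENSION IF NOT EXISTS unaccent;

-- Option #1: document built on the fly for every joined row.
EXPLAIN (ANALYZE, BUFFERS)
SELECT * FROM (
    SELECT to_tsvector(table1.col1) || to_tsvector(table1.col2) || to_tsvector(table2.col1) AS document
    FROM table1
    LEFT JOIN table2 ON table1.table2_id = table2.id
) subquery
WHERE subquery.document @@ to_tsquery(unaccent('txt1 & txt2 & txt3'));

-- Option #2: precomputed tsv columns concatenated at query time.
EXPLAIN (ANALYZE, BUFFERS)
SELECT * FROM table1
LEFT JOIN table2 ON table1.table2_id = table2.id
WHERE table1.tsv || table2.tsv @@ to_tsquery(unaccent('txt1 & txt2 & txt3'));
```

Running each statement several times and comparing the reported execution times and the chosen plan nodes should at least give a rough figure for a given data set.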



Best regards,
Klemen
