Re: pg_trgm vs. Solr ngram

2023-02-11 Thread Christian Ramseyer (mx04)
On 10.02.23 04:48, Laurenz Albe wrote: On Fri, 2023-02-10 at 03:20 +0100, Chris wrote: In Solr I was using ngrams and customized the TokenizerFactories until more or less only whitespace acted as a separator, while [.:-_\d] remains part of the ngrams. This allows searching for ".12.255/32" or "xzy-e

Re: pg_trgm vs. Solr ngram

2023-02-09 Thread Bertrand Mamasam
On Fri, 10 Feb 2023, 03:20, Chris wrote: > Hello list > > I'm pondering migrating an FTS application from Solr to Postgres, just > because we use Postgres for everything else. > > The application is basically fgrep with a web frontend. However the > indexed documents are very computer networ

Re: pg_trgm vs. Solr ngram

2023-02-09 Thread Tom Lane
Chris writes: > Or maybe hacking my own pg_trgm wouldn't be so hard and could be fun, do > I pretty much just need to change the emitted tokens or will this lead > to significant complications in the operators, indexes etc.? See KEEPONLYALNUM in pg_trgm/trgm.h ... Now, using a custom-modified
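[Editor's illustration, not part of the original message.] The question of "changing the emitted tokens" is easier to picture with a small sketch. The Python below is not pg_trgm code; it only mimics the documented behaviour that pg_trgm lowercases text, splits it into words, and pads each word with two leading spaces and one trailing space before cutting trigrams. The function name `trigrams` and the `keep_only_alnum` switch are hypothetical, chosen to mirror what the KEEPONLYALNUM define appears to control (alphanumeric-only words versus whitespace-delimited words); the sample string "host-01" is made up for the example.

```python
import re


def trigrams(text, keep_only_alnum=True):
    """Return the set of padded trigrams for text (illustrative only).

    keep_only_alnum=True  : words are runs of alphanumeric characters,
                            so '.', '-', '/', ':' act as separators.
    keep_only_alnum=False : words are runs of non-whitespace characters,
                            so punctuation stays inside the trigrams.
    """
    pattern = r"[^\W_]+" if keep_only_alnum else r"\S+"
    grams = set()
    for word in re.findall(pattern, text.lower()):
        # Pad with two leading spaces and one trailing space.
        padded = "  " + word + " "
        grams.update(padded[i:i + 3] for i in range(len(padded) - 2))
    return grams


if __name__ == "__main__":
    for sample in (".12.255/32", "host-01"):
        print(sample)
        print("  alnum words only:", sorted(trigrams(sample)))
        print("  whitespace only :", sorted(trigrams(sample, keep_only_alnum=False)))
```

With alphanumeric-only words, ".12.255/32" falls apart into the words "12", "255" and "32", so no trigram contains the dots or the slash. With whitespace-only splitting the whole string is one word and trigrams such as "2.2" survive, which is the behaviour described for the Solr setup.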

Re: pg_trgm vs. Solr ngram

2023-02-09 Thread Laurenz Albe
On Fri, 2023-02-10 at 03:20 +0100, Chris wrote: > I'm pondering migrating an FTS application from Solr to Postgres, just > because we use Postgres for everything else. > > The application is basically fgrep with a web frontend. However the > indexed documents are very computer network specific a

pg_trgm vs. Solr ngram

2023-02-09 Thread Chris
Hello list I'm pondering migrating an FTS application from Solr to Postgres, just because we use Postgres for everything else. The application is basically fgrep with a web frontend. However the indexed documents are very computer network specific and contain a lot of hyphenated hostnames wi