Re: [TOKYO Lucene/Solr Meetup] Lecture by Alessandro Benedetti and Mingchun Zhao

2024-10-15 Thread Alessandro Benedetti
Hi, I was talking with Kaya (who reads in cc) and it seems it's not possible to register to attend physically? For Guillaume, my part is on youtube already: https://www.youtube.com/watch?v=8x2cbT5CCEM Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache L

Re: Vector search highlightings

2024-10-04 Thread Alessandro Benedetti
We have developed a Neural Highlighting plugin already: https://sease.io/2023/05/apache-solr-neural-highlighting-plugin.html We are not planning to donate it yet but if we get some sponsor we may make it available sooner rather than later! Cheers On Fri, 4 Oct 2024, 19:49 Robi Petersen, wrote:

Re: schemaVersion 1.7 docValues

2024-09-18 Thread Alessandro Benedetti
That's correct, I haven't checked the internals for this one, but as far as I know, it only applies to places where docValues is not explicitly set. I agree with your understanding. Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr

Re: 如何使用DenseVectorField

2024-07-23 Thread Alessandro Benedetti
Hi, Apache Solr only supports fixed-length vectors. So you need to define the cardinality for your field and that's it. You can't pass variable length vectors. Each document, for the field, needs to have that vector dimension. Cheers -- *Alessandro Benedetti

Re: Terms query with filters

2024-07-23 Thread Alessandro Benedetti
are not a problem per se. Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <ht

Re: Reranking solr results using cross encoders

2024-06-11 Thread Alessandro Benedetti
Hi Sergio, Thay's not possible yet. It's among the contributions I want to make but there's no concrete plan yet. Cheers On Tue, 11 Jun 2024, 17:38 Sergio García Maroto, wrote: > Hi, > > Is it actually possible to rerank using cross encoders in solr? > > I am trying to find something around but

Re: Performance Suggestion for Dense Vectors

2024-05-31 Thread Alessandro Benedetti
Cool! I missed that! I'll make sure to align with my digital marketing manager to make her add all our Solr-related external posts! Good to see this live! ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e

[Community Over Code EU] Bratislava, June 3-5, 2024

2024-05-17 Thread Alessandro Benedetti
-- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <https://linkedin.co

Re: solr9.5 dense vector search using "Double" type instead of "Float"

2024-05-13 Thread Alessandro Benedetti
A byte can be used to represent 2^8 different values. Apache Solr supports each element b as a byte (-128 to 127). A double in Java needs 8 bytes, so you won't be able to encode a double in a single byte lossless. Cheers ------ *Alessandro Benedetti* Director @ Seas

Re: (Gen)AI-Support for searching with Solr/Lucene

2024-04-25 Thread Alessandro Benedetti
Hi Walter, We've been doing many AI integrations with Solr and we drafted a roadmap to get some funding to implement it directly in Solr: https://sease.io/2023/10/apache-lucene-solr-ai-roadmap-do-you-want-to-make-it-happen.html We made little progress so far but hopefully will attract some more a

Re: Performance Suggestion for Dense Vectors

2024-03-29 Thread Alessandro Benedetti
is a possibility in many blogs. We've done that as well with an initiative in collaboration with the University of Padua. Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.i

Re: Performance Suggestion for Dense Vectors

2024-03-28 Thread Alessandro Benedetti
just this sort of mail message are welcome. Let me reply in line: -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Sourc

Re: Migrating from solr8.11.3 to solr 9.5.0

2024-03-13 Thread Alessandro Benedetti
;s no need to start from the new default, but also I strongly don't recommend copying and pasting the old solrconfig.xml and schema because stuff may not work as expected or something could silently stop working. Cheers -- *Alessandro Benedetti* Director @ Sease Lt

Re: Bury search results with bq

2024-03-13 Thread Alessandro Benedetti
thing) -> never tested (at least recently) 2) you use a constant score field:terrible^=-100 -> never tested Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* -

Re: Block MAX WAND feature use

2024-02-05 Thread Alessandro Benedetti
It would be interesting to see the level pf fragmentation of each index indeed... I.e. How many segments per core, in a collection On Tue, 6 Feb 2024, 06:59 Mikhail Khludnev, wrote: > 200-300 docs might be too few to get significant gain. With a 400M index > it's worth experimenting with skippin

Re: Use the solr similarity as a recomender

2024-02-02 Thread Alessandro Benedetti
t the More Like This seems what you want: https://solr.apache.org/guide/solr/latest/query-guide/morelikethis.html Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Inform

Re: Nested documents limitations and cautions

2024-01-31 Thread Alessandro Benedetti
collection. Possibly there have been other changes, I remember some stuff in Solr 9.x by Mikhail, but to list all of them in a nice report I should spend some time doing the proper homework. Hope his helps! Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache

Re: Limiting Query Result - Solr Version 8.11.2

2024-01-30 Thread Alessandro Benedetti
I am not sure I get the question, but can't you add a check in the search-API layer that says: int maxAllowedRows = 50; if (rows > maxAllowedRows){ rows = maxAllowedRows } Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apa

Re: LTR model upload API issue

2024-01-29 Thread Alessandro Benedetti
change for Learning To Rank only. Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <ht

Re: Dense vector search distance calculation

2024-01-29 Thread Alessandro Benedetti
This class is probably your best bet: org.apache.lucene.index.VectorSimilarityFunction >From here you can explore the java code used to calculate the vector similarities in Apache Solr. Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Commit

Re: Solr 9.4 - Help regarding vector search min Similarity threshold with knn parser

2024-01-29 Thread Alessandro Benedetti
in line: -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn

Re: Solr 9.4 - Help regarding vector search min Similarity threshold with knn parser

2024-01-26 Thread Alessandro Benedetti
-apache-solr.html Cheers ---------- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <http

Re: ML Model Management in Solr

2024-01-24 Thread Alessandro Benedetti
hat, the integrations you mentioned are on my team roadmap: https://sease.io/2023/10/apache-lucene-solr-ai-roadmap-do-you-want-to-make-it-happen.html But at the moment we are waiting for sponsors to make it happen. Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *A

Re: SolrCloud availability when a server is down

2024-01-10 Thread Alessandro Benedetti
input). In that SolrJ class you see both indexing/querying approaches. Hope this helps, Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Ap

Re: Dense Vector - Similarity Function

2024-01-02 Thread Alessandro Benedetti
to contribute but we lack funding at the moment ( https://sease.io/2023/10/apache-lucene-solr-ai-roadmap-do-you-want-to-make-it-happen.html ). Of course, you are also welcome to contribute it yourself, as a community we welcome new contributors. Cheers ------ *Alessandro Benedet

Re: Dense Vector - Similarity Function

2024-01-02 Thread Alessandro Benedetti
won't be easy at all. Cheers ---------- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http:

Re: [TOKYO Lucene/Solr meetup] ~ How to Integrate Open Source LargeLanguage with Apache Solr (Alessandro) ~

2023-12-04 Thread Alessandro Benedetti
ndows > > From: Moriyasu Kannami > Sent: Monday, December 4, 2023 12:58 PM > To: Mikhail Khludnev > Cc: Alessandro Benedetti; users@solr.apache.org; Koji Sekiguchi > Subject: Re: [TOKYO Lucene/Solr meetup] ~ How to Integrate Open Source > LargeLanguage with Apache Solr (Alessandro) ~ &

Re: [TOKYO Lucene/Solr meetup] ~ How to Integrate Open Source Large Language with Apache Solr (Alessandro) ~

2023-11-23 Thread Alessandro Benedetti
Thanks, Moriyasu for the heads up! I'll advertise it on my LinkedIn soon and on the Apache Solr Twitter account! Also contacted some of the Japanese people I know to get better engagement :) Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene

A new Forum for Apache Solr discussions

2023-11-10 Thread Alessandro Benedetti
perts there(me included), open to discussion! Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source We

Re: Using expressions in sort and json stats

2023-11-01 Thread Alessandro Benedetti
I suspect this is an XY problem, can you elaborate better what's the original task to be solved? It's unusual to use those sort expressions in Solr. Maybe we can solve it differently, at indexing time or maybe using some ranking models (like a linear one on learning to rank, but using not learned

Re: Help Solr Newsletter October 2023 with links, blogs, articles

2023-10-26 Thread Alessandro Benedetti
Plenty of cool stuff! I'll have our digital marketing manager @Lisa Biella who reads in copy to add our part as well! Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *

Re: Num Found of docs containing vector fields

2023-10-22 Thread Alessandro Benedetti
Hi, Using the knn query parser is not going to cut it as it's driven by the topK parameter (and you do all distance similarity and stuff). Normally in Solr you just do: field:[* TO *] to retrieve docs that have at least one value in a field. I can't try right now, can you check if that's implement

Re: knn query parser, number of results and filtering by score

2023-10-20 Thread Alessandro Benedetti
e the ticket and contribute a fix if you like, happy to review it! ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source We

Re: knn query parser, number of results and filtering by score

2023-10-18 Thread Alessandro Benedetti
Let's distinguish if you are talking about a "debug" bug (the explain message) or a functional bug. ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Inform

Re: knn query parser, number of results and filtering by score

2023-10-18 Thread Alessandro Benedetti
-- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn

Re: knn query parser, number of results and filtering by score

2023-10-17 Thread Alessandro Benedetti
Sometimes you have big gaps in score and still, the document is relevant. Long story short, I would need to study your case and think on a solution, nothing easy is out there. Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr

Re: knn query parser, number of results and filtering by score

2023-10-17 Thread Alessandro Benedetti
Hi Mirko, the topK is per shard. Then shards * k results are aggregated. Does it make sense? In regards to the debugging, it seems a bug, they all should be with a score and within top-k -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache

Re: knn query parser, number of results and filtering by score

2023-10-17 Thread Alessandro Benedetti
What's your full Solr query? Are you on SolrCloud or single Solr node? ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Tra

Re: Omit feature names in Learning to Rank logging

2023-10-06 Thread Alessandro Benedetti
the draft pull request. Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <h

Re: luceneMatchVersion in Solr9.2.1

2023-08-31 Thread Alessandro Benedetti
Back in the day I also wrote a blog post about this param: https://sease.io/2021/04/lucenematchversion-in-apache-solr.html Hope it helps! Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede

Re: luceneMatchVersion in Solr9.2.1

2023-08-30 Thread Alessandro Benedetti
Hi, Since Apache Solr and Apache Lucene are not the same project, the lucene and solr version are not aligned anymore. So please use the correct Lucene version included in the Solr release you are using. Cheers On Wed, 30 Aug 2023, 21:26 Oakley, Craig (NIH/NLM/NCBI) [C], wrote: > In previous u

Re: Join and Sort between 2 collections in Solr 8.9.0 cloud mode

2023-08-11 Thread Alessandro Benedetti
I don't know if it was the merit of this mailing list, but the bad debt has been paid right now, you can ignore my previous comment now! ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sea

Re: Join and Sort between 2 collections in Solr 8.9.0 cloud mode

2023-08-08 Thread Alessandro Benedetti
As a community how should we behave with gentlemen that didn't pay their suppliers for their services and then come here? As an Apache Lucene/Solr committer I very commonly use the funds I receive from my clients (for my services) to sponsor my and my colleague's contributions. This company has a

Re: knn parser not working as expected

2023-08-02 Thread Alessandro Benedetti
Can you post the debug section related to query parsing? You can get it using 'debug=query'. What's your select request handler configuration? Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e

Re: LTR Features on nested documents

2023-07-27 Thread Alessandro Benedetti
t for reranking. So you can't calculate features on children (which are anyway separate docs). It could be a nice contribution though, if you want to work on this, ahppy to review! Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apach

Re: [ANNOUNCE] Apache Solr 9.3.0 released

2023-07-21 Thread Alessandro Benedetti
Thanks Houston and everyone involved, for the release! On Fri, 21 Jul 2023, 21:56 Houston Putman, wrote: > The Solr PMC is pleased to announce the release of Apache Solr 9.3.0. > > Solr is the popular, blazing fast, open source NoSQL search platform from > the Apache Solr project. Its major fea

Re: Negative Query not working when use brackets in Solr 8.9.0

2023-07-10 Thread Alessandro Benedetti
anding if the the proper records are coming or not (if you don't know, who can? :)) Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied

Re: Leading zero were automatically removed from int data type field in solr so how to overcome this issue without using string data type

2023-07-04 Thread Alessandro Benedetti
tored value, a copy field if you like, and keep the original indexed as an integer to do math search operations on. " -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information

Re: Leading zero were automatically removed from int data type field in solr so how to overcome this issue without using string data type

2023-07-03 Thread Alessandro Benedetti
As an integer 0045 or 045 or 45 is exactly the same value in the decimal positional format. I agree with Charlie, if you need it to be a string, then use a string. You can potentially just use the string as a stored value, a copy field if you like and keep the original indexed as an in

Re: Compiling Apache Solr Resources

2023-06-02 Thread Alessandro Benedetti
very useful! Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <https://l

Re: Streaming expr as LTR feature

2023-05-31 Thread Alessandro Benedetti
been tested, it may work but there's no dedicated design for that so it may be quite clunky and expensive. And by the way, Images are not visible in the mailing list. Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC

Re: Vector Search Parser(!vp) : calculate cosine

2023-05-26 Thread Alessandro Benedetti
hmark.html Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/>

Re: Vector as LTR Field Value Feature Type

2023-05-22 Thread Alessandro Benedetti
-- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <https://linkedin.co

Re: Created a Comparison Video between Generative AI Chat and Dense Vector Search

2023-05-19 Thread Alessandro Benedetti
to do the same, thanks for your effort! -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <ht

Re: Skip applying boost queries

2023-05-04 Thread Alessandro Benedetti
-- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <https://linkedin.com/comp

Re: [TOKYO Lucene/Solr meet-up] ~ Neural search: Fine-tuning a Multi-language model & Product retrieval based on online reviews ~

2023-04-21 Thread Alessandro Benedetti
Great Mingchun! I am so happy of the various applications our neural search contribution has enabled! I'll try to attend remotely (but it's quite late in europe) and I wish you the best for the meetup! Cheers On Fri, 21 Apr 2023, 11:33 Mingchun Zhao, wrote: > Hi everyone! > > We, the Solr commu

Re: How to use MorelikeThis with duplicates

2023-04-12 Thread Alessandro Benedetti
Following up on Mikhail good insights, I would probably recommend using the More Like This Query Parser followed by grouping/field collapsing on a field. It should solve your problem! If your requirements are more advanced feel free to let us know! Cheers -- *Alessandro

Re: SynonymGraphFilterFactory with WordDelimiterGraphFilterFactory usage

2023-04-12 Thread Alessandro Benedetti
Same here, I wanted to add a colleague of mine to the discussion but possibly I have done it wrong, apologies! We'll add more info soon if relevant to the community! ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Membe

Re: Problem with "sow" and WordGraphDelimeter

2023-04-12 Thread Alessandro Benedetti
My bad, I was adding a colleague of mine to the discussion, but possibly done in the wrong way! We are observing some problems in mixing up those two token filters, we may update the mail thread in the next few days! Cheers -- *Alessandro Benedetti* Director @ Sease Ltd

Re: SynonymGraphFilterFactory with WordDelimiterGraphFilterFactory usage

2023-04-12 Thread Alessandro Benedetti
FYI -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn

Re: Problem with "sow" and WordGraphDelimeter

2023-04-12 Thread Alessandro Benedetti
FYI -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn

Re: [ANNOUNCE] Apache Solr 9.2.0 released

2023-03-28 Thread Alessandro Benedetti
Well done Houston and everyone involved in this release! -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source

Re: Apache Solr Neural Search Training

2023-03-10 Thread Alessandro Benedetti
> > > > On Mar 10, 2023, at 7:48 AM, Alessandro Benedetti < > a.benede...@sease.io> > > wrote: > > > > > > Given it's almost time for an upcoming live training of ours, I take > the > > > occasion for a bit of self-promoti

Re: Embeddings for Dense Vector Search

2023-03-10 Thread Alessandro Benedetti
We've been using mostly contextual vectors (BERT based models) and working in english (vast majority of our projects) and italian. We've been using word2vec embeddings for a Lucene contribution for machine-learned synonyms. Cheers On Thu, 9 Mar 2023, 16:03 Daniel Benito, wrote: > Hi everyone,

Apache Solr Neural Search Training

2023-03-10 Thread Alessandro Benedetti
Given it's almost time for an upcoming live training of ours, I take the occasion for a bit of self-promotion :) The 16th of March we host the Neural Search training for Apache Solr: https://sease.io/information-retrieval-mini-training-2/end-to-end-apachesolr-neural-search-tutorial If you are in

Re: Inconsistent Solr Learning to Rank Feature Store State - SolrCloud / 7.7

2023-03-06 Thread Alessandro Benedetti
Doug, you are always too kind :) I'm glad our blog post helped! We have an additional contribution in progress that should be ready soon: https://issues.apache.org/jira/browse/SOLR-15493 Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Comm

Re: using dense vector search with Solr

2023-03-06 Thread Alessandro Benedetti
l Solr Jira and GitHub for news! Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io &

Re: KNN HNSW - performance over time with document updates

2023-03-06 Thread Alessandro Benedetti
the K in top-K, any better? There are many points where this sort of search may go wrong, it may be Solr's fault or not :) Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.i

Re: Query time

2023-02-08 Thread Alessandro Benedetti
/issues.apache.org/jira/browse/SOLR-9882) Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: S

Re: [ANNOUNCE] Apache Solr 9.1.1 released

2023-01-25 Thread Alessandro Benedetti
Thanks, Michael for leading this and everyone involved! Great bug-fixing release! -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting

Re: Vector/array variable as input for KNN query with Streaming Expressions

2023-01-18 Thread Alessandro Benedetti
Hi, during the implementation work so far, streaming expressions were not taken into account at all. If it works, it has not been tested. Can you elaborate a little more your requirements and idea? Cheers On Wed, 18 Jan 2023, 13:51 Gabriel Magno, wrote: > Hi > > I'm trying to use streaming exp

Re: Retconn'ing Solr index schema

2022-12-16 Thread Alessandro Benedetti
Hi, Apache Solr sorts by lexicographic order so uppercase/lowercase counts! Cheers On Sat, 17 Dec 2022, 02:51 Christopher Schultz, < ch...@christopherschultz.net> wrote: > All, > > I'm trying to determine why a change was made to my internal project > some years ago. The commit comment is unhelp

Re: Dense Vector Search fieldType default parameters result in index error

2022-12-14 Thread Alessandro Benedetti
Thanks, Alastair! We'll keep you updated! ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sea

Re: Dense Vector Search fieldType default parameters result in index error

2022-12-13 Thread Alessandro Benedetti
Hi Alastair, That's definitely not intended and we'll fix it as soon as possible (it will be available in the next release). Feel free to create a Jira issue on the Apache Solr project with this details (and I'll be able to link it in the Pull Request) Cheers On Wed, 14 Dec 2022, 04:22 Alastair

Tokyo Lucene/Solr Meetup - 9 December 2022

2022-12-08 Thread Alessandro Benedetti
und! This message is meant to be for anyone tonight in Tokyo, not necessarily committers/PMC members! Have a good one! -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information

Re: 8.11.2 Performance degradation

2022-12-02 Thread Alessandro Benedetti
ou see the degradation in: - indexing? indexing how? indexing what? The extent of the degradation - searching? what kind of queries? faceting? reranking?... That would definitely help but I suspect it's not going to be an easy one. Cheers ------ *Alessandro Benedetti* Direc

Re: Announcing {!mlt_content} Re: MoreLikeThis with externally supplied text, and facets?

2022-11-30 Thread Alessandro Benedetti
Well done Mikhail! Bit busy in the next month but over Christmas, I'll take a look! So good to see there's still interest in the More Like This (that I really love) Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr

Re: [External] Re: Seeking tips about MoreLikeThis exceptions

2022-11-24 Thread Alessandro Benedetti
| Twitter <https://twitter.com/seaseltd> | Youtube <https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github <https://github.com/seaseltd> On Thu, 24 Nov 2022 at 18:00, Alessandro Benedetti wrote: > I agree with Mikhail, escaping should solve the problem! > If af

Re: Improper Solr Search results

2022-11-24 Thread Alessandro Benedetti
uest param: '...&debug=query', the output can help us. Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Trainin

Re: [External] Re: Seeking tips about MoreLikeThis exceptions

2022-11-24 Thread Alessandro Benedetti
I agree with Mikhail, escaping should solve the problem! If after that it has to do with More Like This, please let us know and I'll be happy to take a look at it (been working in the last few years quite extensively on the More Like This feature) Cheers ------ *Aless

Re: Commit Process

2022-11-20 Thread Alessandro Benedetti
After point 5 you need to draw the attention of one or more free committers that will need to review the pull request. If they agree the contribution is valid and in an acceptable form, the code will be merged. This will require some time, especially for the first contributions. The more you becom

Re: Solr Contributor Bootcamp announced to coincide with ApacheCon USA

2022-10-17 Thread Alessandro Benedetti
Option 1 works for me! Cheers On Mon, 17 Oct 2022, 20:14 Anakhe Ajayi, wrote: > Option 1 works for me, will be there > > On Mon, Oct 17, 2022 at 6:40 PM Jason Gerlowski > wrote: > > > Hey all, > > > > Wanted to give a quick update (and additional "plug") on this front. > > > > After a bit of d

Re: Understanding LTR debug query output

2022-10-10 Thread Alessandro Benedetti
ok to the way you defined those in the features.json . If hierScore is just a field value and doesn't match, possibly a bug? maybe related with the numerical representation? What is the field type? Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache L

Re: Solr upgrade|| LTR performance degrade

2022-09-28 Thread Alessandro Benedetti
Hi Sachin, you didn't reply to the whole mailing list, just did, Anna will follow up with some investigation and reproduction we are trying. Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.b

Re: Solr upgrade|| LTR performance degrade

2022-09-27 Thread Alessandro Benedetti
So you noticed that the degradation was also there pre 8.8 (<8.8)? Have you identified the first version having the regression? Was it there with 8.7, this information would help a lot! Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Commit

Re: Solr upgrade|| LTR performance degrade

2022-09-26 Thread Alessandro Benedetti
ix it! Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> Link

Re: ANN: ApacheCON "BoaF" for Solr

2022-09-20 Thread Alessandro Benedetti
Thanks David for organizing! See you there! -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io

Re: Solr search function not working on website

2022-09-14 Thread Alessandro Benedetti
(you will have to ask your engineers/support of that platform), for what went wrong. Cheers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consu

Re: Problem with "sow" and WordGraphDelimeter

2022-09-09 Thread Alessandro Benedetti
-whitespace-and-multi-field-full-text-search.html Cheers -- *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source

Re: MoreLikeThis with externally supplied text, and facets?

2022-09-09 Thread Alessandro Benedetti
heers ------ *Alessandro Benedetti* Director @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <https://linkedin.com/com

Re: Ranking based on number of OR clauses matched

2022-08-26 Thread Alessandro Benedetti
Hi Noah, That's pretty much the default if you go with a pure boolean query! Do you see a different behaviour? What is your query? Cheers On Fri, 26 Aug 2022, 11:22 Noah Torp-Smith, wrote: > We have a search engine with books that have topics and other features. We > do faceting on these featu

Re: Semantic Knowledge Graph theoric question

2022-06-27 Thread Alessandro Benedetti
ake a look here : org.apache.solr.search.facet.RelatednessAgg#computeRelatedness The formula you mentioned is ok, but I would recommend remote debugging Solr and putting some breakpoints there to investigate if something doesn't look right. Let me know! ------ *Alessandro Benede

Re: Solr meet-up "Solr 9" will be held very soon (19:00 PDT, today)

2022-06-22 Thread Alessandro Benedetti
Are any recordings or slides available? -- *Alessandro Benedetti* CEO @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <h

Re: Solr upgrade|| LTR performance degrade

2022-06-17 Thread Alessandro Benedetti
Hi Sachin, Any update on this? This could be very important for the community. We may end up discovering and fixing a bug, so please get in touch with all the info you have! Cheers -- *Alessandro Benedetti* CEO @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC

Re: SOLR 7-8 Upgrade. Now getting OOM when updating content

2022-05-24 Thread Alessandro Benedetti
r soft) ? auto-commit? Cheers -- *Alessandro Benedetti* CEO @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http:/

Re: Solr upgrade|| LTR performance degrade

2022-05-24 Thread Alessandro Benedetti
heers ------ *Alessandro Benedetti* CEO @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <https://linkedin.com/com

Re: Problem with indexing a String field in SOLR.

2022-04-28 Thread Alessandro Benedetti
-- *Alessandro Benedetti* CEO @ Sease Ltd. *Apache Lucene/Solr Committer* *Apache Solr PMC Member* e-mail: a.benede...@sease.io *Sease* - Information Retrieval Applied Consulting | Training | Open Source Website: Sease.io <http://sease.io/> LinkedIn <https://linkedin.co

Re: Question regarding the MoreLikeThis features

2022-03-21 Thread Alessandro Benedetti
Yes, Marco, I think you are on the right track! The bug you linked is relevant, feel free to fix it and I'll be glad to help in review and commit. In case you need someone to fix it, let me know! Cheers -- Alessandro Benedetti Apache Lucene/Solr PMC member and Comm

Re: [EXT] Re: Looking for expertise on comparing Solr search to Postgres full-text search

2022-03-17 Thread Alessandro Benedetti
Ok Charlie, Eric, we are on the same page. I agree it's definitely possible with some custom proxy work on both Quepid and RRE, I meant it's not possible to directly point to the DB (for example via JDBC). Thanks! Cheers -- Alessandro Benedetti Apache Lucen

Re: Looking for expertise on comparing Solr search to Postgres full-text search

2022-03-17 Thread Alessandro Benedetti
s. Cheers -- Alessandro Benedetti Apache Lucene/Solr PMC member and Committer Director, R&D Software Engineer, Search Consultant www.sease.io On Sat, 5 Mar 2022 at 05:04, David Smiley wrote: > Hello Sam, > > You are a familiar name from my MITRE days :-) > > Check out Solr&

  1   2   >