Re: Azure AI Search uses Apache Lucene for full text search

2024-01-22 Thread Ali Akhtar
Sure, please share On Mon, Jan 22, 2024 at 1:33 AM Michael Wechner wrote: > Hi > > I recently noticed, that Azure AI Search uses Apache Lucene > for full text search > > > https://learn.microsoft.com/en-us/azure/search/search-lucene-query-architecture > > which I did

Re: Log4j

2021-12-15 Thread Ali Akhtar
Does Lucene not have any internal logging at all, e.g. for debugging? On Thu, Dec 16, 2021 at 2:49 AM Uwe Schindler wrote: > Hi, > > Lucene is an API and does not log with log4j. > > Only the user interface Luke uses log4j, but this one does not do any > networking. So unless user of Luke enters

Demo: Personal Q/A site using Lucene

2020-09-23 Thread Ali Akhtar
Hey guys, Just wanted to show you my new personal site: https://ali.actor You can put in any question about me and it finds the closest match (from about 100 documents fed into it) using Lucene + some extra magic on the backend. Hosted on a $5 box on AWS! Ali.

Re: Port on iOS

2020-08-21 Thread Ali Akhtar
Run it on the backend and make http requests from iOS On Fri, 21 Aug 2020 at 9:26 AM, Saad Umar wrote: > I want to run Lucene with iOS, how do I do that > > > > -- > > > > Best, > > > > Saad Umar > > > > Senior Software Engineer > > > > *Avanza Solutions (Pvt.) Ltd.* > > > > Office # 14-B, Fakhr

Re: Reg Java Learning Git repo for Apache Lucene & Solr.

2020-08-17 Thread Ali Akhtar
I don't know if there is a repo, but this site helped me when I was getting started: http://www.lucenetutorial.com/ On Mon, Aug 17, 2020 at 10:04 AM Ganapathy Subramanian < ganapathy.s.j...@gmail.com> wrote: > Hi , > I am in the Learning path of Apache Solr & Lucene capabilities > using

Re: I resurrected a 2013 project (Lucene 4.2) and I want to convert it to 8.6

2020-08-03 Thread Ali Akhtar
You could probably google for a dictionary and download a text file. For English, there is WordNet, which has a Java client for accessing it. I think you would use a FuzzyQuery or QueryParser with a tilde (~) to indicate the terms you’d like to do the spellcheck for. This will find terms within a 2
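
Editor's sketch: the snippet above is cut off, but it appears to describe matching terms by edit distance, which is what FuzzyQuery and the QueryParser `~` operator are based on (FuzzyQuery allows at most 2 edits). The plain-Java example below is only an illustration of that idea and does not use Lucene at all. The class and method names (`EditDistanceSketch`, `distance`, `withinMaxEdits`) are hypothetical, and it computes plain Levenshtein distance, whereas Lucene's own implementation is automaton-based and by default also counts a transposition as a single edit.

```java
public class EditDistanceSketch {

    // Plain Levenshtein distance (insert, delete, substitute) using two DP rows.
    // Illustration only: this is NOT how Lucene implements FuzzyQuery internally.
    public static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning "" into the first j chars of b takes j inserts
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning the first i chars of a into "" takes i deletes
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insertOrDelete = Math.min(curr[j - 1] + 1, prev[j] + 1);
                curr[j] = Math.min(insertOrDelete, prev[j - 1] + cost);
            }
            int[] tmp = prev; // reuse the two rows instead of allocating
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Hypothetical helper: would `candidate` be accepted for `term`
    // if at most `maxEdits` edits are allowed?
    public static boolean withinMaxEdits(String term, String candidate, int maxEdits) {
        return distance(term, candidate) <= maxEdits;
    }

    public static void main(String[] args) {
        System.out.println(distance("lucene", "lucine"));
        System.out.println(withinMaxEdits("search", "serch", 2));
    }
}
```

With a dictionary file loaded into a list, a naive spellchecker could call `withinMaxEdits` on each entry; a real index would let Lucene do this far more efficiently.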

Re: Optimizing term-occurrence counting (code included)

2020-07-23 Thread Ali Akhtar
I'm new to Lucene, so I'm not sure what the best way of speeding this up in Lucene is, but I've previously used https://github.com/npgall/cqengine for similar stuff. It provided really good performance, especially if you're just counting things. On Fri, Jul 24, 2020 at 6:55 AM Alex K wrote: > Hi

Re: ANN search current state

2020-07-16 Thread Ali Akhtar
I’m a bit of a layman in this area, but if we are talking about formats for vectors, I vote for the one used by FastAI word vectors. It’s pretty easy to work with. That is, if we are talking about the same or similar things; if not, just ignore me 😀 On Thu, 16 Jul 2020 at 7:06 PM, Michael Sokolov wrote:

Re: Ulimit recommendation for Apache Lucene 6.5.1

2020-07-14 Thread Ali Akhtar
If you cache the IndexSearcher and only have a couple of segments, and it’s a read only system (indexing is done just once), would it still open a lot of files? On Tue, 14 Jul 2020 at 7:05 PM, Erick Erickson wrote: > At least 65K. Yes, 65 thousand. Ditto for processes. > > > On Jul 14, 2020, at

Re: Providing weights for individual terms in a query based on similarity to document terms

2020-07-03 Thread Ali Akhtar
ieve this w/ Lucene's API or do I need to extend the similarity class myself? On Fri, Jul 3, 2020 at 8:44 PM Ali Akhtar wrote: > Hellooo, > > Suppose a user enters ‘box of shoes’ in my search box. I have two > documents titled ‘box of clothes’ and ‘box of socks’. I’ve figur

Providing weights for individual terms in a query based on similarity to document terms

2020-07-03 Thread Ali Akhtar
Hellooo, Suppose a user enters ‘box of shoes’ in my search box. I have two documents titled ‘box of clothes’ and ‘box of socks’. I’ve figured out through a separate algorithm that ‘socks’ is more similar to ‘shoes’ than clothes. I even have a numeric score for the similarity: for socks it’s 0.8 a
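
Editor's sketch: one way to picture the question above is to expand the query with the similar terms and give each one its similarity as a weight, so that a document matching a closer term scores higher. The plain-Java example below illustrates only that arithmetic; it is not Lucene's scoring and not necessarily what the thread settled on. In Lucene itself, per-term weights are usually expressed by wrapping each term's query in a `BoostQuery`. The class name `WeightedTermSketch`, the whitespace tokenization, and the weight of 0.5 for 'clothes' are assumptions made for illustration (the original message is cut off before giving that number); only the 0.8 for 'socks' comes from the message.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

public class WeightedTermSketch {

    // Sum the weights of all query terms that occur in the title.
    // Tokenization is a naive lowercase whitespace split (assumption).
    public static double score(Map<String, Double> weightedTerms, String title) {
        Set<String> tokens = new HashSet<>(Arrays.asList(title.toLowerCase().split("\\s+")));
        double total = 0.0;
        for (Map.Entry<String, Double> entry : weightedTerms.entrySet()) {
            if (tokens.contains(entry.getKey())) {
                total += entry.getValue();
            }
        }
        return total;
    }

    public static void main(String[] args) {
        // Query 'box of shoes', expanded with terms similar to 'shoes'.
        Map<String, Double> query = new LinkedHashMap<>();
        query.put("box", 1.0);
        query.put("of", 1.0);
        query.put("shoes", 1.0);
        query.put("socks", 0.8);   // similarity to 'shoes' given in the message
        query.put("clothes", 0.5); // hypothetical value; the message is truncated here

        System.out.println("box of socks   -> " + score(query, "box of socks"));
        System.out.println("box of clothes -> " + score(query, "box of clothes"));
    }
}
```

Under these assumed weights, 'box of socks' ranks above 'box of clothes' because its extra matching term carries the larger weight.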

Calculate the score of an arbitrary string vs a query?

2015-04-06 Thread Ali Akhtar
Hello, I'm in a situation where a search query string is being submitted simultaneously to Lucene, and to an external API. Results are fetched from both sources. I already have a score available for Lucene results, but I don't have a score for the results fetched from the external source. I'd li