Re: solr in classic asp project

2014-08-07 Thread parnab kumar
Can you elaborate on how you plan to use SOLR in your project?

Parnab..
CSE, IIT Kharagpur



On Thu, Aug 7, 2014 at 12:51 PM, Sandeep Bohra <
sandeep.bo...@3pillarglobal.com> wrote:

> I have a classic ASP 3.0 application and would like to integrate Solr
> with it. My database is SQL Server, and it also connects to an AS/400
> using batch processing. Can someone suggest a starting point?
>
>
>
> *Regards, Sandeep*
>


Re: solr boosting any perticular URL

2014-07-20 Thread parnab kumar
Look into the query elevation component:

https://wiki.apache.org/solr/QueryElevationComponent
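
As far as I know, the component reads its pinned documents from an
elevate.xml file: one query element per query text, with a doc element for
each document to push to the top. A minimal sketch of generating such an
entry follows. The query text and document id are made up for illustration,
and the id has to be the value of your uniqueKey field (that this is the URL
is an assumption about your schema).

```python
import xml.etree.ElementTree as ET


def build_elevate_xml(pins):
    """Build elevate.xml content from a {query_text: [doc_id, ...]} mapping."""
    root = ET.Element("elevate")
    for query_text, doc_ids in pins.items():
        query = ET.SubElement(root, "query", {"text": query_text})
        for doc_id in doc_ids:
            # Each <doc> is pinned to the top for this query, in listed order.
            ET.SubElement(query, "doc", {"id": doc_id})
    return ET.tostring(root, encoding="unicode")


# Hypothetical query and id, for illustration only.
xml_text = build_elevate_xml(
    {"solr tutorial": ["http://example.com/solr-tutorial"]}
)
print(xml_text)
```

The component also has to be registered in solrconfig.xml and attached to
your request handler; the wiki page above covers that part.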

Thanks,
Parnab..
Cork, Ireland


On Thu, Jul 17, 2014 at 7:58 PM, rahulmodi wrote:

> Hi There,
>
> I am new to Solr. My client is asking me to boost a particular URL so that
> it appears at the top of the results.
> I have already searched various websites, but I did not find any way to
> boost a particular URL.
>
> Please tell me whether this feature is available and, if so, how to
> achieve it.
>
> Thanks
> Rahul Modi
>
>
>
>


Re: How to integrate nlp in solr

2014-07-02 Thread parnab kumar
Aman,

  I feel that focusing on the "Question Answering" and "Information
Extraction" components of NLP should help you achieve what you are looking
for. Go through the book *Taming Text* (http://www.manning.com/ingersoll/).
It should answer most of your questions, including implementation details
and sample source code.



To put it naively:
  NLP tools give you the power to extract or interpret knowledge from
text, which you then store in the Lucene index, either as fields or
alongside the terms using payloads. At query time, you similarly gather
additional knowledge from the query (using techniques like query expansion,
relevance feedback, or ontologies) and map that knowledge onto the
knowledge gained from the text. It is an effort to move towards semantic
retrieval rather than simple term matching.
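
To make the query side concrete, here is a rough sketch using your example
query. It assumes the index already has "color" and "item" fields filled in
at indexing time. The field names, the tiny gazetteers, and the stopword list
are all hypothetical; a real system would use proper NLP tooling rather than
dictionary lookups.

```python
# Hypothetical gazetteers; in practice these would come from your own data.
GAZETTEERS = {
    "color": {"blue", "red", "black"},
    "item": {"bags", "shoes", "bottles"},
}
STOPWORDS = {"i", "want", "color", "colour", "a", "the"}


def interpret_query(text):
    """Split a free-text query into recognised entities and leftover terms."""
    entities, leftovers = {}, []
    for token in text.lower().split():
        for field, values in GAZETTEERS.items():
            if token in values:
                entities.setdefault(field, []).append(token)
                break
        else:
            # No gazetteer matched: keep the token unless it is a stopword.
            if token not in STOPWORDS:
                leftovers.append(token)
    return entities, leftovers


def to_solr_query(entities, leftovers):
    """Fielded clauses for the entities, plain terms for everything else."""
    clauses = [
        f"{field}:{value}"
        for field, values in sorted(entities.items())
        for value in values
    ]
    clauses.extend(leftovers)
    return " AND ".join(clauses)


entities, leftovers = interpret_query("I want blue color college bags")
print(to_solr_query(entities, leftovers))
# prints: color:blue AND item:bags AND college
```

The same lookup, run over the document text at indexing time, is what would
populate the "color" and "item" fields in the first place.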

Thanks,
Parnab


On Wed, Jul 2, 2014 at 6:29 AM, Aman Tandon wrote:

> Hi Alex,
>
> Thanks, Alex. One more thing I want to ask: do we need to add extra
> fields for those entities, e.g. "Item" (bags), "color" (blue), etc.?
>
> If I somehow manage to implement this NLP, I will definitely publish
> it on my blog :)
>
> With Regards
> Aman Tandon
>
>
> On Wed, Jul 2, 2014 at 10:34 AM, Alexandre Rafalovitch wrote:
>
> > Not from me, no. I don't have any real examples for this ready. I
> > suspect the path beyond the basics is VERY dependent on your data and
> > your business requirements.
> >
> > I would start by thinking about how YOU (as a human) would do that match.
> > Where do the 'blue' and 'color' and 'college' and 'bags' come from?
> > Then figure out what Solr needs to know in order to look there.
> >
> > NLP is not magic, just advanced technology. You need to know where you
> > are going to get there.
> >
> > Regards,
> >Alex.
> > Personal website: http://www.outerthoughts.com/
> > Current project: http://www.solr-start.com/ - Accelerating your Solr
> > proficiency
> >
> >
> > > On Wed, Jul 2, 2014 at 11:35 AM, Aman Tandon wrote:
> > > > Any help here?
> > >
> > > With Regards
> > > Aman Tandon
> > >
> > >
> > > > On Mon, Jun 30, 2014 at 11:00 PM, Aman Tandon wrote:
> > > >
> > > >> Hi Alex,
> > > >>
> > > >> I was trying to learn from these tutorials:
> > > >> http://www.slideshare.net/teofili/natural-language-search-in-solr and
> > > >> https://wiki.apache.org/solr/OpenNLP. The latter explains a bit, but
> > > >> there is no real demo.
> > > >> E.g. for the query "I want blue color college bags", how will it work
> > > >> using NLP, and how will it search? There is no such explanation out
> > > >> there. I will be thankful if you can help me with this.
> > >>
> > >> With Regards
> > >> Aman Tandon
> > >>
> > >>
> > > >> On Mon, Jun 30, 2014 at 6:38 AM, Alexandre Rafalovitch
> > > >> <arafa...@gmail.com> wrote:
> > > >>
> > > >>> On Sun, Jun 29, 2014 at 10:19 PM, Aman Tandon
> > > >>> <amantandon...@gmail.com> wrote:
> > >>> > the appropriate results
> > >>> What are those specifically? You need to be a bit more precise about
> > >>> what you are trying to achieve. Otherwise, there are too many NLP
> > >>> branches and too many approaches.
> > >>>
> > >>> Regards,
> > >>>Alex.
> > >>> Personal website: http://www.outerthoughts.com/
> > >>> Current project: http://www.solr-start.com/ - Accelerating your Solr
> > >>> proficiency
> > >>>
> > >>
> > >>
> >
>


Re: Alternatives to GATE?

2014-01-16 Thread parnab kumar
Hi,

You can have a look at OpenNLP.
http://opennlp.apache.org/
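
For the simpler parts of the requirement quoted below (gazetteer lookup and
date normalization), a toolkit may not even be needed. A rough sketch in
plain Python, assuming English month names, dates written like "December 1st,
2001", and a hand-made list of names. The helper names are hypothetical, the
substring matching is deliberately naive, and extracting the text from the
PDFs is a separate step not shown here.

```python
import re
from datetime import datetime

# Matches dates written like "December 1st, 2001" or "December 1, 2001".
DATE_RE = re.compile(r"\b([A-Z][a-z]+) (\d{1,2})(?:st|nd|rd|th)?, (\d{4})\b")


def normalize_dates(text):
    """Return a yyyymmdd string for every recognisable date in the text."""
    found = []
    for month, day, year in DATE_RE.findall(text):
        try:
            # %B expects a full month name in the current (English) locale.
            parsed = datetime.strptime(f"{month} {day} {year}", "%B %d %Y")
        except ValueError:
            continue  # a capitalised word that is not a month name
        found.append(parsed.strftime("%Y%m%d"))
    return found


def match_gazetteer(text, names):
    """Return the gazetteer entries that occur in the text, ignoring case."""
    lowered = text.lower()
    return sorted(name for name in names if name.lower() in lowered)


sample = "Signed on December 1st, 2001 in Paris."
print(normalize_dates(sample))
print(match_gazetteer(sample, {"Paris", "London"}))
```

The resulting values could then be sent to Solr as ordinary fields of each
document (for example a date field and a multi-valued place field).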



Thanks,
Parnab


On Thu, Jan 16, 2014 at 1:12 PM, Philippe de Rochambeau wrote:

> Hello,
>
> can anyone suggest alternatives to GATE (http://gate.ac.uk/download/)? I
> would like to index place and person names in PDFs using gazetteers (i.e.,
> dictionaries), normalize dates (e.g., December 1st, 2001 will be indexed
> as 20011201), and feed the indexes to SOLR.
>
> GATE is a great tool, but its search engine, Mimir, is unfortunately not
> customizable (or well documented) enough for my purposes, which are to
> return the found documents (PDFs) ordered by document or entity (e.g.,
> {Date}, {Person}) name.
>
> Many thanks.
>
> Philippe
>
>
>


Re: Solr index searcher to lucene index searcher

2013-04-26 Thread parnab kumar
Hi  ,

Thanks, Chris. For every document that matches the query, I want to be
able to compute the following set of features for a query-document pair:

   LuceneScore (the vector space score that Lucene gives to each doc)
   LinkScore (computed from Nutch)
   OpicScore (computed from Nutch)
   co-rd in title,content,anchor,url
   wt of Entity in title,content,anchor,url
   length of title,content,anchor,url
   sum-of-tf in title,content,anchor,url
   sum-of-norm-tf in title,content,anchor,url
   min-of-tf in title,content,anchor,url
   max-of-tf in title,content,anchor,url
   variance-of-tf in title,content,anchor,url
   sum-of-tf-idf in title,content,anchor,url
   site-reputation-score
   entity-support-score
   domain-score
   url-click-count
   query-url-click-count
   num-of-slashes-in-url

Based on the above features, I want to build a machine-learned model that
will learn to rank/score the documents. I am trying to understand how to
compute the features efficiently on the fly. Looking into the index and
computing these features seems to be very slow, so for the time being I
want to compute them only for the top K documents. A few of these features
have to be computed on the fly, and some are computed while indexing and
stored in the index. I need to be able to look at all the features to
score/rank the final set of documents.
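
To make the on-the-fly part concrete, here is a rough sketch of the tf-based
features for one document, written in plain Python rather than against the
Lucene API. It assumes the stored text of each field is available for the
top K documents, that the query has at least one term, and that whitespace
tokenization is good enough (it clearly is not for the url field). The
function and key names are hypothetical.

```python
from collections import Counter
from statistics import pvariance

FIELDS = ("title", "content", "anchor", "url")


def tf_features(query_terms, field_text):
    """Term-frequency statistics of the query terms within one field."""
    tokens = field_text.lower().split()
    counts = Counter(tokens)
    tfs = [counts[term.lower()] for term in query_terms]
    length = len(tokens)
    return {
        "length": length,
        "sum_tf": sum(tfs),
        # tf normalised by field length; 0.0 for an empty field.
        "sum_norm_tf": sum(tf / length for tf in tfs) if length else 0.0,
        "min_tf": min(tfs),
        "max_tf": max(tfs),
        "variance_tf": pvariance(tfs),
    }


def doc_features(query_terms, doc):
    """Feature dict for one document; `doc` maps field name to stored text."""
    features = {}
    for field in FIELDS:
        stats = tf_features(query_terms, doc.get(field, ""))
        for name, value in stats.items():
            features[f"{name}_{field}"] = value
    features["num_slashes_in_url"] = doc.get("url", "").count("/")
    return features


# Hypothetical document, for illustration only.
doc = {"title": "Solr search and more Solr", "url": "http://example.com/a/b"}
features = doc_features(["solr", "search"], doc)
print(features["sum_tf_title"], features["num_slashes_in_url"])
```

The precomputed features (LinkScore, OpicScore, click counts and so on) would
simply be read from stored fields and merged into the same dict.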

Thanks ,
Pom..

On Sat, Apr 27, 2013 at 5:43 AM, Chris Hostetter wrote:

> : used to call the lucene IndexSearcher . As the documents are collected in
> : TopDocs in Lucene , before that is passed back to Nutch , i used to look
> : into the top K matching documents , consult some external repository
> : and further score the Top K documents and reorder them in the TopDocs
> : array . These reordered  TopDocs is passed to Nutch .  All these
> : reordering code was implemented by Extending the lucene IndexSearcher
> : class .
>
> 1) that's basically the same info you provided before -- it still doesn't
> really tell us anything about what your current logic does with the top K
> documents and how/why/when you decide to reorder them or by how much --
> details that are kind of important in being able to provide you with any
> meaningful advice on how to achieve your goal using the plugin hooks
> available in Solr.
>
> 2) if you only care about re-ordering the Top K documents using some
> secret sauce, then i would suggest you just set rows=K and let Solr do
> its thing, then post-process the results -- either in your client, or in a
> SearchComponent that modifies the SolrDocumentList produced by
> QueryComponent.
>
> : > can you elaborate on what exactly your "some logic" involves?
> ...
> : > https://people.apache.org/~hossman/#xyproblem
> : > XY Problem
> : >
> : > Your question appears to be an "XY Problem" ... that is: you are
> : > dealing with "X", you are assuming "Y" will help you, and you are
> : > asking about "Y" without giving more details about the "X" so that we
> : > can understand the full issue.  Perhaps the best solution doesn't
> : > involve "Y" at all?
> : > See Also: http://www.perlmonks.org/index.pl?node_id=542341
>
>
> -Hoss
>


Re: Solr index searcher to lucene index searcher

2013-04-23 Thread parnab kumar
Hi ,

Thanks, Chris. I had been using Nutch 1.1. The Nutch IndexSearcher
used to call the Lucene IndexSearcher. As the documents were collected in
TopDocs in Lucene, and before they were passed back to Nutch, I used to
look into the top K matching documents, consult an external repository,
further score the top K documents, and reorder them in the TopDocs array.
The reordered TopDocs were then passed to Nutch. All of this reordering
code was implemented by extending the Lucene IndexSearcher class.

The Lucene core that comes with Solr is a bit different from the one that
used to come with Nutch 1.1. As a result, implementing the same thing is
not straightforward. Moreover, I cannot figure out at exactly which point
SolrIndexSearcher interacts directly with the Lucene IndexSearcher.

With FunctionQuery I lose the flexibility of looking into the documents
before they are passed to the final result set.

Now I am using Solr 3.4, and I would like to implement the same thing
between Solr and Lucene.
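
To make the reordering step concrete, this is roughly its shape, written as
a plain Python sketch rather than the actual Lucene code. The linear blend,
the weight, and the function name are assumptions for illustration only; the
real scoring logic is more involved.

```python
def rerank_top_k(docs, external_scores, k, weight=0.5):
    """Reorder the first k docs by a blend of engine and external scores.

    docs: list of dicts with "id" and "score", already sorted by score.
    external_scores: dict mapping id to a score from the external repository.
    Documents beyond the first k keep their original order.
    """
    head, tail = docs[:k], docs[k:]

    def blended(doc):
        # Assumed linear blend; ids missing from the repository count as 0.
        external = external_scores.get(doc["id"], 0.0)
        return (1 - weight) * doc["score"] + weight * external

    return sorted(head, key=blended, reverse=True) + tail


# Hypothetical result list and external scores.
docs = [
    {"id": "a", "score": 2.0},
    {"id": "b", "score": 1.5},
    {"id": "c", "score": 1.0},
]
reranked = rerank_top_k(docs, {"b": 3.0}, k=2)
print([doc["id"] for doc in reranked])
# prints: ['b', 'a', 'c']
```

What I am looking for is the place in Solr where a step like this can see the
matched documents before the response is built.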

Thanks ,
Pom

On Wed, Apr 24, 2013 at 3:05 AM, Chris Hostetter wrote:

>
> : > > . For any query it passes through the search handler and solr finally
> : > > directs it to lucene Index Searcher. As results are matched and
> : > > collected as TopDocs in lucene i want to inspect the top K Docs ,
> : > > reorder them by some logic and pass the final TopDocs to solr which
> : > > solr may send as a response .
>
> can you elaborate on what exactly your "some logic" involves?
>
> instead of writing a custom collector, using a function query may be the
> best solution.
>
> https://people.apache.org/~hossman/#xyproblem
> XY Problem
>
> Your question appears to be an "XY Problem" ... that is: you are dealing
> with "X", you are assuming "Y" will help you, and you are asking about "Y"
> without giving more details about the "X" so that we can understand the
> full issue.  Perhaps the best solution doesn't involve "Y" at all?
> See Also: http://www.perlmonks.org/index.pl?node_id=542341
>
>
> -Hoss
>


Re: Solr index searcher to lucene index searcher

2013-04-23 Thread parnab kumar
Hi  ,

Timothy, thanks for pointing that out. But I have a specific requirement.
Every query passes through the search handler, and Solr finally directs it
to the Lucene IndexSearcher. As results are matched and collected as
TopDocs in Lucene, I want to inspect the top K docs, reorder them by some
logic, and pass the final TopDocs to Solr, which Solr can then send as the
response.

I need to know the point where this interaction between Solr and Lucene
actually takes place.
Can anyone please tell me where to look for this?

Thanks..
Pom

On Tue, Apr 23, 2013 at 9:25 PM, Timothy Potter wrote:

> org.apache.solr.search.SolrIndexSearcher
>
> On Tue, Apr 23, 2013 at 9:51 AM, parnab kumar wrote:
> > Hi ,
> >
> > Can anyone please point out from where a solr search originates
> > and how it passes to the lucene index searcher and back to solr . I
> > actually want to know which class in solr directly calls the lucene Index
> > Searcher .
> >
> > Thanks.
> > Pom
>


Solr index searcher to lucene index searcher

2013-04-23 Thread parnab kumar
Hi ,

Can anyone please point out where a Solr search originates, and how it
passes to the Lucene IndexSearcher and back to Solr? I actually want to
know which class in Solr directly calls the Lucene IndexSearcher.

Thanks.
Pom


how to skip test while building

2013-04-06 Thread parnab kumar
Hi All,

  I am new to Solr. I am using Solr 3.4. I want to build it without
compiling the Lucene test files and without running the tests. Can
anyone please tell me where to make the necessary changes?

Thanks,
Pom