Re: Seeking New Moderators for solr-user@lucene

2013-10-20 Thread Harshvardhan Ojha
Hi Chris,

I am willing to be moderator.

Regards
Harshvardhan Ojha


On Sun, Oct 20, 2013 at 1:17 AM, Furkan KAMACI wrote:

> Hi Chris;
>
> I am volunteer and I really want to be a moderator for a long time as you
> know :)
>
> Thanks;
> Furkan KAMACI
>
>
> 2013/10/19 Alexandre Rafalovitch 
>
> > I'll be happy to moderate. I do it for some other lists already.
> >
> > Regards,
> > Alex
> >
>


Re: Seeking New Moderators for solr-user@lucene

2013-10-20 Thread Anshum Gupta
Hi Hoss,

Forgot to mention the email address: ans...@anshumgupta.net


On Sat, Oct 19, 2013 at 12:40 AM, Anshum Gupta wrote:

> Hey Hoss,
>
> I'd be happy to moderate.
>
> Sent from my iPhone
>
> > On 19-Oct-2013, at 0:22, Chris Hostetter 
> wrote:
> >
> >
> > It looks like it's time to inject some fresh blood into the
> solr-user@lucene moderation team.
> >
> > If you'd like to volunteer to be a moderator, please reply back to this
> thread and specify which email address you'd like to use as a moderator (if
> different from the one you use when sending the email)
> >
> > Being a moderator is really easy: you'll get a some extra emails in your
> inbox with MODERATE in the subject, which you skim to see if they are spam
> -- if they are you delete them, if not you "reply all" to let them get sent
> to the list, and authorize that person to send future messages w/o
> moderation.
> >
> > Occasionally, you'll see an explicit email to solr-user-owner@lucenefrom a 
> > user asking for help realted to their subscription (usually
> unsubscribing problems) and you and the other moderators chime in with
> assistance when possible.
> >
> > More details can be found here...
> >
> > https://wiki.apache.org/solr/MailingListModeratorInfo
> >
> > (I'll wait ~72+ hours to see who responds, and then file the appropriate
> jira with INFRA)
> >
> >
> > -Hoss
>



-- 

Anshum Gupta
http://www.anshumgupta.net


RE: Seeking New Moderators for solr-user@lucene

2013-10-20 Thread Jeevanandam M.
Hello Hoss -

My pleasure, kindly accept my moderator nomination.

Regards,
Jeeva

-- Original Message --
From: Chris Hostetter [mailto:hossman_luc...@fucit.org]
Sent: October 19, 2013 12:22:34 AM GMT+05:30
To: solr-user@lucene.apache.org
Subject: Seeking New Moderators for solr-user@lucene



It looks like it's time to inject some fresh blood into the solr-user@lucene 
moderation team.

If you'd like to volunteer to be a moderator, please reply back to this thread 
and specify which email address you'd like to use as a moderator (if different 
from the one you use when sending the email)

Being a moderator is really easy: you'll get a some extra emails in your inbox 
with MODERATE in the subject, which you skim to see if they are spam -- if they 
are you delete them, if not you "reply all" to let them get sent to the list, 
and authorize that person to send future messages w/o moderation.

Occasionally, you'll see an explicit email to solr-user-owner@lucene from a 
user asking for help realted to their subscription (usually unsubscribing 
problems) and you and the other moderators chime in with assistance when 
possible.

More details can be found here...

https://wiki.apache.org/solr/MailingListModeratorInfo

(I'll wait ~72+ hours to see who responds, and then file the appropriate jira 
with INFRA)


-Hoss



statistic on a field?

2013-10-20 Thread Bruno Mannina

Dear,

I have a field named "Authors", is it possible to have
the frequency of terms (first 2000 for i.e.) of this field ?

Thanks,

Bruno


Re: statistic on a field?

2013-10-20 Thread Bruno Mannina

Le 20/10/2013 17:52, Bruno Mannina a écrit :

Dear,

I have a field named "Authors", is it possible to have
the frequency of terms (first 2000 for i.e.) of this field ?

Thanks,

Bruno


By using Schema Browser, I have information on my field Authors but I 
have a problem,

I have statistic on part of terms of this field...

i.e.

termfreq
co256875
ltd235899
corp 195554
etc...

The field has been splitted to do stats ?!

FieldType: TEXT_GENERAL
Properties: Indexed, Tokenized, Stored, Multivalued
Schema: Indexed, Tokenized, Stored, Multivalued
Index: indexed, Tokenized, Stored

Position Increment Gap: 100

Distinct: 1803034

I think it's because this field is Tokenized ? no ?

Regards,
Bruno




Two easy questions...

2013-10-20 Thread Chris
Hi,

I am new to solr & have two questions -

1. how do i get an excerpt for a huge content field (would love to show
google  like excerpts, where word searched for is highlighted)

2. If i have a field - A, is it possible to get top results with only
unique values for this field in a page...?

Thanks,
Chris


difference between apache tomcat vs Jetty

2013-10-20 Thread Karunakar Reddy
Hi,
I want to know what is the difference between running solr on tomcat-apache
server vs jetty server in production.
With jetty there is some issue with indexing(eg: If 120k[batch size 100]
records are getting indexed, around 15k are missing).





Regards,
Karunakar.


Re: difference between apache tomcat vs Jetty

2013-10-20 Thread Furkan KAMACI
Which Jira issue points for: "With jetty there is some issue with
indexing(eg: If 120k[batch size 100] records are getting indexed, around
15k are missing)."?


2013/10/20 Karunakar Reddy 

> Hi,
> I want to know what is the difference between running solr on tomcat-apache
> server vs jetty server in production.
> With jetty there is some issue with indexing(eg: If 120k[batch size 100]
> records are getting indexed, around 15k are missing).
>
>
>
>
>
> Regards,
> Karunakar.
>


Re: Two easy questions...

2013-10-20 Thread Upayavira


On Sun, Oct 20, 2013, at 06:08 PM, Chris wrote:
> Hi,
> 
> I am new to solr & have two questions -
> 
> 1. how do i get an excerpt for a huge content field (would love to show
> google  like excerpts, where word searched for is highlighted)

This is possible using the highlighting component (search 'solr
highlighting' and you'll find it.

> 2. If i have a field - A, is it possible to get top results with only
> unique values for this field in a page...?

I think what you're asking is to only show one document for each value
of your field? If this is the case, then 'field collapsing' aka 'result
grouping' should be able to get you there. You group on your field, and
only show on value per group.

Sorry I can't give you more specifics right now, but google with the
above keywords should get you there.

Upayavira


Re: pivot range faceting

2013-10-20 Thread Upayavira


On Sun, Oct 20, 2013, at 04:04 AM, Toby Lazar wrote:
> Is it possible to get pivot info on a range-faceted query?  For example,
> if
> I want to query the number of orders placed in January, February, etc., I
> know I can use a simple range search.  If I want to get the number of
> orders by category, I can do that easily by faceting on category.  I'm
> wondering if I can get the number of all orders by month, and also broken
> down by category.  Is that possible in a single query?

You can't yet include a range facet within a pivot. The way to achieve
this is to store a version of your date field rounded to the nearest
month, then you will be able to use that field in a pivot facet.

Obviously, this requires index time effort, which is less than ideal.

I guess this is a feature just waiting for someone to implement it.

Upayavira


Re: Two easy questions...

2013-10-20 Thread Furkan KAMACI
For highlighting you can read here:
http://wiki.apache.org/solr/HighlightingParameters
For field collapsing you can read here:
http://wiki.apache.org/solr/FieldCollapsing


2013/10/20 Upayavira 

>
>
> On Sun, Oct 20, 2013, at 06:08 PM, Chris wrote:
> > Hi,
> >
> > I am new to solr & have two questions -
> >
> > 1. how do i get an excerpt for a huge content field (would love to show
> > google  like excerpts, where word searched for is highlighted)
>
> This is possible using the highlighting component (search 'solr
> highlighting' and you'll find it.
>
> > 2. If i have a field - A, is it possible to get top results with only
> > unique values for this field in a page...?
>
> I think what you're asking is to only show one document for each value
> of your field? If this is the case, then 'field collapsing' aka 'result
> grouping' should be able to get you there. You group on your field, and
> only show on value per group.
>
> Sorry I can't give you more specifics right now, but google with the
> above keywords should get you there.
>
> Upayavira
>


Re: measure result set quality

2013-10-20 Thread Furkan KAMACI
Let's assume that you have keywords to search and different configurations
for indexing. A/B testing is one of techniques that you can use as like
Erick mentioned.

If you want to have an automated comparison and do not have a oracle for
A/B testing there is another way. If you have an ideal result list you can
compare the similarity of your different configuration results and that
ideal result list.

The "ideal result list" can be created by an expert just for one time. If
you are developing a search engine you can search same keywords at that one
of search engines and you can use that results as ideal result list to
measure your result lists' similarities.

Kendall's tau is one of the methods to use for such kind of situations. If
you do not have any document duplication at your index (without any other
versions) I suggest to use tau a.

If you explain your system and if you explain what is good for you or what
is ideal for you I can explain you more.

Thanks;
Furkan KAMACI


2013/10/18 Erick Erickson 

> bq: How do you compare the quality of your
> search result in order to decide which schema is better?
>
> Well, that's actually a hard problem. There's the
> various TREC data, but that's a generic solution and most
> every individual application of this generic thing called
> "search" has its own version of "good" results.
>
> Note that scores are NOT comparable across different
> queries even in the same data set, so don't go down that
> path.
>
> I'd fire the question back at you, "Can you define what
> good (or better) results are in such a way that you can
> program an evaluation?" Often the answer is "no"...
>
> One common technique is to have knowledgable users
> do what's called A/B testing. You fire the query at two
> separate Solr instances and display the results side-by-side,
> and the user says "A is more relevant", or "B is more
> relevant". Kind of like an eye doctor. In sophisticated A/B
> testing, the program randomly changes which side the
> results go, so you remove "sidedness" bias.
>
>
> FWIW,
> Erick
>
>
> On Thu, Oct 17, 2013 at 11:28 AM, Alvaro Cabrerizo  >wrote:
>
> > Hi,
> >
> > Imagine the next situation. You have a corpus of documents and a list of
> > queries extracted from production environment. The corpus haven't been
> > manually annotated with relvant/non relevant tags for every query. Then
> you
> > configure various solr instances changing the schema (adding synonyms,
> > stopwords...). After indexing, you prepare and execute the test over
> > different schema configurations.  How do you compare the quality of your
> > search result in order to decide which schema is better?
> >
> > Regards.
> >
>


Re: Complex Queries in solr

2013-10-20 Thread Furkan KAMACI
You should star reading from here:
https://cwiki.apache.org/confluence/display/solr/The+Standard+Query+Parser and
you should have a look at Fuzzy Searches section. On the other hand read
here: http://wiki.apache.org/solr/FunctionQuery


2013/10/18 sayeed 

> Hi,
> Is it possible to search complex queries like
> (consult* or advis*) NEAR(40) (fee or retainer or salary or bonus)
> in solr
>
>
>
>
> -
> Sayeed
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Complex-Queries-in-solr-tp4096288.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: measure result set quality

2013-10-20 Thread Doug Turnbull
That's exactly what we advocate for in our Solr work. We call in "Test
Driven Relevancy". We work closely with content experts to help build
collaboration around search quality. (disclaimer, yes we build a product
around this) but the advice still stands regardless.

http://www.opensourceconnections.com/2013/10/14/what-is-test-driven-search-relevancy/

Cheers
-Doug Turnbull
Search Relevancy Expert
OpenSource Connections




On Sun, Oct 20, 2013 at 4:21 PM, Furkan KAMACI wrote:

> Let's assume that you have keywords to search and different configurations
> for indexing. A/B testing is one of techniques that you can use as like
> Erick mentioned.
>
> If you want to have an automated comparison and do not have a oracle for
> A/B testing there is another way. If you have an ideal result list you can
> compare the similarity of your different configuration results and that
> ideal result list.
>
> The "ideal result list" can be created by an expert just for one time. If
> you are developing a search engine you can search same keywords at that one
> of search engines and you can use that results as ideal result list to
> measure your result lists' similarities.
>
> Kendall's tau is one of the methods to use for such kind of situations. If
> you do not have any document duplication at your index (without any other
> versions) I suggest to use tau a.
>
> If you explain your system and if you explain what is good for you or what
> is ideal for you I can explain you more.
>
> Thanks;
> Furkan KAMACI
>
>
> 2013/10/18 Erick Erickson 
>
> > bq: How do you compare the quality of your
> > search result in order to decide which schema is better?
> >
> > Well, that's actually a hard problem. There's the
> > various TREC data, but that's a generic solution and most
> > every individual application of this generic thing called
> > "search" has its own version of "good" results.
> >
> > Note that scores are NOT comparable across different
> > queries even in the same data set, so don't go down that
> > path.
> >
> > I'd fire the question back at you, "Can you define what
> > good (or better) results are in such a way that you can
> > program an evaluation?" Often the answer is "no"...
> >
> > One common technique is to have knowledgable users
> > do what's called A/B testing. You fire the query at two
> > separate Solr instances and display the results side-by-side,
> > and the user says "A is more relevant", or "B is more
> > relevant". Kind of like an eye doctor. In sophisticated A/B
> > testing, the program randomly changes which side the
> > results go, so you remove "sidedness" bias.
> >
> >
> > FWIW,
> > Erick
> >
> >
> > On Thu, Oct 17, 2013 at 11:28 AM, Alvaro Cabrerizo  > >wrote:
> >
> > > Hi,
> > >
> > > Imagine the next situation. You have a corpus of documents and a list
> of
> > > queries extracted from production environment. The corpus haven't been
> > > manually annotated with relvant/non relevant tags for every query. Then
> > you
> > > configure various solr instances changing the schema (adding synonyms,
> > > stopwords...). After indexing, you prepare and execute the test over
> > > different schema configurations.  How do you compare the quality of
> your
> > > search result in order to decide which schema is better?
> > >
> > > Regards.
> > >
> >
>



-- 
Doug Turnbull
Search & Big Data Architect
OpenSource Connections 


Re: difference between apache tomcat vs Jetty

2013-10-20 Thread Shawn Heisey
On 10/20/2013 11:23 AM, Karunakar Reddy wrote:
> I want to know what is the difference between running solr on tomcat-apache
> server vs jetty server in production.
> With jetty there is some issue with indexing(eg: If 120k[batch size 100]
> records are getting indexed, around 15k are missing).

We recommend jetty.  The solr example uses jetty.  Only jetty receives
testing - whenever someone runs the tests included in the source code,
jetty is used - A LOT.  No tests are done with other containers before
release unless a person installs it under something like tomcat and
tries it out.  After release, other container testing is pretty much
done by users that don't use jetty.

Can you detail the problems you are having with indexing?  How is the
indexing being done - DIH, SolrJ, curl, or some other method?  Are there
errors in the client application or the Solr server log?

Thanks,
Shawn



Re: difference between apache tomcat vs Jetty

2013-10-20 Thread Shawn Heisey
On 10/20/2013 2:57 PM, Shawn Heisey wrote:
> We recommend jetty.  The solr example uses jetty.

I have a clarification for this statement.  We actually recommend using
the jetty that's included in the Solr 4.x example.  It is stripped of
all unnecessary features and its config has had some minor tuning so
it's optimized for Solr.  The jetty binaries in 4.x are completely
unmodified from the upstream download, we just don't include all of
them.  On the 1.x and 3.x examples, there was a small bug in Jetty 6, so
those versions included modified binaries.

If you download jetty from eclipse.org or install it from your operating
system's repository, it will include components you don't need and its
config won't be optimized for Solr, but it will still be a lot closer to
what's actually tested than tomcat is.

Thanks,
Shawn



Re: Complex Queries in solr

2013-10-20 Thread Roman Chyla
i just tested it whether our 'beautifu' parser supports it, and funnily
enough, it does :-)
https://github.com/romanchyla/montysolr/commit/f88577345c6d3a2dbefc0161f6bb07a549bc6b15

but i've (kinda) given up hope that people need powerful query parsers in
the lucene world, the LUCENE-5014 is there sitting without attention for
eons ... so your best bet is probably to use LucidWorks parser (but i don't
know if it supports proximity searches combined with wildcards and
booleans) or you can create a custom query plugin that builds the following
query

spanNear([spanOr([SpanMultiTermQueryWrapper(all:consult*),
SpanMultiTermQueryWrapper(all:advis*)]), spanOr([all:fee,
all:retainer, all:salary, all:bonus])], 4, true)

you can draw inspiration from here:
https://github.com/romanchyla/montysolr/blob/master/contrib/antlrqueryparser/src/java/org/apache/lucene/queryparser/flexible/aqp/builders/AqpNearQueryNodeBuilder.java
https://github.com/romanchyla/montysolr/blob/master/contrib/antlrqueryparser/src/java/org/apache/lucene/queryparser/flexible/aqp/builders/SpanConverter.java


but i think you are aware that such a query is NOT going to be very
efficient. especially when proximity=40 ;-)

hth

roman



On Fri, Oct 18, 2013 at 3:28 AM, sayeed  wrote:

> Hi,
> Is it possible to search complex queries like
> (consult* or advis*) NEAR(40) (fee or retainer or salary or bonus)
> in solr
>
>
>
>
> -
> Sayeed
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Complex-Queries-in-solr-tp4096288.html
> Sent from the Solr - User mailing list archive at Nabble.com.
>


Re: SOLR: Searching on OpenNLP fields is unstable

2013-10-20 Thread Lance Norskog
Hi-

Unit tests to the rescue! The current unit test system in the 4.x branch
catches code sequence problems.

  [junit4]> Throwable #1: java.lang.IllegalStateException:
TokenStream contract violation: reset()/close() call missing, reset()
called multiple times, or subclass does not call super.reset().
 Please see Javadocs of TokenStream class for more information about the
correct consuming workflow.

I'll try to get this right. But both OpenNLP and LUCENE-2899 have
deployment problems:
1) OpenNLP does not have a good source of statistical training data for the
models. For example, the NER models are trained from late 1980's newspaper
articles, so the organization finder is kind of... obsolete. That kind of
problem. I think the currency recognizer is trained on text from before the
Euro was introduced (not sure about this).
2) Solr has a basic packaging problem when the Lucene code uses external
libraries.

As to adding it to the main Solr project, I think the Marketplace Of Ideas
has spoken with deafening silence :)


On Wed, Sep 25, 2013 at 9:26 AM, rashi gandhi wrote:

> HI,
>
>
>
> I am working on OpenNLP integration with SOLR. I have successfully applied
> the patch (LUCENE-2899-x.patch) to latest SOLR source code (branch_4x).
>
> I have designed OpenNLP analyzer and index data to it. Analyzer
> declaration in schema.xml is as
>
>
>
>positionIncrementGap="100">
>
> 
>
> 
>
>  class="solr.StandardTokenizerFactory"/>
>
>  class="solr.LowerCaseFilterFactory"/>
>
>  ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true"/>
>
>  synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
>
>  class="solr.SnowballPorterFilterFactory"/>
>
>  class="solr.ASCIIFoldingFilterFactory"/>
>
> 
>
> 
>
> 
>
>  class="solr.StandardTokenizerFactory"/>
>
>  posTaggerModel="opennlp/en-pos-maxent.bin"/>
>
>  nerTaggerModels="opennlp/en-ner-person.bin"/>
>
>   nerTaggerModels="opennlp/en-ner-location.bin"/>
>
>  class="solr.LowerCaseFilterFactory"/>
>
>  ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true"/>
>
>  
>
> 
>
>
>
> And field declared for this analyzer:
>
>  omitNorms="true" omitPositions="true"/>
>
>
>
> Problem is here : When I search over this field Detail_Person, results are
> not constant.
>
>
>
> When I search Detail_Person:brett, it return one document
>
>
>
>
>
> But again when I fire the same query, it return zero document.
>
>
>
> Searching is not stable on OpenNLP field, sometimes it return documents
> and sometimes not but documents are there.
>
> And if I search on non OpenNLP fields, it is working properly, results are
> stable and correct.
>
> Please help me to make solr results consistent.
>
> Thanks in Advance.
>
>



-- 
Lance Norskog
goks...@gmail.com


Re: pivot range faceting

2013-10-20 Thread Toby Lazar
Thanks for confirming my fears.  I saw some presentations where I thought
this feature was used, but perhaps it was done performing multiple range
queries.

Any chance there is a way for copyField to copy a function of a field
instead of the original itself is there?  Or, must this be done by the
application?

Thank you again for your help.

Toby

***
  Toby Lazar
  Capital Technology Group
  Email: tla...@capitaltg.com
  Mobile: 646-469-5865
***


On Sun, Oct 20, 2013 at 2:39 PM, Upayavira  wrote:

>
>
> On Sun, Oct 20, 2013, at 04:04 AM, Toby Lazar wrote:
> > Is it possible to get pivot info on a range-faceted query?  For example,
> > if
> > I want to query the number of orders placed in January, February, etc., I
> > know I can use a simple range search.  If I want to get the number of
> > orders by category, I can do that easily by faceting on category.  I'm
> > wondering if I can get the number of all orders by month, and also broken
> > down by category.  Is that possible in a single query?
>
> You can't yet include a range facet within a pivot. The way to achieve
> this is to store a version of your date field rounded to the nearest
> month, then you will be able to use that field in a pivot facet.
>
> Obviously, this requires index time effort, which is less than ideal.
>
> I guess this is a feature just waiting for someone to implement it.
>
> Upayavira
>


Re: pivot range faceting

2013-10-20 Thread Bram Van Dam

On 10/21/2013 03:46 AM, Toby Lazar wrote:

Thanks for confirming my fears.  I saw some presentations where I thought
this feature was used, but perhaps it was done performing multiple range
queries.


Probably. I had a look at implementing the feature (because it's 
something we rely on quite a bit), but decided against it. The solr 
implementation of faceting is hard to get my head around -- and 
launching multiple queries seems to outperform pivot queries anyway.


You can use a range query to determine the ranges (and their total 
counts), and then launch an extra query per range.




Re: Query modification

2013-10-20 Thread Sidharth
Hi,

Even I am using the QueryComponent to perform similar modification to the
query. I am modifying the query in the process() method of the Component. 
The problem I am facing is that after modifying the query and setting it in
the response builder, I make a call to super.process(rb).

This call is taking around 100ms and is degrading component's performance.
Wanted to know that is process the right place to do it and do we need to
make a call to super.process() method?

Regards,
Sidharth.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Query-modification-tp939584p4096753.html
Sent from the Solr - User mailing list archive at Nabble.com.


caching HTML pages in SOLR

2013-10-20 Thread Shailendra Mudgal
Hi,

As google stores HTML pages as "*cached*" documents, is there a similar
provision in SOLR. I am using SOLR-4.4.0.


Thanks,
Shailendra


Re: caching HTML pages in SOLR

2013-10-20 Thread Alexandre Rafalovitch
Not in Solr itself, no. Solr is all about Search. Caching (and rewriting
resource links, etc) should probably be part of whatever does the document
fetching.

Regards,
   Alex.

Personal website: http://www.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
- Time is the quality of nature that keeps events from happening all at
once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)


On Mon, Oct 21, 2013 at 1:19 PM, Shailendra Mudgal <
mudgal.shailen...@gmail.com> wrote:

> Hi,
>
> As google stores HTML pages as "*cached*" documents, is there a similar
> provision in SOLR. I am using SOLR-4.4.0.
>
>
> Thanks,
> Shailendra
>


Re: caching HTML pages in SOLR

2013-10-20 Thread Shailendra Mudgal
Thanks Alex.

I was thinking if something already exists of this sort.




On Mon, Oct 21, 2013 at 12:05 PM, Alexandre Rafalovitch
wrote:

> Not in Solr itself, no. Solr is all about Search. Caching (and rewriting
> resource links, etc) should probably be part of whatever does the document
> fetching.
>
> Regards,
>Alex.
>
> Personal website: http://www.outerthoughts.com/
> LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
> - Time is the quality of nature that keeps events from happening all at
> once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)
>
>
> On Mon, Oct 21, 2013 at 1:19 PM, Shailendra Mudgal <
> mudgal.shailen...@gmail.com> wrote:
>
> > Hi,
> >
> > As google stores HTML pages as "*cached*" documents, is there a similar
> > provision in SOLR. I am using SOLR-4.4.0.
> >
> >
> > Thanks,
> > Shailendra
> >
>


Re: caching HTML pages in SOLR

2013-10-20 Thread Alexandre Rafalovitch
I have not used it myself, but perhaps something like
http://www.crawl-anywhere.com/ is along what you were looking for.

Regards,
   Alex.

Personal website: http://www.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
- Time is the quality of nature that keeps events from happening all at
once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)


On Mon, Oct 21, 2013 at 1:44 PM, Shailendra Mudgal <
mudgal.shailen...@gmail.com> wrote:

> Thanks Alex.
>
> I was thinking if something already exists of this sort.
>
>
>
>
> On Mon, Oct 21, 2013 at 12:05 PM, Alexandre Rafalovitch
> wrote:
>
> > Not in Solr itself, no. Solr is all about Search. Caching (and rewriting
> > resource links, etc) should probably be part of whatever does the
> document
> > fetching.
> >
> > Regards,
> >Alex.
> >
> > Personal website: http://www.outerthoughts.com/
> > LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
> > - Time is the quality of nature that keeps events from happening all at
> > once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)
> >
> >
> > On Mon, Oct 21, 2013 at 1:19 PM, Shailendra Mudgal <
> > mudgal.shailen...@gmail.com> wrote:
> >
> > > Hi,
> > >
> > > As google stores HTML pages as "*cached*" documents, is there a similar
> > > provision in SOLR. I am using SOLR-4.4.0.
> > >
> > >
> > > Thanks,
> > > Shailendra
> > >
> >
>


Class name of parsing the fq clause

2013-10-20 Thread YouPeng Yang
Hi
   I search the solr with fq clause,which is like:
   fq=BEGINTIME:[2013-08-25T16:00:00Z TO *] AND BUSID:(M3 OR M9)


   I am curious about the parsing process . I want to study it.
   What is the Java file name describes  the parsing  process of the fq
clause.


  Thanks

Regards.