Re: solr unicode problem in query page

2011-09-17 Thread Erik Hatcher
Xue-Feng - 你好 - 

You'll need to give us some more specifics.  *:* will always work, but 
searching on other strings is entirely dependent on your configuration and the 
exact query.  For us to help, you'll need to share your full query 
(debugQuery=true helps here) and the related field(s)/type(s) that are being 
queried.

Erik

On Sep 16, 2011, at 17:10 , Xue-Feng Yang wrote:

 I just started testing Solr with Chinese content. In the admin page, I could 
 use *:* and saw the correct result. However, when I tried a specific 
 search, I got no results.
 
 
 I don't know what the problem is. Any thoughts?
 
 
 By the way, I have a few cores running in English without any encoding problem, 
 and on my test application server I have some applications that work fine 
 with Chinese. 
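
A minimal Python sketch of building the kind of request Erik asks for, with debugQuery=true appended. The host, core name and query are the ones that appear later in this thread; the helper name build_select_url is hypothetical, and the sketch assumes the query should be percent-encoded as UTF-8.

```python
from urllib.parse import urlencode


def build_select_url(base, query, debug=True):
    """Return a Solr select URL with the parameters percent-encoded as UTF-8."""
    params = [
        ("q", query),
        ("version", "2.2"),
        ("start", "0"),
        ("rows", "10"),
        ("indent", "on"),
    ]
    if debug:
        # debugQuery=true asks Solr to include the parsed query in the response.
        params.append(("debugQuery", "true"))
    return base + "?" + urlencode(params, encoding="utf-8")


url = build_select_url("http://localhost:8080/solr/cuv/select/", "content:上帝")
print(url)
```

Pasting the printed URL into a browser, together with the field and fieldType definitions from schema.xml, gives the list what it needs to diagnose the query.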



Glassfish errors: DPL8006 and DPL8007

2011-09-17 Thread Xue-Feng Yang
I have Solr deployed in GlassFish 3.1.1. It logs the following messages, in 
this order, at startup. 


WARNING: DPL8007: Unsupported deployment descriptors element schemaLocation 
value http://www.bea.com/ns/weblogic/90 
http://www.bea.com/ns/weblogic/90/weblogic-web-app.xsd
SEVERE: DPL8006: get/add descriptor failure : 
filter-dispatched-requests-enabled TO false

Here is the reference for these two errors from the Sun documentation:

DPL8006 get/add descriptor failure : {0} TO {1} 
Cause:  Adding or getting a descriptor failed
Cause:  May be because the node / information to be added is not
valid; may be because of the descriptor was not registered
Solution:  Ensure that the node to be added is valid
Solution:  Ensure that the permissions are set as expected

DPL8007 Unsupported deployment descriptors element {0} value {1}
Cause:  Failed to find the resource specified in the deployment descriptor
Cause:  May be because of wrong specification in the descriptor
Solution:  Ensure that the resource specified is present
Solution:  Ensure that there is no typo in the resource specified in the
descriptor

I checked, and there is a weblogic.xml in apache-solr-3.4.0.war.

What is this XML file for? Should I delete it?

Thanks.

Re: solr unicode problem in query page

2011-09-17 Thread Xue-Feng Yang
Thanks-谢谢-Erik.

Actually, I was just testing with http://localhost:8080/solr/cuv/admin/

cuv is the Chinese Union Version of the Bible. I entered content:上帝 in the 
Query String box, without quotation marks. It then redirects to

http://localhost:8080/solr/cuv/select/?q=content%3A%E4%B8%8A%E5%B8%9D&version=2.2&start=0&rows=10&indent=on

The response XML is

<response>
<lst name="responseHeader">
  <int name="status">0</int>
  <int name="QTime">1</int>
  <lst name="params">
    <str name="q">content:ä¸å¸</str>
    <str name="version">2.2</str>
    <str name="start">0</str>
    <str name="rows">10</str>
    <str name="indent">on</str>
  </lst>
</lst>
<result name="response" numFound="0" start="0"/>
</response>

There are no results. 


In addition, the query *:* returns correct Chinese. I also have 26 English 
versions of the Bible, which are OK so far, and some other applications in 
Chinese in the same GlassFish domain. They have no trouble displaying the 
language.


Best,

Xue-Feng









Re: solr unicode problem in query page

2011-09-17 Thread Xue-Feng Yang
Here is the response returned for 
http://localhost:8080/solr/cuv/select/?q=content%3A%C9%CF%B5%DB&version=2.2&start=0&rows=10&indent=on&debugQuery=true

<response>
<lst name="responseHeader">
  <int name="status">0</int>
  <int name="QTime">11</int>
  <lst name="params">
    <str name="q">content:ÉϵÛ</str>
    <str name="version">2.2</str>
    <str name="start">0</str>
    <str name="rows">10</str>
    <str name="indent">on</str>
    <str name="debugQuery">true</str>
  </lst>
</lst>
<result name="response" numFound="0" start="0"/>
<lst name="debug">
  <str name="rawquerystring">content:ÉϵÛ</str>
  <str name="querystring">content:ÉϵÛ</str>
  <str name="parsedquery"/>
  <str name="parsedquery_toString"/>
  <lst name="explain"/>
  <str name="QParser">LuceneQParser</str>
  <lst name="timing">
    <double name="time">6.0</double>
    <lst name="prepare">
      <double name="time">3.0</double>
      <lst name="org.apache.solr.handler.component.QueryComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.FacetComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.MoreLikeThisComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.HighlightComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.StatsComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.DebugComponent">
        <double name="time">0.0</double>
      </lst>
    </lst>
    <lst name="process">
      <double name="time">1.0</double>
      <lst name="org.apache.solr.handler.component.QueryComponent">
        <double name="time">1.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.FacetComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.MoreLikeThisComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.HighlightComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.StatsComponent">
        <double name="time">0.0</double>
      </lst>
      <lst name="org.apache.solr.handler.component.DebugComponent">
        <double name="time">0.0</double>
      </lst>
    </lst>
  </lst>
</lst>
</response>

Basically, it doesn't tell me anything.
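
The two request URLs in this thread carry different bytes for the same two characters. The Python sketch below shows one reading that is consistent with them: %E4%B8%8A%E5%B8%9D is 上帝 encoded as UTF-8, and %C9%CF%B5%DB is 上帝 encoded as GBK. The last step assumes, as the echoed q parameter suggests but the thread does not confirm, that the server decoded the request bytes with a single-byte charset such as ISO-8859-1.

```python
from urllib.parse import quote, unquote_to_bytes

TERM = "上帝"

# The same two characters percent-encoded under two different charsets.
utf8_form = quote(TERM, encoding="utf-8")
gbk_form = quote(TERM, encoding="gbk")
print(utf8_form)  # %E4%B8%8A%E5%B8%9D
print(gbk_form)   # %C9%CF%B5%DB

# Decode the GBK bytes from the second URL in two ways.
raw = unquote_to_bytes("%C9%CF%B5%DB")
as_gbk = raw.decode("gbk")            # the original Chinese term
as_latin1 = raw.decode("iso-8859-1")  # four accented Latin characters
print(as_gbk)
print(as_latin1)
```

If the indexed text was stored as proper Unicode, neither mis-decoded form can match it, which would explain numFound=0 and the empty parsedquery.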




Re: solr unicode problem in query page

2011-09-17 Thread Xue-Feng Yang
Just a reminder: the two responses contain different encoded bytes for the 
query term even though my input did not change. Compare the request URLs in 
my two previous messages (%E4%B8%8A%E5%B8%9D versus %C9%CF%B5%DB).


Is it possible to use different types of datasource in DIH?

2011-09-17 Thread O. Klein
I want to combine data from XML files on disk with XML fetched online. 

<dataSource type="FileDataSource" encoding="UTF-8" />

is needed to read all the XML files on disk, and

<dataSource type="URLDataSource" name="url" encoding="UTF-8"
connectionTimeout="3" readTimeout="3"/>

is needed to get the content from XML online.

Using them both causes problems, as the FileDataSource is being used
even though the entity specifically calls for datasource="url".

Is there a way to fix this?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-use-different-types-of-datasource-in-DIH-tp3344380p3344380.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Is it possible to use different types of datasource in DIH?

2011-09-17 Thread Ahmet Arslan
 I want to combine data from XML files on disk with XML fetched online. 
 
 <dataSource type="FileDataSource" encoding="UTF-8" />
 
 is needed to read all the XML files on disk, and
 
 <dataSource type="URLDataSource" name="url" encoding="UTF-8"
 connectionTimeout="3" readTimeout="3"/>
 
 is needed to get the content from XML online.
 
 Using them both causes problems, as the FileDataSource is being used
 even though the entity specifically calls for datasource="url".
 
 Is there a way to fix this?

Multiple_DataSources?
http://wiki.apache.org/solr/DataImportHandler#Multiple_DataSources


integrate solr and nutch index

2011-09-17 Thread hadi
I want to index my video and image files with the SolrJ API and index my sites
with Nutch 1.3. The problem is this:
to index the sites, I have to overwrite the Solr schema with the Nutch schema. If I
do this, I lose the Solr schema and can no longer index files with SolrJ. How can
I solve this problem?
Should I define a separate core for each schema and its data?

--
View this message in context: 
http://lucene.472066.n3.nabble.com/integrate-solr-and-nutch-index-tp3344176p3344176.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Is it possible to use different types of datasource in DIH?

2011-09-17 Thread O. Klein
That doesn't really help.

Using multiple data sources of the same type, or a combination of e.g.
FileDataSource and BinURLDataSource, is no problem.

Using FileDataSource and URLDataSource together doesn't work, because FileDataSource is
always used, even if the entity specifies a URLDataSource as its
datasource.

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Is-it-possible-to-use-different-types-of-datasource-in-DIH-tp3344380p3344668.html
Sent from the Solr - User mailing list archive at Nabble.com.
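
One thing worth ruling out in a setup like this is a mismatch between the entity attribute and the declared data source name. The Python sketch below checks a DIH configuration for that. It assumes DIH reads the entity attribute as dataSource (camel case, as in the wiki examples), so a lowercase datasource attribute would be ignored and the unnamed default source used instead; the thread does not confirm that this was the cause here. The sample configuration, entity names and URLs are hypothetical.

```python
import xml.etree.ElementTree as ET


def check_entity_datasources(config_xml):
    """Return warnings about entity attributes that reference data sources."""
    root = ET.fromstring(config_xml)
    # Names of all declared data sources (None for an unnamed default source).
    declared = {ds.get("name") for ds in root.iter("dataSource")}
    warnings = []
    for entity in root.iter("entity"):
        name = entity.get("name", "?")
        for attr, value in entity.attrib.items():
            if attr.lower() == "datasource" and attr != "dataSource":
                warnings.append(
                    f"{name}: attribute '{attr}' should be spelled 'dataSource'"
                )
            elif attr == "dataSource" and value not in declared:
                warnings.append(f"{name}: dataSource '{value}' is not declared")
    return warnings


SAMPLE = """
<dataConfig>
  <dataSource type="FileDataSource" encoding="UTF-8" />
  <dataSource type="URLDataSource" name="url" encoding="UTF-8" />
  <document>
    <entity name="local" processor="XPathEntityProcessor"
            url="/data/a.xml" forEach="/doc" />
    <entity name="remote" datasource="url" processor="XPathEntityProcessor"
            url="http://example.com/a.xml" forEach="/doc" />
  </document>
</dataConfig>
"""

result = check_entity_datasources(SAMPLE)
print(result)
```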


Re: integrate solr and nutch index

2011-09-17 Thread Gora Mohanty
On Sat, Sep 17, 2011 at 5:57 PM, hadi md.anb...@gmail.com wrote:
 I want to index my video and image files with the SolrJ API and index my sites
 with Nutch 1.3. The problem is this:
 to index the sites, I have to overwrite the Solr schema with the Nutch schema. If I
 do this, I lose the Solr schema and can no longer index files with SolrJ. How can
 I solve this problem?
 Should I define a separate core for each schema and its data?
[...]

Yes, it would probably be best to use separate cores, or separate Solr
instances for data that need different schemas.

If you really wish to, and there is some reason to do this, you could
also have a broad schema that has non-required fields that cover the
needs for both data models.

Regards,
Gora


Re: Distinct elements in a field

2011-09-17 Thread Bill Bell
SOLR-2242 can do it.

On 9/16/11 2:15 AM, swiss knife swiss_kn...@email.com wrote:

I could get this number by using

 group.ngroups=true&group.limit=0

 but doing grouping for this seems like overkill.

 Would you advise using JIRA SOLR-1814?

- Original Message -
From: swiss knife
Sent: 09/15/11 12:43 PM
To: solr-user@lucene.apache.org
Subject: Distinct elements in a field

 Simple question: I want to know how many distinct values a field has
among the documents that match a query. Do you know if there's a way to do it
today in 3.4? I saw SOLR-1814 and SOLR-2242. SOLR-1814 seems fairly easy
to use. What do you think? Thank you




Re: Distinct elements in a field

2011-09-17 Thread Ken Krugler

On Sep 15, 2011, at 3:43am, swiss knife wrote:

 Simple question: I want to know how many distinct values a field has 
 among the documents that match a query. Do you know if there's a way to do it today in 3.4?
 
 I saw SOLR-1814 and SOLR-2242.
 
 SOLR-1814 seems fairly easy to use. What do you think? Thank you

If you turn on facets in your query (facet=true&facet.field=<field name>) then 
you'll get back all of the distinct values, though you might have to play with 
other settings (e.g. facet.limit=-1) to get the results you need.

-- Ken

--
Ken Krugler
+1 530-210-6378
http://bixolabs.com
custom big data solutions & training
Hadoop, Cascading, Mahout & Solr
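
A minimal Python sketch of the facet approach Ken describes: build the facet parameters, then count the distinct values in a JSON response. It assumes the default flat JSON facet layout (value, count, value, count, ...) and adds facet.mincount=1 so that only values present in matching documents are returned. The field name author and the sample response are hypothetical.

```python
from urllib.parse import urlencode


def facet_params(query, field):
    """Query-string parameters that return every distinct value of `field`."""
    return urlencode([
        ("q", query),
        ("rows", "0"),            # only the facet counts are needed
        ("wt", "json"),
        ("facet", "true"),
        ("facet.field", field),
        ("facet.limit", "-1"),    # no cap on the number of values returned
        ("facet.mincount", "1"),  # skip values with no matching documents
    ])


def count_distinct(response, field):
    """Count values with a non-zero count in a flat [value, count, ...] list."""
    flat = response["facet_counts"]["facet_fields"][field]
    return sum(1 for count in flat[1::2] if count > 0)


sample = {
    "facet_counts": {
        "facet_fields": {"author": ["smith", 3, "jones", 1, "lee", 0]}
    }
}
distinct = count_distinct(sample, "author")
print(facet_params("*:*", "author"))
print(distinct)  # 2
```

On a field with very many distinct values, returning all of them just to count them can be expensive, which is the motivation behind the JIRA issues mentioned above.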





Re: Schema fieldType y-m-d ?!?!

2011-09-17 Thread Erick Erickson
Strings, especially if you sort, facet, etc., will be significantly
more costly in terms of memory requirements. That doesn't mean you
can't use them; just be aware of the fact...

Best
Erick

On Thu, Sep 15, 2011 at 10:42 AM, stockii stock.jo...@googlemail.com wrote:
 Thanks =)

 I think I will save this as a string, if ranges really work =)




Re: Can index size increase when no updates/optimizes are happening?

2011-09-17 Thread Erick Erickson
I'd guess user error in the absence of stronger information. With no
docs added, no optimize kicked off, and no replication, I
can't think of any reason the index size should increase.

Erick

2011/9/15 Yury Kats yuryk...@yahoo.com:
 On 9/14/2011 2:36 PM, Erick Erickson wrote:
 What is the machine used for? Was your user looking at
 a master? Slave? Something used for both?

 Stand-alone machine with multiple Solr cores. No replication.

 Measuring the size of all the files in the index? Or looking
 at memory?

 Disk space.

 The index files shouldn't be getting bigger unless there
 were indexing operations going on.

 That's what I thought.

 Is it at all possible that
 DIH was configured to run automatically (or any other
 indexing job for that matter) and your user didn't realize it?

 There's no DIH, but there is a custom app that submits docs
 for indexing via SolrJ. Supposedly, the Solr logs were not showing
 any updates overnight, so the assumption was that no new docs
 were added.

 I'd write it off as user error, but wanted to double-check with
 the community that no other internal Solr/Lucene task can change the index
 file size in the absence of submits.



Miscellaneous DIH related questions

2011-09-17 Thread Pulkit Singhal
My DIH full-import logs end with trailing output saying that 1500
documents were added, which is correct because I have 16 sources,
one of them was down, and each source is supposed to give me 100
results:
(1500 adds)],optimize=} 0 0

But when I check my document count I get only 1384 results:
INFO: [rss] webapp=/solr path=/select params={start=0&q=*:*&rows=0}
hits=1384 status=0 QTime=0

1) I think I may have duplicates based on the primary key for the data
coming in. Is there any other explanation than that?
2) Is there some way to get a log of how many documents were deleted?
Because an update does a delete and then an add, this would allow me to
confirm what is going on.

The sources I have are URL based. Sometimes they appear to be down,
I suppose because the request gets denied:
SEVERE: Exception thrown while getting data
java.io.FileNotFoundException:
http://www.amazon.com/rss/tag/anime/popular/ref=tag_tdp_rss_pop_man?length=100
Caused by: java.io.FileNotFoundException:
http://www.amazon.com/rss/tag/anime/popular/ref=tag_tdp_rss_pop_man?length=100
at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1434)

3) Is there some way to configure the data source to retry 3 times or
something like that? I have increased the values for connectionTimeout
and readTimeout, but that doesn't help when the server simply
denies the request due to heavy load. I need to be able to retry at
those times. onError has only the abort, skip, and continue options, none
of which really lets me retry anything.

Thank You.
- Pulkit
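
The thread does not identify a DIH setting that retries a failed fetch. One workaround, sketched here in Python under that assumption, is to download each feed outside DIH with a retry loop and import the saved files from disk. The function name, the delay values and the stub fetcher are hypothetical; a real fetcher could wrap urllib.request.urlopen, whose errors are subclasses of OSError.

```python
import time


def fetch_with_retry(fetch, url, attempts=3, delay=1.0, sleep=time.sleep):
    """Call fetch(url), retrying on OSError with a linearly growing pause."""
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return fetch(url)
        except OSError as err:
            last_error = err
            if attempt < attempts:
                sleep(delay * attempt)  # wait a little longer each time
    raise last_error


# Stub fetcher that is refused twice and then succeeds.
calls = []


def flaky_fetch(url):
    calls.append(url)
    if len(calls) < 3:
        raise OSError("request denied")
    return b"<rss/>"


body = fetch_with_retry(flaky_fetch, "http://example.com/feed", sleep=lambda s: None)
print(body, len(calls))
```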


Re: Generating large datasets for Solr proof-of-concept

2011-09-17 Thread Pulkit Singhal
Thanks Hoss. I agree that the way you restated the question is better
for getting results. BTW I think you've tipped me off to exactly what
I needed with this URL: http://bbyopen.com/

Thanks!
- Pulkit

On Fri, Sep 16, 2011 at 4:35 PM, Chris Hostetter
hossman_luc...@fucit.org wrote:

 : Has anyone ever had to create large mock/dummy datasets for test
 : environments or for POCs/Demos to convince folks that Solr was the
 : wave of the future? Any tips would be greatly appreciated. I suppose
 : it sounds a lot like crawling even though it started out as innocent
 : DIH usage.

 the better question to ask is where you can find good sample data sets for
 building proof of concept implementations.

 If you want an example of product data, the best buy product catalog is
 available for developers using either an API or a bulk download of xml
 files...

        http://bbyopen.com/

 ...last time i looked (~1 year ago) there were about 1 million products in
 the data dump.


 -Hoss



Re: Add copyTo Field without re-indexing?

2011-09-17 Thread Erick Erickson
Luis:

This will only work if *all* the fields have stored="true", or at
least enough of them that you can get back all the original data.
Fetching a document will not return data for any field that has
stored="false"...

Best
Erick
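
Before attempting the read-and-reindex approach, it helps to list the fields whose original values cannot be read back. The Python sketch below scans a schema for fields explicitly marked stored="false". It does not follow defaults inherited from a fieldType, so treat it as a first pass only; the sample schema is hypothetical.

```python
import xml.etree.ElementTree as ET


def unstored_fields(schema_xml):
    """Names of <field> elements that are explicitly declared stored="false"."""
    root = ET.fromstring(schema_xml)
    return [
        field.get("name")
        for field in root.iter("field")
        if field.get("stored") == "false"
    ]


SAMPLE_SCHEMA = """
<schema name="example" version="1.4">
  <fields>
    <field name="id" type="string" indexed="true" stored="true"/>
    <field name="body" type="text" indexed="true" stored="false"/>
    <field name="title" type="text" indexed="true"/>
  </fields>
</schema>
"""

missing = unstored_fields(SAMPLE_SCHEMA)
print(missing)  # ['body']
```

Any field in the printed list would come back empty when documents are fetched, so a copy built from fetched documents would lose that data.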

On Fri, Sep 16, 2011 at 2:33 PM, Luis Cappa luisca...@gmail.com wrote:
 Hello.

 You can also write an application yourself that uses SolrJ to retrieve 
 all the documents from your index, process them, add all the new information 
 (fields) you want, and then index them into another Solr index. It's easy.

 Goodbye!



 On 16/09/2011, at 17:39, Olson, Ron rol...@lbpc.com wrote:

 Hi all-

 I have an 11 GB index that I realize I need to add another field to, 
 not from the actual query using DIH, but via copyTo.

 Is there any way to re-parse an existing index, adding the new copyTo field, 
 without having to basically start all over again with DIH?

 Thanks,

 Ron




Re: Miscellaneous DIH related questions

2011-09-17 Thread Erick Erickson
For (2), look at your admin/stats page. The difference between maxDoc and
numDocs is the number of documents that have been deleted from your
index...

For (3), I don't have a clue.

Best
Erick
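
The arithmetic behind questions (1) and (2) can be sketched in Python. The figures 1500 and 1384 come from the logs in the original message; the helper names and the sample keys are hypothetical. Note that an optimize purges deleted documents, so after the optimize shown in that log the two counts on the stats page may already be equal.

```python
from collections import Counter


def deleted_docs(max_doc, num_docs):
    """Deleted-but-not-yet-purged documents, from the two stats-page counts."""
    return max_doc - num_docs


def duplicate_keys(keys):
    """Return {key: occurrences} for every unique-key value seen more than once."""
    return {key: n for key, n in Counter(keys).items() if n > 1}


adds_reported = 1500  # "(1500 adds)" in the import log
hits = 1384           # hits for q=*:* afterwards
overwritten = adds_reported - hits
print(overwritten)  # 116

# Checking the incoming keys directly would confirm the duplicate theory.
dupes = duplicate_keys(["a", "b", "a", "c", "b", "a"])
print(dupes)  # {'a': 3, 'b': 2}
```

If duplicate keys are the only cause, 116 of the 1500 added documents replaced an earlier document with the same key.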




Re: Glassfish errors: DPL8006 and DPL8007

2011-09-17 Thread Erik Hatcher
It's for WebLogic.  Deleting it won't cause any issues (unless you're deploying 
to WebLogic, but you're not).  Give it a try.  
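
A minimal Python sketch of removing the descriptor by writing a copy of the WAR without it. It assumes the file sits at WEB-INF/weblogic.xml inside the archive, which should be checked against the actual WAR first; the function name is hypothetical and the demonstration uses a small in-memory archive instead of the real apache-solr-3.4.0.war.

```python
import io
import zipfile


def copy_war_without(src, dst, skip="WEB-INF/weblogic.xml"):
    """Copy every entry from src to dst except `skip`; return True if dropped."""
    dropped = False
    with zipfile.ZipFile(src) as zin, \
            zipfile.ZipFile(dst, "w", zipfile.ZIP_DEFLATED) as zout:
        for info in zin.infolist():
            if info.filename == skip:
                dropped = True
                continue
            zout.writestr(info, zin.read(info.filename))
    return dropped


# Build a tiny stand-in WAR in memory and strip the descriptor from it.
source = io.BytesIO()
with zipfile.ZipFile(source, "w") as war:
    war.writestr("WEB-INF/web.xml", "<web-app/>")
    war.writestr("WEB-INF/weblogic.xml", "<weblogic-web-app/>")

target = io.BytesIO()
was_dropped = copy_war_without(source, target)
remaining = zipfile.ZipFile(target).namelist()
print(was_dropped, remaining)
```

With real files, src and dst can be file paths; keep the original WAR until the new copy has been deployed successfully.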





Re: Glassfish errors: DPL8006 and DPL8007

2011-09-17 Thread Xue-Feng Yang
I deleted it. Thanks.





a weird error of embedded server initialization

2011-09-17 Thread Xue-Feng Yang
I was able to resolve the Unicode issue for the deployment on GlassFish, so I 
turned to testing the embedded Solr server.  There is a weird error. It is 
thrown from

CoreContainer container = new CoreContainer();




java.io.FileNotFoundException: C:\intelli\config\security.properties (The system cannot find the path specified)
    at java.io.FileInputStream.open(Native Method)
    at java.io.FileInputStream.init(FileInputStream.java:120)
    at com.sun.enterprise.util.Utility.getPropertiesFromFile(Utility.java:56)
    at com.sun.enterprise.iiop.POAEJBORB.clinit(POAEJBORB.java:83)
    at java.lang.Class.forName0(Native Method)
    at java.lang.Class.forName(Class.java:247)
    at org.omg.CORBA.ORB.create_impl(ORB.java:295)
    at org.omg.CORBA.ORB.init(ORB.java:336)
    at com.sun.enterprise.util.ORBManager.createORB(ORBManager.java:140)
    at com.sun.enterprise.util.ORBManager.init(ORBManager.java:69)
    at com.sun.enterprise.naming.SerialInitContextFactory.init(SerialInitContextFactory.java:35)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
    at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
    at java.lang.Class.newInstance0(Class.java:355)
    at java.lang.Class.newInstance(Class.java:308)
    at javax.naming.spi.NamingManager.getInitialContext(NamingManager.java:654)
    at javax.naming.InitialContext.getDefaultInitCtx(InitialContext.java:288)
    at javax.naming.InitialContext.init(InitialContext.java:223)
    at javax.naming.InitialContext.init(InitialContext.java:175)
    at org.apache.solr.core.SolrResourceLoader.locateSolrHome(SolrResourceLoader.java:593)
    at org.apache.solr.core.CoreContainer.init(CoreContainer.java:82)
    at com.norvar.test.samples.SimpleMain.getEmbeddedSolrServer(SimpleMain.java:151)
    at com.norvar.test.samples.SimpleMain.getSolrServer(SimpleMain.java:143)
    at com.norvar.test.samples.CUVMain.run(CUVMain.java:20)
    at com.norvar.test.samples.CUVMain.main(CUVMain.java:15)
java.io.FileNotFoundException: C:\intelli\config\ejb.properties (The system cannot find the path specified)
    at java.io.FileInputStream.open(Native Method)
    at java.io.FileInputStream.init(FileInputStream.java:120)
    at com.sun.enterprise.util.Utility.getPropertiesFromFile(Utility.java:56)
    at com.sun.enterprise.iiop.J2EEInitializer.init(J2EEInitializer.java:57)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
    at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
    at java.lang.Class.newInstance0(Class.java:355)
    at java.lang.Class.newInstance(Class.java:308)
    at com.sun.corba.ee.internal.Interceptors.PIORB.registerORBInitializers(PIORB.java:373)
    at com.sun.corba.ee.internal.Interceptors.PIORB.parseProperties(PIORB.java:343)
    at com.sun.corba.ee.internal.TxPOA.TxPIORB.parseProperties(TxPIORB.java:116)
    at com.sun.corba.ee.internal.corba.ORB.set_parameters(ORB.java:449)
    at com.sun.corba.ee.internal.POA.POAORB.set_parameters(POAORB.java:152)
    at com.sun.corba.ee.internal.Interceptors.PIORB.set_parameters(PIORB.java:332)
    at org.omg.CORBA.ORB.init(ORB.java:337)
    at com.sun.enterprise.util.ORBManager.createORB(ORBManager.java:140)
    at com.sun.enterprise.util.ORBManager.init(ORBManager.java:69)
    at com.sun.enterprise.naming.SerialInitContextFactory.init(SerialInitContextFactory.java:35)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
    at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
    at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
    at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
    at java.lang.Class.newInstance0(Class.java:355)
    at java.lang.Class.newInstance(Class.java:308)
    at javax.naming.spi.NamingManager.getInitialContext(NamingManager.java:654)
    at javax.naming.InitialContext.getDefaultInitCtx(InitialContext.java:288)
    at javax.naming.InitialContext.init(InitialContext.java:223)
    at javax.naming.InitialContext.init(InitialContext.java:175)
    at org.apache.solr.core.SolrResourceLoader.locateSolrHome(SolrResourceLoader.java:593)
    at org.apache.solr.core.CoreContainer.init(CoreContainer.java:82)
    at com.norvar.test.samples.SimpleMain.getEmbeddedSolrServer(SimpleMain.java:151)
    at com.norvar.test.samples.SimpleMain.getSolrServer(SimpleMain.java:143)
    at com.norvar.test.samples.CUVMain.run(CUVMain.java:20)
    at 

Re: a weird error of embedded server initiaizationl

2011-09-17 Thread Chris Hostetter


: to test the embedded server for solr.  There is a weird error. It throws 
: from   CoreContainer container = new CoreContainer();


these errors aren't related to Solr ... they seem to be from running in a 
J2EE/EJB setup that is misconfigured so it can't check security settings 
for the app it's running.


The first two are when Solr is trying to ask the JVM for JNDI Config...

: java.io.FileNotFoundException: C:\intelli\config\security.properties (The 
system cannot find the path specified)
...
:     at javax.naming.InitialContext.init(InitialContext.java:175)
:     at 
org.apache.solr.core.SolrResourceLoader.locateSolrHome(SolrResourceLoader.java:593)
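
As a rough illustration of why a plain Java program can end up in JNDI code at 
all, the lookup order can be sketched as below. This is not Solr's source: the 
class name SolrHomeLookupSketch is hypothetical, and the order shown (JNDI entry 
java:comp/env/solr/home, then the solr.solr.home system property, then the 
default "solr/") is an assumption based on how Solr releases of this era are 
understood to behave. The point is that creating the InitialContext hands 
control to whatever JNDI provider is on the classpath.

```java
import javax.naming.Context;
import javax.naming.InitialContext;
import javax.naming.NamingException;

// Hypothetical sketch, not Solr's own code.
final class SolrHomeLookupSketch {

    static String locate() {
        // 1. Ask JNDI first. This step is where a J2EE client library on the
        //    classpath can start reading its own property files.
        try {
            Context ctx = new InitialContext();
            Object value = ctx.lookup("java:comp/env/solr/home");
            if (value != null) {
                return value.toString();
            }
        } catch (NamingException | RuntimeException e) {
            // No usable JNDI entry: fall through to the next source.
        }

        // 2. Fall back to the system property.
        String fromProperty = System.getProperty("solr.solr.home");
        if (fromProperty != null) {
            return fromProperty;
        }

        // 3. Last resort: a directory relative to the working directory.
        return "solr/";
    }

    public static void main(String[] args) {
        System.out.println("Solr home would be: " + locate());
    }
}
```

If that assumption holds, the FileNotFoundException messages come from the JNDI 
provider found on the classpath rather than from Solr itself.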

This one just seems to be a warning, but it's also about a file that 
Google says is related to EJB/J2EE stuff (Solr never asks for it)


: Warning: unable to read transaction.interoperability config property

This last one ... well it could be anything.  Maybe just some missing 
jars? ...


: Exception in thread main java.lang.NoClassDefFoundError: 
org/apache/lucene/index/IndexReader


-Hoss

Re: a weird error of embedded server initiaizationl

2011-09-17 Thread Xue-Feng Yang
Thanks for the reply. 


However, my test project is a pure Java project, so I still don't understand 
how it got mixed up with J2EE stuff. In fact, the same code works with 
CommonsHttpSolrServer without any of the errors the embedded one produces. The 
code for obtaining the SolrServer is as follows:


        File home = new File(url);
        File f = new File(home, "solr.xml");
        CoreContainer container = new CoreContainer();
        container.load(url, f);
        EmbeddedSolrServer server = new EmbeddedSolrServer(container, coreName);
 
which is copied from http://wiki.apache.org/solr/Solrj. The error is thrown at 
the line CoreContainer container = new CoreContainer();

Surprisingly, when I changed the code to

        String solrHome = url + coreName;
        String dataDir = solrHome + "/data";
        CoreContainer coreContainer = new CoreContainer(solrHome);
        SolrConfig solrConfig = new SolrConfig(solrHome, "solrconfig.xml", null);
        CoreDescriptor coreDescriptor = new CoreDescriptor(coreContainer, coreName, solrHome);
        SolrCore solrCore = new SolrCore(coreName, dataDir, solrConfig, null, coreDescriptor);
        coreContainer.register(solrCore, false);
        EmbeddedSolrServer server = new EmbeddedSolrServer(coreContainer, coreName);


I got another weird error:

Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/lucene/util/Version
    at org.apache.solr.core.SolrConfig.init(SolrConfig.java:133)
    at org.apache.solr.core.SolrConfig.init(SolrConfig.java:119)
    at 
com.norvar.test.samples.SimpleMain.getEmbeddedSolrServer(SimpleMain.java:176)
    at com.norvar.test.samples.SimpleMain.getSolrServer(SimpleMain.java:146)
    at com.norvar.test.samples.CUVMain.run(CUVMain.java:21)
    at com.norvar.test.samples.CUVMain.main(CUVMain.java:15)
Caused by: java.lang.ClassNotFoundException: org.apache.lucene.util.Version
    at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
    at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
    ... 6 more

The error comes from the line  solrConfig = new SolrConfig(solrHome, 
"solrconfig.xml", null); 

In fact, there are no Lucene jars in the Solr package, and I don't know how 
Solr relates to Lucene. Should I download Lucene and add it to my test 
project?
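
One way to narrow this down is to check which of the classes named in the 
traces are actually visible to the application class loader. The sketch below 
is only an illustration: ClasspathCheck is a hypothetical helper, and the two 
class names in main are simply the ones from the NoClassDefFoundError messages 
above.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: reports which classes cannot be loaded from the classpath.
final class ClasspathCheck {

    static List<String> missing(String... classNames) {
        List<String> notFound = new ArrayList<String>();
        ClassLoader loader = ClasspathCheck.class.getClassLoader();
        for (String name : classNames) {
            try {
                // initialize=false: only locate the class, do not run static initializers
                Class.forName(name, false, loader);
            } catch (ClassNotFoundException | LinkageError e) {
                notFound.add(name);
            }
        }
        return notFound;
    }

    public static void main(String[] args) {
        System.out.println("Missing: " + missing(
                "org.apache.lucene.util.Version",
                "org.apache.lucene.index.IndexReader"));
    }
}
```

Any name reported as missing points at a jar that still has to be added to the 
project's classpath.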


Finally, in this case there are no J2EE errors.







DIH error when nested db datasource and file data source

2011-09-17 Thread abhayd
Hi,
I have a requirement where I fetch some data from the database and, based on
that data, pull details from an XML file to index into Solr.

When I try to import, I get the following error. 
--
SEVERE: Exception while processing: topic_tree document :
SolrInputDocument[{topic_id=topic_id(1.0)={934},
category_level_1=category_level_1(1.0)={Internet
Services}}]:org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to execute query:
C:\Projects\att\solr\catalogSOLRSearch.ear\SOLR-HOME\live_meta.xml
Processing Document # 1
at
org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72)
at
org.apache.solr.handler.dataimport.JdbcDataSource$ResultSetIterator.init(JdbcDataSource.java:252)
at
org.apache.solr.handler.dataimport.JdbcDataSource.getData(JdbcDataSource.java:209)
at
org.apache.solr.handler.dataimport.JdbcDataSource.getData(JdbcDataSource.java:38)
at
org.apache.solr.handler.dataimport.XPathEntityProcessor.initQuery(XPathEntityProcessor.java:283)
at
org.apache.solr.handler.dataimport.XPathEntityProcessor.fetchNextRow(XPathEntityProcessor.java:224)
at
org.apache.solr.handler.dataimport.XPathEntityProcessor.nextRow(XPathEntityProcessor.java:201)
at
org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:237)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:594)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:620)
at
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:620)
at
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:266)
at
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:185)
at
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:358)
at
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:426)
at
org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:407)

---

Here is my DIH config file:
<dataConfig>
  <dataSource name="esupport_db"
              type="JdbcDataSource"
              driver="oracle.jdbc.driver.OracleDriver"
              url="x"
              user=""
              password=""
              convertType="true" />

  <dataSource type="FileDataSource" name="video_datasource" />

  <document>
    <entity name="topic_tree" datasource="esupport_db"
            query="SELECT topic_id, parent_id,
                   REGEXP_SUBSTR (SYS_CONNECT_BY_PATH (display_name, ';'), '[^;]+', 1, 1) AS category_level_1,
                   FROM src_topic
                   START WITH parent_id = 933
                   CONNECT BY parent_id = PRIOR topic_id"
            deltaQuery=""
            pk="TOPIC_ID">
      <field column="topic_id" name="topic_id" />
      <field column="category_level_1" name="CATEGORY_LEVEL_1" />

      <entity name="f" processor="FileListEntityProcessor"
              baseDir="${solr.solr.home}" fileName=".xml"
              recursive="false" rootEntity="true"
              dataSource="video_datasource">
        <entity name="x" processor="XPathEntityProcessor"
                forEach="/gvpVideoMetaData/mediaItem[@media_id='${topic_tree.topic_id}']"
                url="${f.fileAbsolutePath}">
          <field column="media_details"
                 xpath="/gvpVideoMetaData/mediaItem/media_details" />
        </entity>
      </entity>
    </entity>
  </document>
</dataConfig>

--
View this message in context: 
http://lucene.472066.n3.nabble.com/DIH-error-when-nested-db-datasource-and-file-data-source-tp3345664p3345664.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: DIH error when nested db datasource and file data source

2011-09-17 Thread Gora Mohanty
On Sun, Sep 18, 2011 at 10:35 AM, abhayd ajdabhol...@hotmail.com wrote:
 hi
 I have a requirement where i fetch some data from db , and based on db data
 i pull details from XML file to index solr.

 When i try to import it gives me following error.
 --
 SEVERE: Exception while processing: topic_tree document :
 SolrInputDocument[{topic_id=topic_id(1.0)={934},
 category_level_1=category_level_1(1.0)={Internet
 Services}}]:org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to execute query:

^^^

This often indicates a problem with the query syntax. The first thing to
check would be to run exactly the same SELECT directly against the database.
One thing that I see in your query is "...AS category_level_1, FROM src_topic...",
i.e., there is a spurious comma before the FROM.

Regards,
Gora


Re: Miscellaneous DIH related questions

2011-09-17 Thread Gora Mohanty
On Sun, Sep 18, 2011 at 4:50 AM, Pulkit Singhal pulkitsing...@gmail.com wrote:
[...]
 3) Is there some way to configure the datasource to retry 3 times or
 something like that? I have increased the values for connectionTimeout
 and readTimeout but it doesn't help when sometimes the server simply
 denies the request due to heavy load. I need to be able to retry at
 those times. The onError has only the abort,skip,continue options, none
 of which really let me retry anything.
[...]

I don't think that there is a built-in feature for this, though it sounds like
it would be useful.

I presume that you are scheduling your imports through cron, or
something like that. One possibility then would be to have the script
check the status of the import, and retry if needed.
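
A minimal sketch of that retry idea, written in Java rather than as a shell
script: ImportRetry and runWithRetry are hypothetical names, and the attempt
passed in is assumed to trigger the import and report whether it succeeded
(for example by issuing the DIH full-import command and then polling the
status command). How that check is done is left to the caller.

```java
import java.util.concurrent.Callable;

// Hypothetical helper: re-runs an attempt until it succeeds or attempts run out.
final class ImportRetry {

    static boolean runWithRetry(Callable<Boolean> attempt, int maxAttempts, long delayMillis) {
        for (int i = 1; i <= maxAttempts; i++) {
            try {
                if (Boolean.TRUE.equals(attempt.call())) {
                    return true; // the import reported success
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt(); // preserve the interrupt and stop
                return false;
            } catch (Exception e) {
                // Treat any other failure (e.g. request refused under load) as a failed attempt.
            }
            if (i < maxAttempts) {
                try {
                    Thread.sleep(delayMillis); // wait before the next attempt
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return false;
                }
            }
        }
        return false; // every attempt failed
    }
}
```

The same loop could live in the cron script itself; the Java form is only one
way to express it.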

Regards,
Gora