[jira] [Updated] (SOLR-7764) Solr indexing hangs if encounters an certain XML parse error

2017-02-23 Thread Sorin Gheorghiu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sorin Gheorghiu updated SOLR-7764:
--
Labels: bluespice indexing  (was: indexing)

> Solr indexing hangs if encounters an certain XML parse error
> 
>
> Key: SOLR-7764
> URL: https://issues.apache.org/jira/browse/SOLR-7764
> Project: Solr
>  Issue Type: Bug
>  Components: query parsers
>Affects Versions: 4.7.2
> Environment: Ubuntu 12.04.5 LTS
>Reporter: Sorin Gheorghiu
>  Labels: bluespice, indexing
> Attachments: Solr_XML_parse_error_080715.txt
>
>
> BlueSpice (http://bluespice.com/) uses Solr to index documents for the 
> 'Extended search' feature.
> Solr hangs if a certain error occurs during indexing:
> 8.7.2015 15:34:26
> ERROR
> SolrCore
> org.apache.solr.common.SolrException: 
> org.apache.tika.exception.TikaException: XML parse error
> 8.7.2015 15:34:26
> ERROR
> SolrDispatchFilter
> null:org.apache.solr.common.SolrException: 
> org.apache.tika.exception.TikaException: XML parse error



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-10197) SolrException during indexing

2017-02-23 Thread Sorin Gheorghiu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-10197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sorin Gheorghiu updated SOLR-10197:
---
Attachment: BS_Solr_error_invalid_no.txt

> SolrException during indexing
> -
>
> Key: SOLR-10197
> URL: https://issues.apache.org/jira/browse/SOLR-10197
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: Server
>Affects Versions: 4.5
> Environment: Ubuntu 14.04.5 LTS
>Reporter: Sorin Gheorghiu
>  Labels: bluespice, indexing
> Attachments: BS_Solr_error_invalid_no.txt
>
>
> BlueSpice (http://bluespice.com/) uses Solr to index documents for the 
> 'Extended search' feature. Solr hangs consistently during indexing and an 
> error occurs (see attached).
> There is no error in ExtendedSearch.log, only the last indexed 
> document/wiki page:
> 22.02.2017 17:45:11
> Zu indexierende Artikel: 4205
> 1: Indexiere Wiki Seiten: 1% - WUI netz.xls
> 2: Indexiere Wiki Seiten: 1% - IndividArbanw.pdf
> ...
> 3526: Indexiere Wiki Seiten: 84% - 2007
> 3527: Indexiere Wiki Seiten: 84% - Buchdurchlaufzeit
> 3528: Indexiere Wiki Seiten: 84% - Mahnroutinen
> 3529: Indexiere Wiki Seiten: 84% - Software für Informationskompetenz
> Could you provide any indication of the error?






[jira] [Updated] (SOLR-10197) SolrException during indexing

2017-02-23 Thread Sorin Gheorghiu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-10197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sorin Gheorghiu updated SOLR-10197:
---
Labels: bluespice indexing  (was: indexing)







[jira] [Created] (SOLR-10197) SolrException during indexing

2017-02-23 Thread Sorin Gheorghiu (JIRA)
Sorin Gheorghiu created SOLR-10197:
--

 Summary: SolrException during indexing
 Key: SOLR-10197
 URL: https://issues.apache.org/jira/browse/SOLR-10197
 Project: Solr
  Issue Type: Bug
  Security Level: Public (Default Security Level. Issues are Public)
  Components: Server
Affects Versions: 4.5
 Environment: Ubuntu 14.04.5 LTS
Reporter: Sorin Gheorghiu


BlueSpice (http://bluespice.com/) uses Solr to index documents for the 
'Extended search' feature. Solr hangs consistently during indexing and an error 
occurs (see attached).

There is no error in ExtendedSearch.log, only the last indexed 
document/wiki page:

22.02.2017 17:45:11

Zu indexierende Artikel: 4205

1: Indexiere Wiki Seiten: 1% - WUI netz.xls
2: Indexiere Wiki Seiten: 1% - IndividArbanw.pdf
...
3526: Indexiere Wiki Seiten: 84% - 2007
3527: Indexiere Wiki Seiten: 84% - Buchdurchlaufzeit
3528: Indexiere Wiki Seiten: 84% - Mahnroutinen
3529: Indexiere Wiki Seiten: 84% - Software für Informationskompetenz

Could you provide any indication of the error?







[jira] [Commented] (SOLR-9487) Solr 6.x ignores field name="type" in schema.xml

2016-09-07 Thread Sorin Gheorghiu (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15470357#comment-15470357
 ] 

Sorin Gheorghiu commented on SOLR-9487:
---

Great, it worked: the field "type" showed up after I removed the 
*managed-schema* file and reloaded the core. 
I will try to create a new collection/core later. Thank you so far.
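
For readers who hit the same symptom, the workaround described above (remove managed-schema, then reload the core) can be sketched roughly as follows. This is only an illustration under assumptions: the function names, the `conf` directory argument, the base URL, and the core name `sunspot` are hypothetical and do not come from this thread. The only Solr feature relied on is the CoreAdmin `RELOAD` action.

```python
from pathlib import Path
from urllib.parse import urlencode


def remove_managed_schema(conf_dir):
    """Delete conf_dir/managed-schema if present; return True if a file was removed."""
    managed = Path(conf_dir) / "managed-schema"
    if managed.is_file():
        managed.unlink()
        return True
    return False


def reload_url(base_url, core):
    """Build the CoreAdmin RELOAD URL for the given core (hypothetical helper)."""
    query = urlencode({"action": "RELOAD", "core": core})
    return f"{base_url.rstrip('/')}/admin/cores?{query}"
```

Requesting the URL returned by `reload_url` (for example with a browser or curl) asks Solr to reload the core so that it picks up the schema again.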

> Solr 6.x ignores field name="type" in schema.xml
> 
>
> Key: SOLR-9487
> URL: https://issues.apache.org/jira/browse/SOLR-9487
> Project: Solr
>  Issue Type: Bug
>  Security Level: Public(Default Security Level. Issues are Public) 
>  Components: Schema and Analysis
>Affects Versions: 6.1
> Environment: Ubuntu 2.6.32-45-pve
>Reporter: Sorin Gheorghiu
>  Labels: rsol, sunspot
> Attachments: Solr5.0_field_type.png
>
>
> To use the Ruby Sunspot gem, a customised schema.xml should be used [1]. 
> The field "type" exists in Solr 5.0 (in the Schema Browser), but Solr 6.1 
> ignores it:
> <field name="type" ... indexed="true"/>
> As a consequence, Sunspot fails to seed data with the following error:
> RSolr::Error::Http: RSolr::Error::Http - 400 Bad Request
> Error: 'ERROR: [doc=Classification 1] unknown field \'type\'','code'=>400}}
> [1] 
> https://github.com/sunspot/sunspot/tree/master/sunspot_solr/solr/solr/configsets/sunspot/conf



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)




[jira] [Commented] (SOLR-9487) Solr 6.x ignores field name="type" in schema.xml

2016-09-07 Thread Sorin Gheorghiu (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15470325#comment-15470325
 ] 

Sorin Gheorghiu commented on SOLR-9487:
---

Did you load the customized schema.xml [1] in your sunspottest? If not, could 
you please test with it?

[1] 
https://github.com/sunspot/sunspot/blob/master/sunspot_solr/solr/solr/configsets/sunspot/conf/schema.xml







[jira] [Updated] (SOLR-9487) Solr 6.x ignores field name="type" in schema.xml

2016-09-07 Thread Sorin Gheorghiu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sorin Gheorghiu updated SOLR-9487:
--
Attachment: Solr5.0_field_type.png







[jira] [Commented] (SOLR-9487) Solr 6.x ignores field name="type" in schema.xml

2016-09-07 Thread Sorin Gheorghiu (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-9487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15470299#comment-15470299
 ] 

Sorin Gheorghiu commented on SOLR-9487:
---

I still think this is a Solr issue, as long as the field "type" does not show 
up in Solr 6.x.
I have attached a screenshot of Solr 5.0.







[jira] [Created] (SOLR-9487) Solr 6.x ignores field name="type" in schema.xml

2016-09-07 Thread Sorin Gheorghiu (JIRA)
Sorin Gheorghiu created SOLR-9487:
-

 Summary: Solr 6.x ignores field name="type" in schema.xml
 Key: SOLR-9487
 URL: https://issues.apache.org/jira/browse/SOLR-9487
 Project: Solr
  Issue Type: Bug
  Security Level: Public (Default Security Level. Issues are Public)
  Components: Schema and Analysis
Affects Versions: 6.1
 Environment: Ubuntu 2.6.32-45-pve
Reporter: Sorin Gheorghiu


To use the Ruby Sunspot gem, a customised schema.xml should be used [1]. 
The field "type" exists in Solr 5.0 (in the Schema Browser), but Solr 6.1 
ignores it.

As a consequence, Sunspot fails to seed data with the following error:

RSolr::Error::Http: RSolr::Error::Http - 400 Bad Request
Error: 'ERROR: [doc=Classification 1] unknown field \'type\'','code'=>400}}

[1] 
https://github.com/sunspot/sunspot/tree/master/sunspot_solr/solr/solr/configsets/sunspot/conf







[jira] [Commented] (SOLR-7764) Solr indexing hangs if encounters an certain XML parse error

2015-07-09 Thread Sorin Gheorghiu (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14620744#comment-14620744
 ] 

Sorin Gheorghiu commented on SOLR-7764:
---

Thank you, Tim, for your effort. As I said this morning, I don't think this 
issue is related to Tika, because the Tika error also occurs when these files 
are removed, and in that case Solr does not hang. The issue is still 
reproducible, and I noticed a number of Solr errors like:

Jul 9, 2015 12:49:00 PM org.apache.catalina.loader.WebappClassLoader checkThreadLocalMapForLeaks
SEVERE: The web application [/solr] created a ThreadLocal with key of type [org.apache.xmlbeans.impl.store.Locale$1] (value [org.apache.xmlbeans.impl.store.Locale$1@39515054]) and a value of type [java.lang.ref.SoftReference] (value [java.lang.ref.SoftReference@cb2f97d]) but failed to remove it when the web application was stopped. Threads are going to be renewed over time to try and avoid a probable memory leak.
Jul 9, 2015 12:49:00 PM org.apache.catalina.loader.WebappClassLoader checkThreadLocalMapForLeaks
SEVERE: The web application [/solr] created a ThreadLocal with key of type [org.apache.solr.schema.DateField.ThreadLocalDateFormat] (value [org.apache.solr.schema.DateField$ThreadLocalDateFormat@4f81bf75]) and a value of type [org.apache.solr.schema.DateField.ISO8601CanonicalDateFormat] (value [org.apache.solr.schema.DateField$ISO8601CanonicalDateFormat@6b2ed43a]) but failed to remove it when the web application was stopped. Threads are going to be renewed over time to try and avoid a probable memory leak.
Jul 9, 2015 12:49:00 PM org.apache.catalina.loader.WebappClassLoader checkThreadLocalMapForLeaks
SEVERE: The web application [/solr] created a ThreadLocal with key of type [org.apache.xmlbeans.impl.store.CharUtil$1] (value [org.apache.xmlbeans.impl.store.CharUtil$1@4f40c31a]) and a value of type [java.lang.ref.SoftReference] (value [java.lang.ref.SoftReference@3a19840e]) but failed to remove it when the web application was stopped. Threads are going to be renewed over time to try and avoid a probable memory leak.

I can attach the whole log file on request.








[jira] [Commented] (SOLR-7764) Solr indexing hangs if encounters an certain XML parse error

2015-07-09 Thread Sorin Gheorghiu (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14620422#comment-14620422
 ] 

Sorin Gheorghiu commented on SOLR-7764:
---

After more testing, it turns out that this is not a Tika- or XML-related issue, 
and the stack trace is NOT related to the hang.

1) I removed the XLSX file from the index list (actually, I deleted it 
temporarily in MediaWiki): the Tika error still occurred, but the indexing did 
not hang at this point. It seems that no error is reported when the indexing 
hangs permanently on this file (!).

2) A second XLSX file also causes a hang, but this time with the following error:

ERROR
PDCIDFont
Error: Could not parse predefined CMAP file for 'é.5s¢-á.?null³!null¯-UCS2'

After I removed both files, the indexing completed successfully.

As you guessed, the content of the files is private; I am allowed to share 
them, but not to post them publicly. 
Could you please provide an email address so that I can send them to you 
directly?

This issue is related to the newer Solr version: the same files were indexed 
properly before the upgrade from 4.5.0 to 4.7.2.

3) It is worth mentioning another difference between the versions. 
A long time ago, the docx and xlsx files were migrated without the proper 
content type, so they were recognized as ZIP files (that's fine).

In 4.5.0, ExtendedSearch.log reports:
3940: Indexiere hochgeladene Dateien: 8% - Filetype not allowed: zip (AtyponJR1_2011.xlsx)

In 4.7.2, ExtendedSearchIndex.log (different log name) shows that the same file 
is no longer recognized as a ZIP archive, although it should be, since the 
files are identical:
4117: Indexiere hochgeladene Dateien: 9% - AtyponJR1_2011.xlsx
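
The difference described in 3) fits how these formats are built: an .xlsx or .docx file is itself a ZIP archive, so a detector can report either "zip" or the specific Office type, depending on how deeply it inspects the container. The following Python sketch illustrates that distinction only; it is not the detection logic of Solr, Tika, or BlueSpice, and the function name is hypothetical.

```python
import io
import zipfile


def classify_container(data):
    """Return 'ooxml', 'zip', or 'other' for the given file bytes."""
    if not zipfile.is_zipfile(io.BytesIO(data)):
        return "other"
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        names = set(archive.namelist())
    # Office Open XML packages (docx, xlsx, ...) carry a
    # [Content_Types].xml part at the root of the archive.
    return "ooxml" if "[Content_Types].xml" in names else "zip"
```

A shallow check that stops at the ZIP signature would classify both kinds of file as "zip", which matches the 4.5.0 log line above; a deeper check that looks inside the archive would treat the .xlsx as a spreadsheet.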








[jira] [Updated] (SOLR-7764) Solr indexing hangs if encounters an certain XML parse error

2015-07-08 Thread Sorin Gheorghiu (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sorin Gheorghiu updated SOLR-7764:
--
Attachment: Solr_XML_parse_error_080715.txt

Error stack trace attached.







[jira] [Created] (SOLR-7764) Solr indexing hangs if encounters an certain XML parse error

2015-07-08 Thread Sorin Gheorghiu (JIRA)
Sorin Gheorghiu created SOLR-7764:
-

 Summary: Solr indexing hangs if encounters an certain XML parse 
error
 Key: SOLR-7764
 URL: https://issues.apache.org/jira/browse/SOLR-7764
 Project: Solr
  Issue Type: Bug
  Components: query parsers
Affects Versions: 4.7.2
 Environment: Ubuntu 12.04.5 LTS
Reporter: Sorin Gheorghiu


BlueSpice (http://bluespice.com/) uses Solr to index documents for the 
'Extended search' feature.

Solr hangs if a certain error occurs during indexing:

8.7.2015 15:34:26
ERROR
SolrCore
org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: XML parse error

8.7.2015 15:34:26
ERROR
SolrDispatchFilter
null:org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: XML parse error







[jira] [Commented] (SOLR-7764) Solr indexing hangs if encounters an certain XML parse error

2015-07-08 Thread Sorin Gheorghiu (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14618935#comment-14618935
 ] 

Sorin Gheorghiu commented on SOLR-7764:
---

Yes, Solr can index other documents, and it really does hang on an XML file, so 
I have to kill the related processes:
/bin/bash /opt/lucene-search-2.1.3/lsearchd
java -Djava.rmi.server.codebase=file:///opt/lucene-search-2.1.3/LuceneSearch.jar -Djava.rmi.server.hostname=WikiTestVZ -jar /opt/lucene-search-2.1.3/LuceneSearch.jar

The XML file is not corrupted, because it can be opened with Excel (but it 
probably contains characters that the XML parser does not expect).
I would expect Solr to skip any file for which such an exception occurs during 
indexing and continue with the next files, but instead it hangs.
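
The expected behaviour (skip a failing file and carry on) can be sketched as a plain loop. This is a generic Python illustration, not code from Solr or BlueSpice; `index_all` and the `index_one` callback are hypothetical names standing in for whatever actually submits a document.

```python
def index_all(paths, index_one):
    """Index each path with index_one; collect failures instead of stopping."""
    indexed, skipped = [], []
    for path in paths:
        try:
            index_one(path)
        except Exception as exc:
            # A parse error in one file is recorded and the run continues.
            skipped.append((path, str(exc)))
        else:
            indexed.append(path)
    return indexed, skipped
```

This only covers failures that surface as exceptions. A true hang never returns control to the loop, so it would additionally need a timeout around each call.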

P.S. Sorry, next time I will use the users' list first 
(solr-u...@lucene.apache.org, right?)





