[jira] [Created] (SOLR-5276) highlighter working using stemmed tokens from another field and text from another

2013-09-26 Thread Maciej Lizewski (JIRA)
Maciej Lizewski created SOLR-5276:
-

 Summary: highlighter working using stemmed tokens from another 
field and text from another
 Key: SOLR-5276
 URL: https://issues.apache.org/jira/browse/SOLR-5276
 Project: Solr
  Issue Type: Improvement
  Components: highlighter
Reporter: Maciej Lizewski
Priority: Minor


The case is like this:
I have 'content' field with content extracted with tika and several fields for 
language versions (like content_pl, content_en, content_es, content_ru, etc). 
I also use custom langid component which can copy 'content' to serveral 
content_* fields if it detects more than one language (so those parts are 
properly stemmed in every language present in text).

Now to use highlighter in such scenario I need to store all those language 
fields even if their contents is always same as the one in 'content' field.

Would be nice if I could configure language specific fields to be not stored, 
and configure highlighter to take tokens positions from those fields and apply 
them to text in 'content' field...
In other words - to say: take tokens from 'content_pl', and prepare highlight 
based on text in 'content' field.
It could be administrator responsibility to guarantee that mapped fields have 
same content.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-4841) DetectedLanguage constructor should be public

2013-05-20 Thread Maciej Lizewski (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-4841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13662013#comment-13662013
 ] 

Maciej Lizewski commented on SOLR-4841:
---

I am OK with that. My point was that it is impossible to extend 
LanguageIdentifierUpdateProcessor with own custom implementation, because 
constructor does not have any visibility modifier, so it is accessible only 
from same package.

> DetectedLanguage constructor should be public
> -
>
> Key: SOLR-4841
> URL: https://issues.apache.org/jira/browse/SOLR-4841
> Project: Solr
>  Issue Type: Bug
>Reporter: Maciej Lizewski
>
> org.apache.solr.update.processor.DetectedLanguage constructor should be 
> public. Without that it is impossible to create owne class extending 
> LanguageIdentifierUpdateProcessor.
> LanguageIdentifierUpdateProcessor base class needs detectLanguage(String 
> content) function to return listy of DetectedLanguage's but you cannot create 
> such objects because constructor is accessible only in same package 
> (org.apache.solr.update.processor).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-4841) DetectedLanguage constructor should be public

2013-05-20 Thread Maciej Lizewski (JIRA)
Maciej Lizewski created SOLR-4841:
-

 Summary: DetectedLanguage constructor should be public
 Key: SOLR-4841
 URL: https://issues.apache.org/jira/browse/SOLR-4841
 Project: Solr
  Issue Type: Bug
Reporter: Maciej Lizewski


org.apache.solr.update.processor.DetectedLanguage constructor should be public. 
Without that it is impossible to create owne class extending 
LanguageIdentifierUpdateProcessor.

LanguageIdentifierUpdateProcessor base class needs detectLanguage(String 
content) function to return listy of DetectedLanguage's but you cannot create 
such objects because constructor is accessible only in same package 
(org.apache.solr.update.processor).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-2899) Add OpenNLP Analysis capabilities as a module

2013-04-17 Thread Maciej Lizewski (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-2899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13633907#comment-13633907
 ] 

Maciej Lizewski commented on LUCENE-2899:
-

why don't you prepare this as separate project that produces some jars and 
config files with instructions on how to add it in solr configuration instead 
of publishing all changes as patches to solr sources? I am interested in doing 
some tests with your library but setting all things up seems quite complicated 
and hard to maintain in future... it is just a thought.

> Add OpenNLP Analysis capabilities as a module
> -
>
> Key: LUCENE-2899
> URL: https://issues.apache.org/jira/browse/LUCENE-2899
> Project: Lucene - Core
>  Issue Type: New Feature
>  Components: modules/analysis
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
>Priority: Minor
> Fix For: 4.3
>
> Attachments: LUCENE-2899.patch, LUCENE-2899.patch, LUCENE-2899.patch, 
> LUCENE-2899.patch, LUCENE-2899.patch, LUCENE-2899.patch, 
> LUCENE-2899-RJN.patch, OpenNLPFilter.java, OpenNLPTokenizer.java, 
> opennlp_trunk.patch
>
>
> Now that OpenNLP is an ASF project and has a nice license, it would be nice 
> to have a submodule (under analysis) that exposed capabilities for it. Drew 
> Farris, Tom Morton and I have code that does:
> * Sentence Detection as a Tokenizer (could also be a TokenFilter, although it 
> would have to change slightly to buffer tokens)
> * NamedEntity recognition as a TokenFilter
> We are also planning a Tokenizer/TokenFilter that can put parts of speech as 
> either payloads (PartOfSpeechAttribute?) on a token or at the same position.
> I'd propose it go under:
> modules/analysis/opennlp

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Resolved] (SOLR-4562) core selector not working in Chrome

2013-03-12 Thread Maciej Lizewski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maciej Lizewski resolved SOLR-4562.
---

Resolution: Not A Problem

> core selector not working in Chrome
> ---
>
> Key: SOLR-4562
> URL: https://issues.apache.org/jira/browse/SOLR-4562
> Project: Solr
>  Issue Type: Bug
>  Components: web gui
>Affects Versions: 4.2
>Reporter: Maciej Lizewski
> Attachments: Przechwytywanie.PNG
>
>
> after fresh installation of Solr 4.2 on windows 7 64bit
> I do not see any cores in Google Chrome to select in combobox. Also - when 
> trying to prepare URI by hand - I see error that there is no such core. In 
> FireFox - there is default 'collection1' core visible without problems.
> My Chrome version: 26.0.1410.28 beta-m
> I cannot se any errors in JS console...

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-4562) core selector not working in Chrome

2013-03-12 Thread Maciej Lizewski (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13600352#comment-13600352
 ] 

Maciej Lizewski commented on SOLR-4562:
---

You were right. After clearing browser cache everything is working ok.
Sorry for duplicate issue - I search for something similar but did not found 
that one.

funny thing is that earlier I tried refreshing page with SHIFT which *should* 
reload all resources from server... :)

> core selector not working in Chrome
> ---
>
> Key: SOLR-4562
> URL: https://issues.apache.org/jira/browse/SOLR-4562
> Project: Solr
>  Issue Type: Bug
>  Components: web gui
>Affects Versions: 4.2
>Reporter: Maciej Lizewski
> Attachments: Przechwytywanie.PNG
>
>
> after fresh installation of Solr 4.2 on windows 7 64bit
> I do not see any cores in Google Chrome to select in combobox. Also - when 
> trying to prepare URI by hand - I see error that there is no such core. In 
> FireFox - there is default 'collection1' core visible without problems.
> My Chrome version: 26.0.1410.28 beta-m
> I cannot se any errors in JS console...

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-4562) core selector not working in Chrome

2013-03-12 Thread Maciej Lizewski (JIRA)
Maciej Lizewski created SOLR-4562:
-

 Summary: core selector not working in Chrome
 Key: SOLR-4562
 URL: https://issues.apache.org/jira/browse/SOLR-4562
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.2
Reporter: Maciej Lizewski


after fresh installation of Solr 4.2 on windows 7 64bit
I do not see any cores in Google Chrome to select in combobox. Also - when 
trying to prepare URI by hand - I see error that there is no such core. In 
FireFox - there is default 'collection1' core visible without problems.

My Chrome version: 26.0.1410.28 beta-m
I cannot se any errors in JS console...

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-4562) core selector not working in Chrome

2013-03-12 Thread Maciej Lizewski (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Maciej Lizewski updated SOLR-4562:
--

Attachment: Przechwytywanie.PNG

> core selector not working in Chrome
> ---
>
> Key: SOLR-4562
> URL: https://issues.apache.org/jira/browse/SOLR-4562
> Project: Solr
>  Issue Type: Bug
>Affects Versions: 4.2
>Reporter: Maciej Lizewski
> Attachments: Przechwytywanie.PNG
>
>
> after fresh installation of Solr 4.2 on windows 7 64bit
> I do not see any cores in Google Chrome to select in combobox. Also - when 
> trying to prepare URI by hand - I see error that there is no such core. In 
> FireFox - there is default 'collection1' core visible without problems.
> My Chrome version: 26.0.1410.28 beta-m
> I cannot se any errors in JS console...

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Comment Edited] (SOLR-2416) Solr Cell fails to index Zip file contents

2013-01-08 Thread Maciej Lizewski (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13546719#comment-13546719
 ] 

Maciej Lizewski edited comment on SOLR-2416 at 1/8/13 9:10 AM:
---

I think this is really needed feature. Also in earlier versions of Solr it 
worked different than now: grepping code of 
org.apache.solr.handler.extraction.ExtractingDocumentLoader from version 
1.4.0.1 show that context was not created and instead autoDetectParser::parse 
function was called with 3 parameters (without context) and this caused context 
to be automatically created with Parser=autoDetectParser...
this is backward compatibility violation after adding PasswordProvider. Also 
comments in current code suggest that someone was not sure about consequences 
of such change: "TODO: should we design a way to pass in parse context?"

the patch is already attached as I see...

  was (Author: redguy666):
I think this is really needed feature. Also in earlier versions of Solr it 
worked different than now: grepping code of 
org.apache.solr.handler.extraction.ExtractingDocumentLoader from version 
1.4.0.1 show that context was not created and instead autoDetectParser::parse 
function was called with 3 parameters (without context) and this caused context 
to be automatically created with Parser=autoDetectParser...
this is backward compatibility violation after adding PasswordProvider. Also 
comments in current code suggest that someone was not sure about consequences 
of such change: "TODO: should we design a way to pass in parse context?"

the patch is already attached as I see...

anyway - does anyone have this handler refactored as external jar so it can be 
added to running solr instance without changing and recompiling core libs?
  
> Solr Cell fails to index Zip file contents
> --
>
> Key: SOLR-2416
> URL: https://issues.apache.org/jira/browse/SOLR-2416
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - DataImportHandler, contrib - Solr Cell (Tika 
> extraction)
>Affects Versions: 1.4.1
>Reporter: Jayendra Patil
> Fix For: 5.0
>
> Attachments: SOLR-2416_ExtractingDocumentLoader.patch
>
>
> Working with the latest Solr Trunk code and seems the Tika handlers for Solr 
> Cell (ExtractingDocumentLoader.java) and Data Import handler 
> (TikaEntityProcessor.java) fails to index the zip file contents again.
> It just indexes the file names again.
> This issue was addressed some time back, late last year, but seems to have 
> reappeared with the latest code.
> Jira for the Data Import handler part with the patch and the testcase - 
> https://issues.apache.org/jira/browse/SOLR-2332.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-2416) Solr Cell fails to index Zip file contents

2013-01-08 Thread Maciej Lizewski (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-2416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13546719#comment-13546719
 ] 

Maciej Lizewski commented on SOLR-2416:
---

I think this is really needed feature. Also in earlier versions of Solr it 
worked different than now: grepping code of 
org.apache.solr.handler.extraction.ExtractingDocumentLoader from version 
1.4.0.1 show that context was not created and instead autoDetectParser::parse 
function was called with 3 parameters (without context) and this caused context 
to be automatically created with Parser=autoDetectParser...
this is backward compatibility violation after adding PasswordProvider. Also 
comments in current code suggest that someone was not sure about consequences 
of such change: "TODO: should we design a way to pass in parse context?"

the patch is already attached as I see...

anyway - does anyone have this handler refactored as external jar so it can be 
added to running solr instance without changing and recompiling core libs?

> Solr Cell fails to index Zip file contents
> --
>
> Key: SOLR-2416
> URL: https://issues.apache.org/jira/browse/SOLR-2416
> Project: Solr
>  Issue Type: Bug
>  Components: contrib - DataImportHandler, contrib - Solr Cell (Tika 
> extraction)
>Affects Versions: 1.4.1
>Reporter: Jayendra Patil
> Fix For: 5.0
>
> Attachments: SOLR-2416_ExtractingDocumentLoader.patch
>
>
> Working with the latest Solr Trunk code and seems the Tika handlers for Solr 
> Cell (ExtractingDocumentLoader.java) and Data Import handler 
> (TikaEntityProcessor.java) fails to index the zip file contents again.
> It just indexes the file names again.
> This issue was addressed some time back, late last year, but seems to have 
> reappeared with the latest code.
> Jira for the Data Import handler part with the patch and the testcase - 
> https://issues.apache.org/jira/browse/SOLR-2332.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org