[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-10-10 Thread Commented
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13124366#comment-13124366 ] Jan Høydahl commented on SOLR-1979: --- Fixed overview.html in branch > Cre

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-10-10 Thread T Jake Luciani (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13124343#comment-13124343 ] T Jake Luciani commented on SOLR-1979: -- build on 3x branch still failing because solr

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-10-05 Thread Mark Miller (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13121569#comment-13121569 ] Mark Miller commented on SOLR-1979: --- Nice! Great feature to get in - thanks guys.

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-19 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13107867#comment-13107867 ] Jan Høydahl commented on SOLR-1979: --- Question: Since I plan to commit this for both 3.x a

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102723#comment-13102723 ] Jan Høydahl commented on SOLR-1979: --- Any changes you'd like before committing this? Lance

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102646#comment-13102646 ] Jan Høydahl commented on SOLR-1979: --- Yep, it will skip detection if the field defined in

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102578#comment-13102578 ] Markus Jelsma commented on SOLR-1979: - Hi. This is not what i understood from reading t

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102573#comment-13102573 ] Jan Høydahl commented on SOLR-1979: --- @Markus: Sure. If you put your pre-known language co

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102520#comment-13102520 ] Markus Jelsma commented on SOLR-1979: - Hi Jan, Can we also use the mapping feature wit

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-11 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102374#comment-13102374 ] Jan Høydahl commented on SOLR-1979: --- An updated documentation of the Processor is now at

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-09 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101612#comment-13101612 ] Lance Norskog commented on SOLR-1979: - I'm impressed! This is a lot of work and empiric

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-08-02 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13076259#comment-13076259 ] Jan Høydahl commented on SOLR-1979: --- This has been tested on a real, several hundred thou

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-06-22 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13053227#comment-13053227 ] Jan Høydahl commented on SOLR-1979: --- One question regarding the JUnit test: I now use {co

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-06-03 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13043448#comment-13043448 ] Jan Høydahl commented on SOLR-1979: --- Continuing on this implementing the ideas above...

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-14 Thread Tommaso Teofili (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12971400#action_12971400 ] Tommaso Teofili commented on SOLR-1979: --- bq. Keep it basic in first version. Allow for

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-14 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12971338#action_12971338 ] Jan Høydahl commented on SOLR-1979: --- {quote} Jan, do you have any updates to the patch? I'

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12971322#action_12971322 ] Grant Ingersoll commented on SOLR-1979: --- bq. What about leveraging payloads (we can ou

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-08 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12969404#action_12969404 ] Erik Hatcher commented on SOLR-1979: What about leveraging payloads (we can output term|

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12969140#action_12969140 ] Lance Norskog commented on SOLR-1979: - About Thai: there is a lot of South and East Asia

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12969138#action_12969138 ] Lance Norskog commented on SOLR-1979: - A use case for multi-language fields: PDFs with d

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968827#action_12968827 ] Robert Muir commented on SOLR-1979: --- bq. We also need to detect whether a language is part

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968820#action_12968820 ] Jan Høydahl commented on SOLR-1979: --- >>I have a plan to add profiles for the Norwegian and

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968813#action_12968813 ] Robert Muir commented on SOLR-1979: --- bq. I have a plan to add profiles for the Norwegian a

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968806#action_12968806 ] Jan Høydahl commented on SOLR-1979: --- Discussion on the process for adding language profile

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968786#action_12968786 ] Robert Muir commented on SOLR-1979: --- bq. Kind of random that Thai is thrown in there! I a

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968777#action_12968777 ] Grant Ingersoll commented on SOLR-1979: --- Sorry, you are right. See http://svn.apache

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968760#action_12968760 ] Robert Muir commented on SOLR-1979: --- bq. Have a look at http://tika.apache.org/0.8/detecti

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968757#action_12968757 ] Grant Ingersoll commented on SOLR-1979: --- Have a look at http://tika.apache.org/0.8/det

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968753#action_12968753 ] Robert Muir commented on SOLR-1979: --- bq. I also think we need to get together and add a bu

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968748#action_12968748 ] Grant Ingersoll commented on SOLR-1979: --- I'm going to be out of pocket for the next we

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Tommaso Teofili (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968633#action_12968633 ] Tommaso Teofili commented on SOLR-1979: --- bq. However, have you considered extending th

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968627#action_12968627 ] Jan Høydahl commented on SOLR-1979: --- Allow for both a "language" field and a "languages" (

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968582#action_12968582 ] Erik Hatcher commented on SOLR-1979: Oh, and don't get me wrong, I get the multivalued l

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968576#action_12968576 ] Erik Hatcher commented on SOLR-1979: If a list of fields (by name) is mapped into a corr

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968528#action_12968528 ] Grant Ingersoll commented on SOLR-1979: --- bq. So for all unmapped languages, you may wa

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968445#action_12968445 ] Yonik Seeley commented on SOLR-1979: bq. In skimming the current patch, it looks like fi

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967214#action_12967214 ] Grant Ingersoll commented on SOLR-1979: --- bq. There should be a way to output the langu

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967211#action_12967211 ] Jan Høydahl commented on SOLR-1979: --- @Grant: "I dropped the outputField setting and a numb

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967204#action_12967204 ] Yonik Seeley commented on SOLR-1979: bq. Yonik, I wasn't planning on relying on dynamic

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967201#action_12967201 ] Robert Muir commented on SOLR-1979: --- bq. Both also rely on those fields existing. I don't

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967191#action_12967191 ] Robert Muir commented on SOLR-1979: --- bq. Agreed.The only thing we are doing now is using t

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967186#action_12967186 ] Grant Ingersoll commented on SOLR-1979: --- bq. but in solr, when designing up front, i

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967076#action_12967076 ] Robert Muir commented on SOLR-1979: --- {quote} It makes sense to allow for detecting languag

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967048#action_12967048 ] Grant Ingersoll commented on SOLR-1979: --- Note, the patch still needs more tests and ne

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967046#action_12967046 ] Grant Ingersoll commented on SOLR-1979: --- bq. @Grant: I actually planned to do the regE

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967032#action_12967032 ] Jan Høydahl commented on SOLR-1979: --- @Robert: Yes, there must be a way to tell whether or

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967019#action_12967019 ] Robert Muir commented on SOLR-1979: --- bq. Yeah, that makes sense, however, I believe Tika r

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967016#action_12967016 ] Yonik Seeley commented on SOLR-1979: bq. The new field is made by concatenating the orig

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967011#action_12967011 ] Grant Ingersoll commented on SOLR-1979: --- Another thought, here, is that, over time, th

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967010#action_12967010 ] Grant Ingersoll commented on SOLR-1979: --- bq. I would like to see RFC 3066 instead Yea

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966978#action_12966978 ] Robert Muir commented on SOLR-1979: --- We really need to not be using ISO 639-1 here. For

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966972#action_12966972 ] Robert Muir commented on SOLR-1979: --- bq. cause that distance measure is kind of an interna

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966970#action_12966970 ] Jan Høydahl commented on SOLR-1979: --- The idField input parameter is just used for decent l

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966964#action_12966964 ] Jan Høydahl commented on SOLR-1979: --- Simply allowing to set the threshold for isReasonably

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966955#action_12966955 ] Grant Ingersoll commented on SOLR-1979: --- See http://wiki.apache.org/solr/LanguageDetec

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-08-17 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899568#action_12899568 ] Jan Høydahl commented on SOLR-1979: --- I have implemented a first shot patch using the Tika

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-06-30 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12884070#action_12884070 ] Chris A. Mattmann commented on SOLR-1979: - I would look at the Language Identifier i