[jira] Commented: (LUCENE-971) Create enwiki indexable data as line-per-article rather than file-per-article

2007-08-01 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516996 ] Michael McCandless commented on LUCENE-971: This looks great! One alternate approach here would be to cre

[jira] Commented: (LUCENE-971) Create enwiki indexable data as line-per-article rather than file-per-article

2007-08-01 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12516997 ] Steven Parkes commented on LUCENE-971: I can look at what it would take to avoid the line file ... but ... what

[jira] Commented: (LUCENE-971) Create enwiki indexable data as line-per-article rather than file-per-article

2007-08-01 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12517007 ] Michael McCandless commented on LUCENE-971: > I can look at what it would take to avoid the line file ...

[jira] Commented: (LUCENE-971) Create enwiki indexable data as line-per-article rather than file-per-article

2007-08-01 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12517048 ] Doron Cohen commented on LUCENE-971: Mmm... an additional advantage of this is not needing to extract the entire

[jira] Commented: (LUCENE-971) Create enwiki indexable data as line-per-article rather than file-per-article

2007-08-01 Thread Doron Cohen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12517047 ] Doron Cohen commented on LUCENE-971: > But, this is the case regardless of which approach we use (ie, both > appr

[jira] Commented: (LUCENE-971) Create enwiki indexable data as line-per-article rather than file-per-article

2007-08-06 Thread Steven Parkes (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518016 ] Steven Parkes commented on LUCENE-971: Sounds good. New patch soon. > Create enwiki indexable data as line-per

[jira] Commented: (LUCENE-971) Create enwiki indexable data as line-per-article rather than file-per-article

2007-08-08 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518524 ] Michael McCandless commented on LUCENE-971: Patch looks good; a few comments: * In conf/wikipedia.alg:

[jira] Commented: (LUCENE-971) Create enwiki indexable data as line-per-article rather than file-per-article

2007-08-09 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12518663 ] Michael McCandless commented on LUCENE-971: Super, new patch looks good. I will commit! > Create enwiki