[jira] [Updated] (LUCENE-2977) WriteLineDocTask should write gzip/bzip2/txt according to the extension of specified output file name

2011-03-23 Thread Doron Cohen (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Doron Cohen updated LUCENE-2977:


Attachment: LUCENE-2977.patch

Thanks for reviewing Shai!

bq. In StreamUtils you have ".bz" -- it should be ".bz2"

Good catch!
Fixed.

bq. +1 (you mean the bzip.compression property in WLDT right?). 

Yes.

bq. I think that it's reasonable to request the user to specify an output file 
with .bz2 extension if he wants bzip compression. 

Great, I removed it.

bq. I don't see how it will simplify StreamUtils though, but I trust you :) 
(perhaps you meant it will simplify WLDT?)

It allowed to keep just one of the two variations of 
StreamUtils.outputStream(). WLDT and the tests became simpler as well.

Attaching updated patch.
(again first apply that svn mv...)

> WriteLineDocTask should write gzip/bzip2/txt according to the extension of 
> specified output file name
> -
>
> Key: LUCENE-2977
> URL: https://issues.apache.org/jira/browse/LUCENE-2977
> Project: Lucene - Java
>  Issue Type: Improvement
>  Components: contrib/benchmark
>Reporter: Doron Cohen
>Assignee: Doron Cohen
>Priority: Minor
> Fix For: 3.2, 4.0
>
> Attachments: LUCENE-2977.patch, LUCENE-2977.patch
>
>
> Since the readers behave this way it would be nice and handy if also this 
> line writer would.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-2977) WriteLineDocTask should write gzip/bzip2/txt according to the extension of specified output file name

2011-03-23 Thread Doron Cohen (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Doron Cohen updated LUCENE-2977:


Attachment: LUCENE-2977.patch

Patch for auto-detecting output compression mode of result line file:

- getInputStream() moved from ContentSource to a new class StreamUtils under 
util. It is now named inputStream(File).
- outputStream() method added to StreamUtils.

Before applying this patch *svn mv 
modules/benchmark/src/test/org/apache/lucene/benchmark/byTask/feeds/ContentSourceTest.java
 
modules/benchmark/src/test/org/apache/lucene/benchmark/byTask/utils/StreamUtilsTest.java*

I kept for now the "force-bzip" logic in WriteLineDocTask but I would like to 
remove it - it is strange, and in any case LineDocSource would only auto-detect 
bzip input format if WriteLineDocTask was able to auto-detect bzip output 
format. Removing it will also simplify StreamUtils. Any opinions on removing 
this "force-bzip" option?


> WriteLineDocTask should write gzip/bzip2/txt according to the extension of 
> specified output file name
> -
>
> Key: LUCENE-2977
> URL: https://issues.apache.org/jira/browse/LUCENE-2977
> Project: Lucene - Java
>  Issue Type: Improvement
>  Components: contrib/benchmark
>Reporter: Doron Cohen
>Assignee: Doron Cohen
>Priority: Minor
> Fix For: 3.2, 4.0
>
> Attachments: LUCENE-2977.patch
>
>
> Since the readers behave this way it would be nice and handy if also this 
> line writer would.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-2977) WriteLineDocTask should write gzip/bzip2/txt according to the extension of specified output file name

2011-03-23 Thread Doron Cohen (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-2977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Doron Cohen updated LUCENE-2977:


Summary: WriteLineDocTask should write gzip/bzip2/txt according to the 
extension of specified output file name  (was: WriteLineDocTask should write 
gzip/bzip2/txt according to the extension of specifie output file name)

> WriteLineDocTask should write gzip/bzip2/txt according to the extension of 
> specified output file name
> -
>
> Key: LUCENE-2977
> URL: https://issues.apache.org/jira/browse/LUCENE-2977
> Project: Lucene - Java
>  Issue Type: Improvement
>  Components: contrib/benchmark
>Reporter: Doron Cohen
>Assignee: Doron Cohen
>Priority: Minor
> Fix For: 3.2, 4.0
>
>
> Since the readers behave this way it would be nice and handy if also this 
> line writer would.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org