[jira] [Updated] (CASSANDRA-4813) Problem using BulkOutputFormat while streaming several SSTables simultaneously from a given node.

2012-11-09 Thread Michael Kjellman (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Kjellman updated CASSANDRA-4813:


Comment: was deleted

(was: [~yukim] I take it back. Ship It! Dependency issues on my end with the 
last patch, sorry for the false alarm.)

> Problem using BulkOutputFormat while streaming several SSTables 
> simultaneously from a given node.
> -
>
> Key: CASSANDRA-4813
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4813
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.1.0
> Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop 
> nodes, 3 Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. 
> The machines used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 
> 33 GB of RAM. I get the issue on both cassandra 1.1.3, 1.1.5 and I am using 
> Hadoop 0.20.2.
>Reporter: Ralph Romanos
>Assignee: Yuki Morishita
>Priority: Minor
>  Labels: Bulkoutputformat, Hadoop, SSTables
> Fix For: 1.2.0 rc1
>
> Attachments: 4813.txt
>
>
> The issue occurs when streaming simultaneously SSTables from the same node to 
> a cassandra cluster using SSTableloader. It seems to me that Cassandra cannot 
> handle receiving simultaneously SSTables from the same node. However, when it 
> receives simultaneously SSTables from two different nodes, everything works 
> fine. As a consequence, when using BulkOutputFormat to generate SSTables and 
> stream them to a cassandra cluster, I cannot use more than one reducer per 
> node otherwise I get a java.io.EOFException in the tasktracker's logs and a 
> java.io.IOException: Broken pipe in the Cassandra logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4813) Problem using BulkOutputFormat while streaming several SSTables simultaneously from a given node.

2012-11-08 Thread Yuki Morishita (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuki Morishita updated CASSANDRA-4813:
--

Attachment: (was: 4813.txt)

> Problem using BulkOutputFormat while streaming several SSTables 
> simultaneously from a given node.
> -
>
> Key: CASSANDRA-4813
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4813
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.1.0
> Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop 
> nodes, 3 Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. 
> The machines used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 
> 33 GB of RAM. I get the issue on both cassandra 1.1.3, 1.1.5 and I am using 
> Hadoop 0.20.2.
>Reporter: Ralph Romanos
>Assignee: Yuki Morishita
>Priority: Minor
>  Labels: Bulkoutputformat, Hadoop, SSTables
> Fix For: 1.2.0 rc1
>
> Attachments: 4813.txt
>
>
> The issue occurs when streaming simultaneously SSTables from the same node to 
> a cassandra cluster using SSTableloader. It seems to me that Cassandra cannot 
> handle receiving simultaneously SSTables from the same node. However, when it 
> receives simultaneously SSTables from two different nodes, everything works 
> fine. As a consequence, when using BulkOutputFormat to generate SSTables and 
> stream them to a cassandra cluster, I cannot use more than one reducer per 
> node otherwise I get a java.io.EOFException in the tasktracker's logs and a 
> java.io.IOException: Broken pipe in the Cassandra logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4813) Problem using BulkOutputFormat while streaming several SSTables simultaneously from a given node.

2012-11-08 Thread Yuki Morishita (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuki Morishita updated CASSANDRA-4813:
--

Attachment: 4813.txt

ok, it looks like we have CASSANDRA-3839 for BOF.
Updated patch to avoid socket re-binding.

[~mkjellman] How about this one?

> Problem using BulkOutputFormat while streaming several SSTables 
> simultaneously from a given node.
> -
>
> Key: CASSANDRA-4813
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4813
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.1.0
> Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop 
> nodes, 3 Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. 
> The machines used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 
> 33 GB of RAM. I get the issue on both cassandra 1.1.3, 1.1.5 and I am using 
> Hadoop 0.20.2.
>Reporter: Ralph Romanos
>Assignee: Yuki Morishita
>Priority: Minor
>  Labels: Bulkoutputformat, Hadoop, SSTables
> Fix For: 1.2.0 rc1
>
> Attachments: 4813.txt
>
>
> The issue occurs when streaming simultaneously SSTables from the same node to 
> a cassandra cluster using SSTableloader. It seems to me that Cassandra cannot 
> handle receiving simultaneously SSTables from the same node. However, when it 
> receives simultaneously SSTables from two different nodes, everything works 
> fine. As a consequence, when using BulkOutputFormat to generate SSTables and 
> stream them to a cassandra cluster, I cannot use more than one reducer per 
> node otherwise I get a java.io.EOFException in the tasktracker's logs and a 
> java.io.IOException: Broken pipe in the Cassandra logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4813) Problem using BulkOutputFormat while streaming several SSTables simultaneously from a given node.

2012-10-31 Thread Yuki Morishita (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuki Morishita updated CASSANDRA-4813:
--

Attachment: 4813.txt

Attaching newer version. Since with this patch, we only distinguish streaming 
session by UUID, we don't need to carry around broadcast 
address(CASSANDRA-3503), so I removed it from StreamHeader.

> Problem using BulkOutputFormat while streaming several SSTables 
> simultaneously from a given node.
> -
>
> Key: CASSANDRA-4813
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4813
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.1.0
> Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop 
> nodes, 3 Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. 
> The machines used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 
> 33 GB of RAM. I get the issue on both cassandra 1.1.3, 1.1.5 and I am using 
> Hadoop 0.20.2.
>Reporter: Ralph Romanos
>Assignee: Yuki Morishita
>Priority: Minor
>  Labels: Bulkoutputformat, Hadoop, SSTables
> Fix For: 1.2.0
>
> Attachments: 4813.txt
>
>
> The issue occurs when streaming simultaneously SSTables from the same node to 
> a cassandra cluster using SSTableloader. It seems to me that Cassandra cannot 
> handle receiving simultaneously SSTables from the same node. However, when it 
> receives simultaneously SSTables from two different nodes, everything works 
> fine. As a consequence, when using BulkOutputFormat to generate SSTables and 
> stream them to a cassandra cluster, I cannot use more than one reducer per 
> node otherwise I get a java.io.EOFException in the tasktracker's logs and a 
> java.io.IOException: Broken pipe in the Cassandra logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4813) Problem using BulkOutputFormat while streaming several SSTables simultaneously from a given node.

2012-10-31 Thread Yuki Morishita (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuki Morishita updated CASSANDRA-4813:
--

Attachment: (was: 4813.txt)

> Problem using BulkOutputFormat while streaming several SSTables 
> simultaneously from a given node.
> -
>
> Key: CASSANDRA-4813
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4813
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.1.0
> Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop 
> nodes, 3 Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. 
> The machines used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 
> 33 GB of RAM. I get the issue on both cassandra 1.1.3, 1.1.5 and I am using 
> Hadoop 0.20.2.
>Reporter: Ralph Romanos
>Assignee: Yuki Morishita
>Priority: Minor
>  Labels: Bulkoutputformat, Hadoop, SSTables
> Fix For: 1.2.0
>
>
> The issue occurs when streaming simultaneously SSTables from the same node to 
> a cassandra cluster using SSTableloader. It seems to me that Cassandra cannot 
> handle receiving simultaneously SSTables from the same node. However, when it 
> receives simultaneously SSTables from two different nodes, everything works 
> fine. As a consequence, when using BulkOutputFormat to generate SSTables and 
> stream them to a cassandra cluster, I cannot use more than one reducer per 
> node otherwise I get a java.io.EOFException in the tasktracker's logs and a 
> java.io.IOException: Broken pipe in the Cassandra logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4813) Problem using BulkOutputFormat while streaming several SSTables simultaneously from a given node.

2012-10-30 Thread Yuki Morishita (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yuki Morishita updated CASSANDRA-4813:
--

Attachment: 4813.txt

Attaching patch to change streaming session ID from (host, counter) pair to 
Time UUID. This should hugely drop probability of session ID collision. I 
haven't tested with BOF, but I spawned 3 sstableloader simultaneously on the 
same node with C* and could finish streaming without getting errors.

> Problem using BulkOutputFormat while streaming several SSTables 
> simultaneously from a given node.
> -
>
> Key: CASSANDRA-4813
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4813
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.1.0
> Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop 
> nodes, 3 Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. 
> The machines used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 
> 33 GB of RAM. I get the issue on both cassandra 1.1.3, 1.1.5 and I am using 
> Hadoop 0.20.2.
>Reporter: Ralph Romanos
>Assignee: Yuki Morishita
>Priority: Minor
>  Labels: Bulkoutputformat, Hadoop, SSTables
> Fix For: 1.2.0
>
> Attachments: 4813.txt
>
>
> The issue occurs when streaming simultaneously SSTables from the same node to 
> a cassandra cluster using SSTableloader. It seems to me that Cassandra cannot 
> handle receiving simultaneously SSTables from the same node. However, when it 
> receives simultaneously SSTables from two different nodes, everything works 
> fine. As a consequence, when using BulkOutputFormat to generate SSTables and 
> stream them to a cassandra cluster, I cannot use more than one reducer per 
> node otherwise I get a java.io.EOFException in the tasktracker's logs and a 
> java.io.IOException: Broken pipe in the Cassandra logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4813) Problem using BulkOutputFormat while streaming several SSTables simultaneously from a given node.

2012-10-23 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-4813:
--

 Priority: Minor  (was: Major)
Affects Version/s: (was: 1.1.5)
   (was: 1.1.3)
   1.1.0
Fix Version/s: 1.2.0

If we need to change streaming protocol to fix this then we should target 1.2.

> Problem using BulkOutputFormat while streaming several SSTables 
> simultaneously from a given node.
> -
>
> Key: CASSANDRA-4813
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4813
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.1.0
> Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop 
> nodes, 3 Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. 
> The machines used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 
> 33 GB of RAM. I get the issue on both cassandra 1.1.3, 1.1.5 and I am using 
> Hadoop 0.20.2.
>Reporter: Ralph Romanos
>Assignee: Yuki Morishita
>Priority: Minor
>  Labels: Bulkoutputformat, Hadoop, SSTables
> Fix For: 1.2.0
>
>
> The issue occurs when streaming simultaneously SSTables from the same node to 
> a cassandra cluster using SSTableloader. It seems to me that Cassandra cannot 
> handle receiving simultaneously SSTables from the same node. However, when it 
> receives simultaneously SSTables from two different nodes, everything works 
> fine. As a consequence, when using BulkOutputFormat to generate SSTables and 
> stream them to a cassandra cluster, I cannot use more than one reducer per 
> node otherwise I get a java.io.EOFException in the tasktracker's logs and a 
> java.io.IOException: Broken pipe in the Cassandra logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (CASSANDRA-4813) Problem using BulkOutputFormat while streaming several SSTables simultaneously from a given node.

2012-10-16 Thread Ralph Romanos (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ralph Romanos updated CASSANDRA-4813:
-

Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop nodes, 3 
Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. The machines 
used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 33 GB of RAM. I 
get the issue on both cassandra 1.1.3, 1.1.5 and I am using Hadoop 0.20.2.  
(was: I am using SLES 10 SP3, 4 Cassandra + Hadoop nodes, 3 Hadoop only nodes 
(datanodes/tasktrackers), 1 namenode/jobtracker. The machines used are Six-Core 
AMD Opteron(tm) Processor 8431, 24 cores and 33 GB of RAM. I get the issue on 
both cassandra 1.1.3, 1.1.5 and I am using Hadoop 0.20.2.)

> Problem using BulkOutputFormat while streaming several SSTables 
> simultaneously from a given node.
> -
>
> Key: CASSANDRA-4813
> URL: https://issues.apache.org/jira/browse/CASSANDRA-4813
> Project: Cassandra
>  Issue Type: Bug
>Affects Versions: 1.1.3, 1.1.5
> Environment: I am using SLES 10 SP3, Java 6, 4 Cassandra + Hadoop 
> nodes, 3 Hadoop only nodes (datanodes/tasktrackers), 1 namenode/jobtracker. 
> The machines used are Six-Core AMD Opteron(tm) Processor 8431, 24 cores and 
> 33 GB of RAM. I get the issue on both cassandra 1.1.3, 1.1.5 and I am using 
> Hadoop 0.20.2.
>Reporter: Ralph Romanos
>  Labels: Bulkoutputformat, Hadoop, SSTables
>
> The issue occurs when streaming simultaneously SSTables from the same node to 
> a cassandra cluster using SSTableloader. It seems to me that Cassandra cannot 
> handle receiving simultaneously SSTables from the same node. However, when it 
> receives simultaneously SSTables from two different nodes, everything works 
> fine. As a consequence, when using BulkOutputFormat to generate SSTables and 
> stream them to a cassandra cluster, I cannot use more than one reducer per 
> node otherwise I get a java.io.EOFException in the tasktracker's logs and a 
> java.io.IOException: Broken pipe in the Cassandra logs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira