[ https://issues.apache.org/jira/browse/CASSANDRA-3859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210259#comment-13210259 ]
Erik Forsberg commented on CASSANDRA-3859:
------------------------------------------

I'm guessing the first version of the patch is the reason I'm seeing lots of these in my Cassandra server's logs:

{noformat}
ERROR [Thread-544] 2012-02-17 13:29:17,936 AbstractCassandraDaemon.java (line 134) Fatal exception in thread Thread[Thread-544,5,main]
java.io.IOError: java.io.EOFException
    at org.apache.cassandra.io.util.ColumnIterator.deserializeNext(ColumnSortedMap.java:260)
    at org.apache.cassandra.io.util.ColumnIterator.next(ColumnSortedMap.java:276)
    at org.apache.cassandra.io.util.ColumnIterator.next(ColumnSortedMap.java:233)
    at edu.stanford.ppl.concurrent.SnapTreeMap.<init>(SnapTreeMap.java:422)
    at org.apache.cassandra.db.AtomicSortedColumns$Holder.<init>(AtomicSortedColumns.java:301)
    at org.apache.cassandra.db.AtomicSortedColumns.<init>(AtomicSortedColumns.java:77)
    at org.apache.cassandra.db.AtomicSortedColumns.<init>(AtomicSortedColumns.java:48)
    at org.apache.cassandra.db.AtomicSortedColumns$1.fromSorted(AtomicSortedColumns.java:61)
    at org.apache.cassandra.db.SuperColumnSerializer.deserialize(SuperColumn.java:371)
    at org.apache.cassandra.io.sstable.SSTableWriter.appendFromStream(SSTableWriter.java:244)
    at org.apache.cassandra.streaming.IncomingStreamReader.streamIn(IncomingStreamReader.java:146)
    at org.apache.cassandra.streaming.IncomingStreamReader.read(IncomingStreamReader.java:93)
    at org.apache.cassandra.net.IncomingTcpConnection.stream(IncomingTcpConnection.java:185)
    at org.apache.cassandra.net.IncomingTcpConnection.run(IncomingTcpConnection.java:81)
Caused by: java.io.EOFException
    at java.io.DataInputStream.readInt(DataInputStream.java:375)
    at org.apache.cassandra.utils.BytesReadTracker.readInt(BytesReadTracker.java:101)
    at org.apache.cassandra.utils.ByteBufferUtil.readWithLength(ByteBufferUtil.java:350)
    at org.apache.cassandra.db.ColumnSerializer.deserialize(ColumnSerializer.java:114)
    at org.apache.cassandra.io.util.ColumnIterator.deserializeNext(ColumnSortedMap.java:256)
    ... 13 more
{noformat}

Retrying with the new patch.

> Add Progress Reporting to Cassandra OutputFormats
> -------------------------------------------------
>
>                 Key: CASSANDRA-3859
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-3859
>             Project: Cassandra
>          Issue Type: Improvement
>          Components: Hadoop, Tools
>    Affects Versions: 1.1.0
>            Reporter: Samarth Gahire
>            Assignee: Brandon Williams
>            Priority: Minor
>              Labels: bulkloader, hadoop, mapreduce, sstableloader
>             Fix For: 1.1.0
>
>         Attachments: 0001-add-progress-reporting-to-BOF.txt,
>                      0002-Add-progress-to-CFOF.txt
>
>   Original Estimate: 48h
>  Remaining Estimate: 48h
>
> When BulkOutputFormat is used to load data into Cassandra, the sstable loader
> should report progress to the Hadoop job. If streaming for a particular task
> takes a long time and no progress is reported to the job, Hadoop may kill the
> task with a timeout exception.