[ 
https://issues.apache.org/jira/browse/FLUME-798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13135154#comment-13135154
 ] 

Björn Edström commented on FLUME-798:
-------------------------------------

I'm experiencing the same issue, using the cloudera package 
0.9.4+25.9-1~lenny-cdh3. Below is a stack trace that is similar but not 
completely identical to the original above.

2011-10-25 11:38:42,346 INFO 
com.cloudera.flume.handlers.endtoend.AckListener$Empty: Empty Ack Listener 
ended 20111025-113813833+0000.1129353122525368.00000046
2011-10-25 11:38:42,347 INFO 
com.cloudera.flume.agent.durability.NaiveFileWALManager: File lives in 
/var/lib/flume/flume-flume/agent/machine-044.d.company.net/writing/20111025-113813833+0000.1129353122525368.00000046
2011-10-25 11:38:42,347 INFO com.cloudera.flume.handlers.hdfs.SeqfileEventSink: 
constructed new seqfile event sink: 
file=/var/lib/flume/flume-flume/agent/machine-044.d.company.net/writing/20111025-113842347+0000.1129381636287890.00000046
2011-10-25 11:38:41,797 ERROR com.cloudera.flume.core.connector.DirectDriver: 
Closing down due to exception during append calls
java.lang.InterruptedException: Blocked append interrupted by rotation event
        at 
com.cloudera.flume.handlers.rolling.RollSink.append(RollSink.java:209)
        at 
com.cloudera.flume.agent.durability.NaiveFileWALDeco.append(NaiveFileWALDeco.java:132)
        at com.cloudera.flume.agent.AgentSink.append(AgentSink.java:139)
        at 
com.cloudera.flume.core.connector.DirectDriver$PumperThread.run(DirectDriver.java:110)
2011-10-25 11:38:42,348 INFO com.cloudera.flume.core.connector.DirectDriver: 
Connector logicalNode machine-044.d.company.net-44 exited with error: Blocked 
append interrupted by rotation event
java.lang.InterruptedException: Blocked append interrupted by rotation event
        at 
com.cloudera.flume.handlers.rolling.RollSink.append(RollSink.java:209)
        at 
com.cloudera.flume.agent.durability.NaiveFileWALDeco.append(NaiveFileWALDeco.java:132)
        at com.cloudera.flume.agent.AgentSink.append(AgentSink.java:139)
        at 
com.cloudera.flume.core.connector.DirectDriver$PumperThread.run(DirectDriver.java:110)
2011-10-25 11:38:42,348 ERROR com.cloudera.flume.core.connector.DirectDriver: 
Driver interrupted attempting to close source
java.lang.InterruptedException
        at java.lang.Object.wait(Native Method)
        at java.lang.Thread.join(Thread.java:1186)
        at java.lang.Thread.join(Thread.java:1239)
        at 
com.spotify.flume.syslog2.ServerSocketSource.close(ServerSocketSource.java:121)
        at 
com.cloudera.flume.core.connector.DirectDriver$PumperThread.ensureClosed(DirectDriver.java:142)
        at 
com.cloudera.flume.core.connector.DirectDriver$PumperThread.errorCleanup(DirectDriver.java:163)
        at 
com.cloudera.flume.core.connector.DirectDriver$PumperThread.run(DirectDriver.java:116)
2011-10-25 11:38:42,348 INFO com.cloudera.flume.handlers.rolling.RollSink: 
closing RollSink 'ackingWal'
2011-10-25 11:38:42,349 INFO 
com.cloudera.flume.agent.durability.NaiveFileWALManager: opening log file 
20111025-113813833+0000.1129353122525368.00000046
2011-10-25 11:38:42,350 INFO com.cloudera.flume.agent.WALAckManager: Ack for 
20111025-113813833+0000.1129353122525368.00000046 is queued to be checked
2011-10-25 11:38:42,350 INFO com.cloudera.flume.agent.durability.WALSource: end 
of file NaiveFileWALManager 
(dir=/var/lib/flume/flume-flume/agent/machine-044.d.company.net )
2011-10-25 11:38:42,351 INFO 
com.cloudera.flume.handlers.endtoend.AckListener$Empty: Empty Ack Listener 
began 20111025-113842347+0000.1129381636287890.00000046
2011-10-25 11:38:42,352 INFO com.cloudera.flume.handlers.hdfs.SeqfileEventSink: 
closed 
/var/lib/flume/flume-flume/agent/machine-044.d.company.net/writing/20111025-113842347+0000.1129381636287890.00000046
2011-10-25 11:38:42,352 INFO 
com.cloudera.flume.handlers.endtoend.AckListener$Empty: Empty Ack Listener 
ended 20111025-113842347+0000.1129381636287890.00000046
2011-10-25 11:38:42,352 INFO 
com.cloudera.flume.agent.durability.NaiveFileWALManager: File lives in 
/var/lib/flume/flume-flume/agent/machine-044.d.company.net/writing/20111025-113842347+0000.1129381636287890.00000046
2011-10-25 11:38:42,352 INFO 
com.cloudera.flume.agent.durability.NaiveFileWALManager: NaiveFileWALManager 
shutting down
2011-10-25 11:38:42,352 INFO 
com.cloudera.flume.agent.durability.NaiveFileWALManager: opening log file 
20111025-113842347+0000.1129381636287890.00000046
2011-10-25 11:38:42,353 INFO com.cloudera.flume.agent.WALAckManager: Ack for 
20111025-113842347+0000.1129381636287890.00000046 is queued to be checked
2011-10-25 11:38:42,354 INFO com.cloudera.flume.agent.durability.WALSource: end 
of file NaiveFileWALManager 
(dir=/var/lib/flume/flume-flume/agent/machine-044.d.company.net )
2011-10-25 11:38:42,554 WARN 
com.cloudera.flume.agent.durability.NaiveFileWALManager: Already shutting down, 
but getting another shutting down notice, odd
2011-10-25 11:38:42,768 INFO 
com.cloudera.flume.agent.durability.NaiveFileWALManager: NaiveFileWALManager 
shutting down
2011-10-25 11:38:42,775 INFO 
com.cloudera.flume.handlers.thrift.ThriftEventSink: ThriftEventSink on port 
35853 closed
2011-10-25 11:38:42,852 ERROR com.cloudera.flume.core.connector.DirectDriver: 
Exiting driver logicalNode machine-044.d.company.net-44 in error state 
SyslogSocketSource | Agent because Blocked append interrupted by rotation event
.. And here it just locks ..

                
> Blocked append interrupted by rotation event 
> ---------------------------------------------
>
>                 Key: FLUME-798
>                 URL: https://issues.apache.org/jira/browse/FLUME-798
>             Project: Flume
>          Issue Type: Bug
>          Components: Node
>    Affects Versions: v0.9.5
>            Reporter: Cameron Gandevia
>
> Our flume collector seem's to work for a short period of time and then fails 
> with the following exception. When this happens the collector does not 
> reconnect and the system becomes inactive with the processes still running.
> 2011-10-14 01:49:47,386 [logicalNode collector0_log_dir-115] ERROR 
> com.cloudera.flume.core.connector.DirectDriver - Closing down due to 
> exception during append calls
> 2011-10-14 01:49:47,387 [logicalNode collector0_log_dir-115] INFO  
> com.cloudera.flume.core.connector.DirectDriver - Connector logicalNode 
> collector0_log_dir-115 exited with error: Blocked append interrupted by 
> rotation event
> java.lang.InterruptedException: Blocked append interrupted by rotation event
>         at 
> com.cloudera.flume.handlers.rolling.RollSink.append(RollSink.java:209)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at com.cloudera.flume.core.MaskDecorator.append(MaskDecorator.java:43)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at 
> com.cloudera.flume.handlers.debug.InsistentOpenDecorator.append(InsistentOpenDecorator.java:169)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at 
> com.cloudera.flume.handlers.debug.StubbornAppendSink.append(StubbornAppendSink.java:71)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at 
> com.cloudera.flume.handlers.debug.InsistentAppendDecorator.append(InsistentAppendDecorator.java:110)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at 
> com.cloudera.flume.handlers.endtoend.AckChecksumChecker.append(AckChecksumChecker.java:113)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at 
> com.cloudera.flume.handlers.batch.UnbatchingDecorator.append(UnbatchingDecorator.java:62)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at 
> com.cloudera.flume.handlers.batch.GunzipDecorator.append(GunzipDecorator.java:81)
>         at 
> com.cloudera.flume.collector.CollectorSink.append(CollectorSink.java:222)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at 
> com.cloudera.flume.core.extractors.DateExtractor.append(DateExtractor.java:129)
>         at 
> com.cloudera.flume.core.EventSinkDecorator.append(EventSinkDecorator.java:60)
>         at 
> com.cloudera.flume.core.extractors.RegexExtractor.append(RegexExtractor.java:88)
>         at 
> com.cloudera.flume.core.connector.DirectDriver$PumperThread.run(DirectDriver.java:133)
> 2011-10-14 01:49:47,388 [logicalNode collector0_log_dir-115] INFO  
> com.cloudera.flume.collector.CollectorSource - closed
> 2011-10-14 01:49:48,391 [logicalNode collector0_log_dir-115] INFO  
> com.cloudera.flume.handlers.thrift.ThriftEventSource - Closed server on port 
> 36892...
> 2011-10-14 01:49:48,391 [logicalNode collector0_log_dir-115] INFO  
> com.cloudera.flume.handlers.thrift.ThriftEventSource - Queue still has 1000 
> elements ...
> 2011-10-14 01:49:58,399 [logicalNode collector0_log_dir-115] WARN  
> com.cloudera.flume.handlers.thrift.ThriftEventSource - Close timed out due to 
> no progress.  Closing despite having 1000 values still enqueued
> 2011-10-14 01:49:58,399 [logicalNode collector0_log_dir-115] INFO  
> com.cloudera.flume.handlers.rolling.RollSink - closing RollSink 
> 'escapedCustomDfs("hdfs://van-mang-perf-hadoop-namenode1.net:8020/rawLogs/%{dateyear}-%{datemonth}-%{dateday}/%{datehr}00","raw-%{rolltag}"
>  )'
> 2011-10-14 01:49:58,400 [logicalNode collector0_log_dir-115] INFO  
> com.cloudera.flume.handlers.rolling.RollSink - double close 
> 'escapedCustomDfs("hdfs://van-mang-perf-hadoop-namenode1.net:8020/rawLogs/%{dateyear}-%{datemonth}-%{dateday}/%{datehr}00","raw-%{rolltag}"
>  )'
> 2011-10-14 01:49:58,400 [logicalNode collector0_log_dir-115] ERROR 
> com.cloudera.flume.core.connector.DirectDriver - Exiting driver logicalNode 
> collector0_log_dir-115 in error state CollectorSource | RegexExtractor 
> because Blocked append interrupted by rotation event

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira


Reply via email to