[
https://issues.apache.org/jira/browse/DISPATCH-1417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16928470#comment-16928470
]
ASF GitHub Bot commented on DISPATCH-1417:
------------------------------------------
ChugR commented on issue #565: DISPATCH-1417 - First attempt at fixing router
crash by introducing a…
URL: https://github.com/apache/qpid-dispatch/pull/565#issuecomment-530789317
This patch fixes the issue.
One sender sends 1,000,000 unsettled messages into a network. Three
permanently attached receivers on different edge routers receive all the
messages. All sender messages were confirmed and no messages were rejected or
released.
A temporary client 'attaches to one edge router, receives one message, and
disconnects' 14,000 times.
The router that was failing was the one to which the temporary client was
connecting and disconnecting. With the original 1.9.0 candidate the router
failed within a few thousand messages. With this PR the router stayed up.
I took no measurements of performance (CPU, messages per second, latency,
credit stall, etc.)
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
> Crash when connection_wake ctx points to freed memory
> -----------------------------------------------------
>
> Key: DISPATCH-1417
> URL: https://issues.apache.org/jira/browse/DISPATCH-1417
> Project: Qpid Dispatch
> Issue Type: Bug
> Components: Router Node
> Affects Versions: 1.8.0
> Reporter: Chuck Rolke
> Assignee: Ganesh Murthy
> Priority: Major
> Fix For: 1.9.0
>
>
> Test clients are streaming unsettled multicast messages to and from an edge
> router. Another client repeats the cycle "connect, receive one message from
> the stream, disconnect". Soon the edge router core dumps with:
> {{(gdb) bt
> #0 get_pconnection (c=0x9999999999999999) at
> /home/chug/git/qpid-proton/c/src/proactor/epoll.c:578
> #1 0x00007fc8c0582a1c in pn_connection_wake (c=0x9999999999999999) at
> /home/chug/git/qpid-proton/c/src/proactor/epoll.c:1439
> #2 0x00007fc8c0668472 in connection_wake (ctx=0x1a43658) at
> /home/chug/Downloads/qpid-dispatch-1.9.0/src/server.c:505
> #3 0x00007fc8c066b2af in qd_server_activate (ctx=0x1a43658) at
> /home/chug/Downloads/qpid-dispatch-1.9.0/src/server.c:1304
> #4 0x00007fc8c064f3dd in qdr_activate_connections_CT (core=0x19c8ce0) at
> /home/chug/Downloads/qpid-dispatch-1.9.0/src/router_core/router_core_thread.c:65
> #5 0x00007fc8c064fa1d in router_core_thread (arg=0x19c8ce0) at
> /home/chug/Downloads/qpid-dispatch-1.9.0/src/router_core/router_core_thread.c:171
> #6 0x00007fc8c056258e in start_thread () from /usr/lib64/libpthread.so.0
> #7 0x00007fc8c0201713 in clone () from /usr/lib64/libc.so.6
> (gdb) info threads
> Id Target Id Frame
> * 1 Thread 0x7fc8b1e44700 (LWP 21706) get_pconnection (c=0x9999999999999999)
> at /home/chug/git/qpid-proton/c/src/proactor/epoll.c:578
> 2 Thread 0x7fc8bf8ff240 (LWP 21696) 0x00007fc8c0201a47 in epoll_wait () from
> /usr/lib64/libc.so.6
> 3 Thread 0x7fc8b0e42700 (LWP 21708) 0x00007fc8c0201a47 in epoll_wait () from
> /usr/lib64/libc.so.6
> 4 Thread 0x7fc8abfff700 (LWP 21709) 0x00007fc8c0201a47 in epoll_wait () from
> /usr/lib64/libc.so.6
> 5 Thread 0x7fc8b1643700 (LWP 21707) 0x00007fc8c01f6481 in poll () from
> /usr/lib64/libc.so.6
> 6 Thread 0x7fc8ab7fe700 (LWP 21710) 0x00007fc8c0201a47 in epoll_wait () from
> /usr/lib64/libc.so.6}}
--
This message was sent by Atlassian Jira
(v8.3.2#803003)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]