That sounds similar to an issue I saw on an RB2011 after I upgraded it to, I think, 6.18. In my case, downgrading to what it had before (I don't remember the specific version off hand), didn't completely fix the problem... it was a couple months ago so I don't remember specifically what it did after I downgraded. I had to reset the router to defaults and re-configure it to get it working again, after which it ran perfectly fine with 6.18.
________________________________ From: Af [af-bounces+mathew=litewire....@afmug.com] on behalf of Rory Conaway via Af [af@afmug.com] Sent: Thursday, September 25, 2014 11:58 AM To: af@afmug.com Subject: Re: [AFMUG] CCR-1036 fun with PPPoE Sorry, I’d have to get my network admin in on this to be more specific. We didn’t have BGP on our routers. I can only tell you the symptoms that I was aware of. CPU utilization went to the roof, remote access was really, really slow, and users had high-latency or were dropping connections and this usually occurred within 24-48 hours. I know we didn’t do a lot of trouble-shooting, just went back to 6.15 since that was stable on everything else we had. We didn’t mess around with it much. Rory From: Af [mailto:af-bounces+rory=triadwireless....@afmug.com] On Behalf Of James Howard via Af Sent: Thursday, September 25, 2014 9:54 AM To: af@afmug.com Subject: Re: [AFMUG] CCR-1036 fun with PPPoE I’m a bit confused. What symptoms did you see with your routers? What I got from Chris’ description was that the CCR caused his Edge router to bomb. Replacing the CCR didn’t fix the problem until they rebooted the Edge router. Did your routers cause other routers to degrade or crash? I’m not sure about there being an issue with 6.19 on the CCR but I can tell you that we saw a similar situation with a CCR that had 6.17 (or possibly older, not sure when we updated it) recently. I would suspect that he’s having an issue with BGP on the CCR. In our case, the CCR had 2 full BGP tables, PPPOE and OSPF on it. It took down one of the BGP peers on the Edge router (PowerRouter V3 in our case). I disabled the BGP peer on the Edge for about 5 minutes and everything worked happily. When I started it back up, everything was fine until it randomly happened again. We then shut down the BGP link between the PowerRouter and the CCR. The CCR does not seem to be able to handle more than one BGP table if it’s doing anything else. Another CCR seems to be happy as an edge router with 2 full tables on it. It’s not doing any other function though and we are in process of ordering an x86 replacement. From: Af [mailto:af-bounces+james=litewire....@afmug.com] On Behalf Of Rory Conaway via Af Sent: Thursday, September 25, 2014 11:38 AM To: af@afmug.com<mailto:af@afmug.com> Subject: Re: [AFMUG] CCR-1036 fun with PPPoE No, our needs are much simpler but we saw similar issues on 5 routers from 750’s to 1100’s. Went back to 6.15 and haven’t had a problem in 3 weeks. Rory From: Af [mailto:af-bounces+rory=triadwireless....@afmug.com] On Behalf Of Chris Wright via Af Sent: Thursday, September 25, 2014 9:24 AM To: af@afmug.com<mailto:af@afmug.com> Subject: Re: [AFMUG] CCR-1036 fun with PPPoE Rory, thanks for your reply. Is your setup fairly similar to ours? PPPoE, Accounting, and BGP all done by the Mikrotik? How many sessions do you have and what kind of throughput? Chris Wright Velociter Wireless<http://www.velociter.net/> From: Af [mailto:af-bounces+chris=velociter....@afmug.com] On Behalf Of Rory Conaway via Af Sent: Thursday, September 25, 2014 7:16 AM To: af@afmug.com<mailto:af@afmug.com> Subject: Re: [AFMUG] CCR-1036 fun with PPPoE On the older routers, we went back to 6.15 and things have been rock solid. We saw similar problems with 6.19. Rory From: Af [mailto:af-bounces+rory=triadwireless....@afmug.com] On Behalf Of James Howard via Af Sent: Thursday, September 25, 2014 6:39 AM To: af@afmug.com<mailto:af@afmug.com> Subject: Re: [AFMUG] CCR-1036 fun with PPPoE Have you tried taking down the BGP session on the Edge router for a couple minutes and then restart it? From: Af [mailto:af-bounces+james=litewire....@afmug.com] On Behalf Of Chris Wright via Af Sent: Wednesday, September 24, 2014 5:37 PM To: af@afmug.com<mailto:af@afmug.com> Subject: [AFMUG] CCR-1036 fun with PPPoE CCR-1036 running RouterOS 6.19 After some serious amounts of testing, we felt our CCR was ready to take the plunge. The core router talks BGP to our two Imagestream Edge routers and gets all 500k+ routes from each in about three minutes. Its PPPoE server manages to authenticate the bulk of nearly 1800 customers in four minutes. All’s fine and dandy for about 12 hours, then not so fine and dandy things start happening. Overall traffic that should be near 600mbps seems to top off around 400mbps. Edge 1 goes unresponsive, VRRP doesn’t kick in on Edge 2 and the entire network degrades. All devices on our public switch go partially unresponsive to pings including our DNS servers, other various VM’s, and ESXi hosts themselves. Here’s the fun part: We took the CCR out, just flat out unplugged it and turned on our old Core routers. They start authenticating customers but they’re insanely slow in doing it. It’s not until we reboot our Edge 1 router that things get back to normal and the old Core routers authenticate at acceptable speeds. Could the CCR be inducing a problem in our Edge routers perhaps? Chris Wright Velociter Wireless<http://www.velociter.net/> ________________________________ Total Control Panel Login<https://asp.reflexion.net/login?domain=litewire.net> To: ja...@litewire.net<https://asp.reflexion.net/address-properties?aID=242260993&domain=litewire.net> From: 00000148a9cf7626-53a62885-89e0-4e1c-9c7f-b4d8519c55eb-000...@amazonses.com<https://asp.reflexion.net/address-properties?aID=2633740531&domain=litewire.net> Message Score: 2 High (60): Pass My Spam Blocking Level: High Medium (75): Pass Low (90): Pass Block<https://asp.reflexion.net/FooterAction?ver=2&bl-sender-address=1&rID=242260993&aID=2633740531&domain=litewire.net> this sender / Block<https://asp.reflexion.net/FooterAction?ver=2&ent=1&bl-sender-address=1&rID=242260993&aID=2633740531&domain=litewire.net> this sender enterprise-wide Block<https://asp.reflexion.net/FooterAction?ver=2&bl-sender-domain=1&rID=242260993&aID=2633740531&domain=litewire.net> amazonses.com / Block<https://asp.reflexion.net/FooterAction?ver=2&ent=1&bl-sender-domain=1&rID=242260993&aID=2633740531&domain=litewire.net> amazonses.com enterprise-wide This message was delivered because the content filter score did not exceed your filter level. ________________________________ Total Control Panel Login<https://asp.reflexion.net/login?domain=litewire.net> To: ja...@litewire.net<https://asp.reflexion.net/address-properties?aID=242260993&domain=litewire.net> From: 00000148adacf3bf-5f506628-914e-4ce6-9d2a-4598e0293a55-000...@amazonses.com<https://asp.reflexion.net/address-properties?aID=2636291500&domain=litewire.net> Message Score: 2 High (60): Pass My Spam Blocking Level: High Medium (75): Pass Low (90): Pass Block<https://asp.reflexion.net/FooterAction?ver=2&bl-sender-address=1&rID=242260993&aID=2636291500&domain=litewire.net> this sender / Block<https://asp.reflexion.net/FooterAction?ver=2&ent=1&bl-sender-address=1&rID=242260993&aID=2636291500&domain=litewire.net> this sender enterprise-wide Block<https://asp.reflexion.net/FooterAction?ver=2&bl-sender-domain=1&rID=242260993&aID=2636291500&domain=litewire.net> amazonses.com / Block<https://asp.reflexion.net/FooterAction?ver=2&ent=1&bl-sender-domain=1&rID=242260993&aID=2636291500&domain=litewire.net> amazonses.com enterprise-wide This message was delivered because the content filter score did not exceed your filter level.