Hi Jason: Thanks for your information.
We did experience the SSH error, such as bad packet length or Corrupted MAC address. I am not exactly sure if it is a software bug or not. I had used SSH for a long time. I did not see any connection broken so frequently like Roach. The Ethernet problem on one of our Roachs is link problem. It has some difficulties to establish link. And another Roach has stability problem. I use Ethernet to transfer big file located on Roach external USB hard disk. I found I have no chance to finish a success transfer. The file could be transferred for a few hundred MB. Then the transfer will be broken. Could you please help me to verify if you can upload and download 1GB file with Roach? Thanks Wan -----Original Message----- From: Jason Manley [mailto:jasonman...@gmail.com] Sent: Friday, 30 October 2009 4:30 PM To: John Ford; Wormnes, Kjetil (ATNF, Marsfield); Cheng, Wan (ATNF, Marsfield) Cc: Dan Werthimer; casper@lists.berkeley.edu Subject: Re: [casper] Fwd: Re: SPDO ROACH spectrometer Um, no, this is probably a different problem. You are getting these errors while using SSH/SCP, right? The hardware problem with faulty PHY manifests as one or more of the PHY LEDs flashing on/off (there are three red ones next to the PHY chip). If your link is stable, then I believe the hardware is fine. The "MAC" problem appears to be software related, and comes and goes depending on the kernel build. It does not refer to the MAC address, but rather ssh's Machine Authentication Code. Check out http://blogs.sun.com/janp/entry/ssh_messages_code_bad_packet for some info. Dave's made various changes to try'n fix it, and increasing some software buffer has solved it for me. I no longer see this problem, but it's probably been masked rather than solved. Also, you never see it using KATCP, which is one more reason to use that method for larger transfers. WRT large (>1GB) transfers, remember that it will take a long time to pull that much data off the FPGA. It does so in pages of ~4000Bytes at a time. Also make sure you're using the latest kernel. We discovered a bug in this paging system during the workshop. http://casper.berkeley.edu/svn/trunk/roach/sw/binaries/linux/uImage-20091006-mmcfix should be good. I have never tried pulling such volumes over the SSH shell, but it works fine with KATCP. I will ask him to comment further. Jason On 30 Oct 2009, at 01:25, John Ford wrote: >> >> casper collaborators, >> >> appended below is further info on roach ethernet problems seen at >> CSIRO: >> any ideas? > > If I recall correctly, Alan mentioned this problem at the workshop, > and > the problem was that some of the PHY chips were faulty at one > point. This > may be what's going on. Hopefully someone knows for sure! > > John > >> >> thanks, >> >> dan >> >> -------- Original Message -------- >> Subject: Re: SPDO ROACH spectrometer >> Date: Fri, 30 Oct 2009 09:19:01 +1100 >> From: Kjetil Wormnes <kjetil.worm...@csiro.au> >> To: Dan Werthimer <d...@ssl.berkeley.edu> >> >> Hi Dan and Wan >> >> I can confirm that we are seeing at least some of the problems with >> another ROACH board as well. This time it is connected directly to a >> computer with a short CATY5 cable. >> >> So maybe this indicates that it is less likely to be a hardware >> problem? >> Incidentally, the error message that happens when attempting to >> download >> a large file over sftp is "Corrupted MAC on input". >> >> cheers >> >> Kjetil >> >> Dan Werthimer wrote: >>> hi wan, >>> >>> i don't know of anyone who has roach ethernet >>> problems at 100 Mbit/sec. >>> >>> i'm cc'ing casper community to see if anyone has any ideas. >>> in general, it's good to post questions to cas...@lists, >>> so that everyone can help answer, and everyone can see the answers, >>> and the info will be captured in the wiki/email archive. >>> >>> if you want you can buy or ask digicom if they can send you >>> another national PHY chip and see if this helps. >>> >>> also you might want to try using short cable, and/or a cat6 cable. >>> is your roach connected directly to a computer, or going >>> through a switch? might be interesting to try a different NIC >>> or different switch or different computer. >>> >>> best, >>> >>> dan >>> >>> >>> On 10/29/2009 02:47 PM, wan.ch...@csiro.au wrote: >>> >>>> Hi Dan: >>>> >>>> I believe you have done a very nice job. >>>> >>>> My problem is Ethernet port is not very reliable. Even running at >>>> 100MHz, the Ethernet port will be disconnected at some times. >>>> Normally, >>>> it can resume after reboot whole system. >>>> >>>> And I could not transfer big file through ethernet. Small files >>>> like a >>>> few MB are all right. But I could not download 1GB file from >>>> Roach at >>>> all. >>>> >>>> So Dan, could this problem be solved by replacing the on board PHY? >>>> >>>> Thanks >>>> >>>> Wan >>>> >>>> >> >> > > >