Re: [casper] Question of chose Correlator Architecture
Hi, Peter,

On Dec 2, 2014, at 9:34 AM, Peter Niu wrote:

> Thanks for your reply.Our switch ports are enough,but the Nics on roach is
> not enough(we only have 4 ports each roach),so if we send packets to x box
> though switch,we only need 4 10Gbe ports on each roach.that is why we want to
> use the pre-build paper model.

Yes, to do "F -> SWITCH -> X" you only need four 10 GbE ports on the ROACH2. In theory this could be done with 1 SFP+ card in the ROACH2, but if you want to use the pre-built model available on the internet to do this, you will still need 2 SFP+ cards in each ROACH2 due to how the interfaces are allocated. If you modify the PAPER model, you can reassign the four "eth_N_sw" cores to be on 1 SFP+ card. The "eth_N_gpu" cores would be unused and could be deleted to save resources.

> if we want to use the pre-build model,the initial ruby scripts may should be
> modified.the eth_n_gpu codes should be delete,the arp codes of eth_n_sw
> should be changed.except the ethernet part of ruby codes should be
> modified,need the other
> parts codes like the PFB ,EQ parts be changed?

Yes, the network config stuff will have to change somewhat, but the rest of the configuration will remain the same. Once you have settled on a network configuration, it should be fairly straightforward to change the paper_feng_init.rb script accordingly.

> could the pre-build model and scripts find on the websites?

All my pre-built PAPER F engine models and scripts are on the internet and I think you already have them.

Dave
Re: [casper] ROACH serial connection issues
Hi David,

The problem is that I no longer see the counter, and any command I give the ROACH (such as run bit, run sailboat, etc.) is ignored, or at least no feedback is received after the "Hit any key" message.

U-Boot is running using bootstrap option C. Should it be H? The DIP switches are currently all in the off position. I have also tried different combinations of the DIP switches, such as bootstrap option B (slow mode).

I will try a different USB-to-serial adapter; maybe mine is faulty or something weird is happening.

Kind regards,
Norbert

On 2 December 2014 at 18:07, David MacMahon wrote:
> Hi, Norbert,
>
> If you hit any key to stop autoboot when it says "Hit any key to stop
> autoboot", does it in fact stop the autoboot? If so, you could use u-boot's
> "printenv" command to see what commands get run as part of autoboot and then
> try to run them "by hand" to try to figure out where things go bad.
>
> HTH,
> Dave
>
> On Dec 2, 2014, at 12:16 AM, Norbert Bonnici wrote:
>
>> Hi Marc,
>>
>> The USB dongle's baud rate should have been set properly. When set to
>> different baud rates no readable data is received through the serial
>> port. Added line wrapping but it didn't change anything.
>>
>> In addition, recently the communications are being disabled when the
>> GND wire is connected to the USB dongle. Data is only being received
>> when only the tx and rx wires are connected.
>>
>> Regards,
>> Norbert
>>
>> On 2 December 2014 at 08:39, Marc Welz wrote:
>>>
>>> On Mon, Dec 1, 2014 at 2:47 PM, Norbert Bonnici wrote:
>>>>
>>>> Dear Marc,
>>>>
>>>> I've have tried all the possible CR+LF combinations.
>>>>
>>>> Any ideas?
>>>
>>> Then I am not sure - I know that some USB dongles attempt to autodetect the
>>> serial speed - maybe something is going wrong there ? Also, maybe enable
>>> line wrapping (Control-A W) might help.
>>>
>>> BTW: CC'ing the mailing list is good form - it helps others who might have
>>> the same problem, and you might also get suggestions from other people
>>>
>>> regards
>>>
>>> marc
Re: [casper] Question of chose Correlator Architecture
Hi, Dave,

Thanks for your reply. We have enough switch ports, but not enough NICs on the ROACH (we only have 4 ports on each ROACH). If we send packets to the X boxes through the switch, we only need four 10 GbE ports on each ROACH. That is why we want to use the pre-built PAPER model.

If we use the pre-built model, the initialization Ruby scripts may need to be modified: the eth_n_gpu code should be deleted and the ARP code for eth_n_sw should be changed. Apart from the Ethernet part of the Ruby code, do other parts, such as the PFB and EQ sections, need to be changed? Can the pre-built model and scripts be found on the website?

Best Regards!
Peter

--
Sent from NetEase Mail for Android

On 2014-12-03 01:01, David MacMahon wrote:
> Hi, Peter,
>
> If you have enough ports on the switch then you certainly can configure
> things to send the packets directly from the ROACHs to the various X boxes.
> The pre-built PAPER model will support this if properly configured. The one
> thing to keep in mind is the IP-to-MAC address table in the 10 GbE cores.
> These are setup by paper_feng_init.rb. You could use the "factory" MAC
> addresses of the X-box 10 GbE interfaces or you could configure the X boxes
> to set their 10 GbE MAC addresses to predefined values. If using the latter
> approach, we often use 02:02:ww:xx:yy:zz where ww:xx:yy:zz corresponds to
> the IP address.
>
> Hope this helps,
> Dave
>
> On Dec 1, 2014, at 5:21 AM, Peter Niu wrote:
>
>> Hi,Dave,
>> Thanks for your Document about EQ,and suggestion about the sample rate.Now I
>> have a question about the correlator architecture.
>> I have saw your PPT :
>> Correlator Architectures
>> Present and Future
>> CASPER Workshop 2011
>>
>> The structure mentioned in the PPT is the structure PAPER used now .Using a
>> set of precise IP assignment to avoid Loop Back is ok. However,If we use the
>> Packetized F/X Concept:Uses two ports on switch per F/X pair.It may not meet
>> the Loop Back problem.On the switch, The IP address will tell the packet
>> which Xeng to go .The structure which The PAPER model using now is the
>> eth_?_gpu port on ROACH connect HPC port directly.Is this only for saving
>> ports on switch?Well,Our switch have 64 ports,If we use the two ports on
>> switch per F/X pair Concept,the ports may be sufficient .
>> This is the question asked by my teacher Wu fengquan. As PAPER provide a lot
>> of ruby control scripts online to use,I'd rather use this model exits
>> now.What should I say to him?Is there some more advantages to use this
>> structure instead the two ports on switch per F/X pair?
>> Thanks for your help!
>> Best wishes!
>> Peter
Re: [casper] NFS setup: TFTP permissions problem
Hi, Michael,

In addition to the other suggestions, you should check whether you are running dnsmasq in "tftp-secure" mode. That might impose ownership and/or permission restrictions. See "man dnsmasq" for more details.

Dave

On Dec 2, 2014, at 6:07 AM, Michael D'Cruze wrote:

> Hi everyone
>
> I'm following the NFS setup guide, and have come across a problem with the
> /srv/roach_boot/boot directory permissions. I restart the dnsmasq service and
> receive the following error:
>
> Starting dnsmasq:
> dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission denied
> [FAILED]
>
> The output of ls -l from /srv/roach_boot is
>
> [root@roach-workstation roach_boot]# ls -l
> total 8
> drwxrwxrwx. 2 root root 4096 Dec 1 16:31 boot
> drwxrwxrwx. 23 root root 4096 Feb 2 2009 etch
>
> and from within /boot is
>
> [root@roach-workstation boot]# ls -l
> total 1360
> -rwxrwxrwx. 1 michael michael 1390149 Dec 1 15:35 uImage-20110812-mmcomitfix
>
> The output of ls --context from within /boot is
>
> [root@roach-workstation boot]# ls --context
> -rwxrwxrwx. michael michael unconfined_u:object_r:tftpdir_t:s0
> uImage-20110812-mmcomitfix
>
> All of these permissions and contexts look correct according to the
> guide, so I'm at a bit of a loss. Has anyone seen this problem before,
> given all of the above conditions?
>
> Does the /boot directory have to have the same context as the uImage file
> within it?
>
> Suggestions or guidance greatly appreciated.
>
> Michael
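One way to act on Dave's suggestion is to list the TFTP-related options that dnsmasq is being given. The sketch below scans dnsmasq.conf-style text for `enable-tftp` and any `tftp-*` option. The sample configuration is hypothetical (it is not taken from the thread), and the helper name `tftp_options` is made up for illustration. According to the dnsmasq man page, `tftp-secure` restricts TFTP to files owned by the user running dnsmasq, so it is worth knowing whether it is set.

```python
def tftp_options(config_text):
    """Collect enable-tftp and tftp-* options from dnsmasq.conf-style text."""
    options = {}
    for raw in config_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        if key == "enable-tftp" or key.startswith("tftp-"):
            # Flag-style options have no value; record them as True.
            options[key] = value.strip() or True
    return options


# Hypothetical excerpt for illustration only.
sample = """
# TFTP settings
enable-tftp
tftp-root=/srv/roach_boot/boot
tftp-secure
"""

opts = tftp_options(sample)
print(opts)
if opts.get("tftp-secure"):
    print("tftp-secure is set: check who owns the files under tftp-root")
```

On a real system the text would come from the dnsmasq configuration file (and any included directory), and the same option can also be passed on the command line, which this sketch does not inspect.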
Re: [casper] ROACH serial connection issues
Hi, Norbert,

If you hit any key to stop autoboot when it says "Hit any key to stop autoboot", does it in fact stop the autoboot? If so, you could use u-boot's "printenv" command to see what commands get run as part of autoboot and then try to run them "by hand" to try to figure out where things go bad.

HTH,
Dave

On Dec 2, 2014, at 12:16 AM, Norbert Bonnici wrote:

> Hi Marc,
>
> The USB dongle's baud rate should have been set properly. When set to
> different baud rates no readable data is received through the serial
> port. Added line wrapping but it didn't change anything.
>
> In addition, recently the communications are being disabled when the
> GND wire is connected to the USB dongle. Data is only being received
> when only the tx and rx wires are connected.
>
> Regards,
> Norbert
>
> On 2 December 2014 at 08:39, Marc Welz wrote:
>>
>> On Mon, Dec 1, 2014 at 2:47 PM, Norbert Bonnici wrote:
>>>
>>> Dear Marc,
>>>
>>> I've have tried all the possible CR+LF combinations.
>>>
>>> Any ideas?
>>
>> Then I am not sure - I know that some USB dongles attempt to autodetect the
>> serial speed - maybe something is going wrong there ? Also, maybe enable
>> line wrapping (Control-A W) might help.
>>
>> BTW: CC'ing the mailing list is good form - it helps others who might have
>> the same problem, and you might also get suggestions from other people
>>
>> regards
>>
>> marc
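If the "printenv" output can be captured from the serial console, the approach Dave describes (find out what autoboot runs, then run each step by hand) can be sketched as below. The environment text is a hypothetical example, not output from a real ROACH, and the helper names are made up. The sketch only assumes the usual `name=value` layout of printenv output and that commands within a variable are separated by `;`.

```python
def parse_printenv(text):
    """Turn captured `printenv` output into a {name: value} dictionary."""
    env = {}
    for line in text.splitlines():
        name, sep, value = line.partition("=")
        # Skip lines without '=' (e.g. the trailing "Environment size" line).
        if sep and name and " " not in name:
            env[name] = value
    return env


def commands(env, name):
    """Split one environment variable into the individual commands it runs."""
    return [c.strip() for c in env.get(name, "").split(";") if c.strip()]


# Hypothetical capture for illustration only.
sample = """\
bootdelay=2
bootcmd=run netboot
netboot=dhcp; tftpboot 0x4000000 uImage; bootm 0x4000000
Environment size: 78/8188 bytes
"""

env = parse_printenv(sample)
for step in commands(env, "netboot"):
    print(step)
```

Typing the printed steps one at a time at the U-Boot prompt shows which of them is the first to fail or hang.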
Re: [casper] Question of chose Correlator Architecture
Hi, Peter,

If you have enough ports on the switch then you certainly can configure things to send the packets directly from the ROACHs to the various X boxes. The pre-built PAPER model will support this if properly configured. The one thing to keep in mind is the IP-to-MAC address table in the 10 GbE cores. These tables are set up by paper_feng_init.rb. You could use the "factory" MAC addresses of the X-box 10 GbE interfaces or you could configure the X boxes to set their 10 GbE MAC addresses to predefined values. If using the latter approach, we often use 02:02:ww:xx:yy:zz where ww:xx:yy:zz corresponds to the IP address.

Hope this helps,
Dave

On Dec 1, 2014, at 5:21 AM, Peter Niu wrote:

> Hi,Dave,
> Thanks for your Document about EQ,and suggestion about the sample rate.Now I
> have a question about the correlator architecture.
> I have saw your PPT :
> Correlator Architectures
> Present and Future
> CASPER Workshop 2011
>
> The structure mentioned in the PPT is the structure PAPER used now .Using a
> set of precise IP assignment to avoid Loop Back is ok. However,If we use the
> Packetized F/X Concept:Uses two ports on switch per F/X pair.It may not meet
> the Loop Back problem.On the switch, The IP address will tell the packet
> which Xeng to go .The structure which The PAPER model using now is the
> eth_?_gpu port on ROACH connect HPC port directly.Is this only for saving
> ports on switch?Well,Our switch have 64 ports,If we use the two ports on
> switch per F/X pair Concept,the ports may be sufficient .
> This is the question asked by my teacher Wu fengquan. As PAPER provide a lot
> of ruby control scripts online to use,I'd rather use this model exits
> now.What should I say to him?Is there some more advantages to use this
> structure instead the two ports on switch per F/X pair?
> Thanks for your help!
> Best wishes!
> Peter
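The 02:02:ww:xx:yy:zz convention Dave mentions can be sketched in a few lines. This assumes that "corresponds to the IP address" means each of the four IPv4 octets written as a two-digit hex byte; the function name and the example address 10.0.1.17 are hypothetical. A first octet of 02 marks the address as locally administered and unicast, which is why it is a safe prefix for self-assigned MACs.

```python
import ipaddress


def mac_from_ip(ip, prefix="02:02"):
    """Build a MAC of the form 02:02:ww:xx:yy:zz from an IPv4 address.

    Assumption: ww:xx:yy:zz are the four IPv4 octets in hexadecimal.
    """
    octets = ipaddress.IPv4Address(ip).packed  # four bytes, network order
    return prefix + ":" + ":".join(f"{b:02x}" for b in octets)


# Hypothetical X-box address for illustration.
print(mac_from_ip("10.0.1.17"))  # 02:02:0a:00:01:11
```

Because the mapping is deterministic, the same function could generate both the values used to configure the X-box interfaces and the entries loaded into the IP-to-MAC table of the 10 GbE cores, keeping the two in agreement.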
Re: [casper] NFS setup: TFTP permissions problem
On Tue, Dec 2, 2014 at 2:07 PM, Michael D'Cruze <michael.dcr...@postgrad.manchester.ac.uk> wrote:

> Starting dnsmasq:
> dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission
> denied
> [FAILED]

In addition to whatever SELinux has gotten in the way of, recent kernels have also changed the behaviour of symbolic links owned by different uids. I think there is an option somewhere in /proc to deal with that. It is unclear whether you will encounter it, but it is something to remember.

regards

marc
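Marc does not name the /proc option. One candidate, and this is an assumption rather than something stated in the thread, is the `fs.protected_symlinks` sysctl exposed at /proc/sys/fs/protected_symlinks on Linux 3.6 and later, which restricts following symlinks in sticky world-writable directories. A minimal sketch for checking whether it exists and what it is set to:

```python
from pathlib import Path


def read_setting(path="/proc/sys/fs/protected_symlinks"):
    """Return the setting as a string, or None if the kernel does not expose it."""
    try:
        return Path(path).read_text().strip()
    except OSError:
        return None


value = read_setting()
if value is None:
    print("protected_symlinks is not available on this kernel")
else:
    print(f"fs.protected_symlinks = {value}")
```

Older kernels, such as the stock kernel of an enterprise distribution from that period, may not have the file at all, in which case the function returns None and this particular restriction does not apply.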
Re: [casper] NFS setup: TFTP permissions problem
Hi Jack,

I'm running RHEL 6.6.

Thanks
Michael

From: Jack Hickish
Sent: Tuesday, December 2, 2014 2:15 PM
To: Michael D'Cruze; casper@lists.berkeley.edu
Subject: Re: [casper] NFS setup: TFTP permissions problem

Hi Michael,

Do you have SELinux running? I've just checked and I get a similar permissions error if I reactivate SELinux on my Centos 6 server.

On Tue Dec 02 2014 at 14:07:45 Michael D'Cruze <michael.dcr...@postgrad.manchester.ac.uk> wrote:

Hi everyone

I'm following the NFS setup guide, and have come across a problem with the /srv/roach_boot/boot directory permissions. I restart the dnsmasq service and receive the following error:

Starting dnsmasq:
dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission denied
[FAILED]

The output of ls -l from /srv/roach_boot is

[root@roach-workstation roach_boot]# ls -l
total 8
drwxrwxrwx. 2 root root 4096 Dec 1 16:31 boot
drwxrwxrwx. 23 root root 4096 Feb 2 2009 etch

and from within /boot is

[root@roach-workstation boot]# ls -l
total 1360
-rwxrwxrwx. 1 michael michael 1390149 Dec 1 15:35 uImage-20110812-mmcomitfix

The output of ls --context from within /boot is

[root@roach-workstation boot]# ls --context
-rwxrwxrwx. michael michael unconfined_u:object_r:tftpdir_t:s0 uImage-20110812-mmcomitfix

All of these permissions and contexts look correct according to the guide, so I'm at a bit of a loss. Has anyone seen this problem before, given all of the above conditions?

Does the /boot directory have to have the same context as the uImage file within it?

Suggestions or guidance greatly appreciated.

Michael
Re: [casper] NFS setup: TFTP permissions problem
Hi Michael,

Do you have SELinux running? I've just checked and I get a similar permissions error if I reactivate SELinux on my Centos 6 server.

On Tue Dec 02 2014 at 14:07:45 Michael D'Cruze <michael.dcr...@postgrad.manchester.ac.uk> wrote:

> Hi everyone
>
> I'm following the NFS setup guide, and have come across a problem with
> the /srv/roach_boot/boot directory permissions. I restart the dnsmasq
> service and receive the following error:
>
> Starting dnsmasq:
> dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission
> denied
> [FAILED]
>
> The output of ls -l from /srv/roach_boot is
>
> [root@roach-workstation roach_boot]# ls -l
> total 8
> drwxrwxrwx. 2 root root 4096 Dec 1 16:31 boot
> drwxrwxrwx. 23 root root 4096 Feb 2 2009 etch
>
> and from within /boot is
>
> [root@roach-workstation boot]# ls -l
> total 1360
> -rwxrwxrwx. 1 michael michael 1390149 Dec 1 15:35
> uImage-20110812-mmcomitfix
>
> The output of ls --context from within /boot is
>
> [root@roach-workstation boot]# ls --context
> -rwxrwxrwx. michael michael unconfined_u:object_r:tftpdir_t:s0
> uImage-20110812-mmcomitfix
>
> All of these permissions and contexts look correct according to the
> guide, so I'm at a bit of a loss. Has anyone seen this problem before,
> given all of the above conditions?
>
> Does the /boot directory have to have the same context as the uImage
> file within it?
>
> Suggestions or guidance greatly appreciated.
>
> Michael
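Michael's listing shows the SELinux label of the uImage file but not of the /boot directory itself, which is what his last question is about. The sketch below splits a label into its user, role, type and level fields and compares the type field of two labels. The file label is the one from the thread; the directory label is hypothetical, since the thread never shows it, and the helper names are made up.

```python
from typing import NamedTuple


class SELinuxContext(NamedTuple):
    user: str
    role: str
    setype: str
    level: str


def parse_context(label):
    """Split 'user:role:type:level'; the level may itself contain colons."""
    user, role, setype, level = label.split(":", 3)
    return SELinuxContext(user, role, setype, level)


def same_type(label_a, label_b):
    """True when both labels carry the same SELinux type field."""
    return parse_context(label_a).setype == parse_context(label_b).setype


file_label = "unconfined_u:object_r:tftpdir_t:s0"  # from the thread
dir_label = "unconfined_u:object_r:var_t:s0"       # hypothetical directory label

print(parse_context(file_label).setype)
print(same_type(file_label, dir_label))
```

The directory's actual label can be printed with `ls -dZ /srv/roach_boot/boot`; if its type differs from that of the file, that mismatch is a plausible place to look given Jack's observation about SELinux.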
[casper] NFS setup: TFTP permissions problem
Hi everyone

I'm following the NFS setup guide, and have come across a problem with the /srv/roach_boot/boot directory permissions. I restart the dnsmasq service and receive the following error:

Starting dnsmasq:
dnsmasq: TFTP directory /srv/roach_boot/boot inaccessible: Permission denied
[FAILED]

The output of ls -l from /srv/roach_boot is

[root@roach-workstation roach_boot]# ls -l
total 8
drwxrwxrwx. 2 root root 4096 Dec 1 16:31 boot
drwxrwxrwx. 23 root root 4096 Feb 2 2009 etch

and from within /boot is

[root@roach-workstation boot]# ls -l
total 1360
-rwxrwxrwx. 1 michael michael 1390149 Dec 1 15:35 uImage-20110812-mmcomitfix

The output of ls --context from within /boot is

[root@roach-workstation boot]# ls --context
-rwxrwxrwx. michael michael unconfined_u:object_r:tftpdir_t:s0 uImage-20110812-mmcomitfix

All of these permissions and contexts look correct according to the guide, so I'm at a bit of a loss. Has anyone seen this problem before, given all of the above conditions?

Does the /boot directory have to have the same context as the uImage file within it?

Suggestions or guidance greatly appreciated.

Michael
Re: [casper] ROACH serial connection issues
Hi Marc,

The USB dongle's baud rate should have been set properly. When set to different baud rates no readable data is received through the serial port. Added line wrapping but it didn't change anything.

In addition, recently the communications are being disabled when the GND wire is connected to the USB dongle. Data is only being received when only the tx and rx wires are connected.

Regards,
Norbert

On 2 December 2014 at 08:39, Marc Welz wrote:
>
> On Mon, Dec 1, 2014 at 2:47 PM, Norbert Bonnici wrote:
>>
>> Dear Marc,
>>
>> I've have tried all the possible CR+LF combinations.
>>
>> Any ideas?
>
> Then I am not sure - I know that some USB dongles attempt to autodetect the
> serial speed - maybe something is going wrong there ? Also, maybe enable
> line wrapping (Control-A W) might help.
>
> BTW: CC'ing the mailing list is good form - it helps others who might have
> the same problem, and you might also get suggestions from other people
>
> regards
>
> marc