Did you use something like iperf with a long and heavy load? A bad NIC or
driver might cause this, so it might still be the network.
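
If iperf is not at hand on both machines, a rough stand-in can be scripted.
The following is only a minimal sketch in Python, not an iperf replacement:
it pushes data over one TCP connection for a fixed time and returns any
socket error instead of hiding it. The function names, the 64 KiB block size
and the loopback demo are my own assumptions for illustration. For a real
test the receiver would run on one server and the sender on the other, for
hours rather than seconds.

```python
import socket
import threading
import time


def run_receiver(server_sock, result):
    """Accept one connection and count bytes until the sender closes it."""
    conn, _ = server_sock.accept()
    total = 0
    with conn:
        while True:
            chunk = conn.recv(65536)
            if not chunk:  # empty read means the peer closed the connection
                break
            total += len(chunk)
    result["received"] = total


def run_sender(host, port, seconds, block_size=65536):
    """Send zero-filled blocks for `seconds`; return (bytes_sent, error)."""
    payload = bytes(block_size)
    sent = 0
    deadline = time.monotonic() + seconds
    try:
        with socket.create_connection((host, port)) as sock:
            while time.monotonic() < deadline:
                sock.sendall(payload)
                sent += block_size
    except OSError as exc:  # e.g. a connection reset or broken pipe
        return sent, exc
    return sent, None


def loopback_demo(seconds=1.0):
    """Run both ends on 127.0.0.1 just to show the moving parts."""
    server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    server.listen(1)
    port = server.getsockname()[1]
    result = {}
    receiver = threading.Thread(target=run_receiver, args=(server, result))
    receiver.start()
    sent, error = run_sender("127.0.0.1", port, seconds)
    receiver.join()
    server.close()
    return sent, result.get("received"), error
```

Calling loopback_demo(1.0) returns the bytes sent, the bytes received and the
error (None if the connection survived). On the real link, an error that only
shows up after a long run would point in the same direction as the aborted
replication sessions.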

On Mon, May 13, 2019 at 4:15 PM Bjørn Nachtwey <bjoern.nacht...@gwdg.de>
wrote:

> Hi all,
>
> we planned to switch from COPYPOOL to replication to keep a second
> copy of the data. We therefore bought a new server that should become
> the primary TSM/ISP server, with the old one holding the
> replicas.
>
> what we did:
>
> we started by exporting the nodes, which worked well. But as even the
> "incremental" exports took some time, we set up replication from the
> old server "A" to the new one "B". For all nodes already exported, we set
> up replication in the opposite direction: TSM "B" replicates them to TSM "A".
>
> well, the replication jobs did not finish, and some data and files were
> missing as long as we replicated using a node group. Now we run
> replication for each single node and it works -- for most of them :-(
>
> When replicating the "bad" nodes from "TSM A" to "TSM B", the sessions
> first hang for many minutes, sometimes even hours, and then get
> "terminated - forced by administrator" (ANR0483W), e.g.:
>
> 05/13/2019 15:23:16    ANR2017I Administrator GK issued command:
> REPLICATE NODE vsbck  (SESSION: 26128)
> 05/13/2019 15:23:16    ANR1626I The previous message (message number
> 2017) was repeated 1 times.
> 05/13/2019 15:23:16    ANR0984I Process 494 for Replicate Node started
> in the BACKGROUND at 15:23:16. (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:16    ANR2110I REPLICATE NODE started as process 494.
> (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:16    ANR0408I Session 26184 started for server SM283
> (Linux/x86_64) (Tcp/Ip) for replication.  (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:16    ANR0408I Session 26185 started for server SM283
> (Linux/x86_64) (Tcp/Ip) for replication.  (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:16    ANR0408I Session 26186 started for server SM283
> (Linux/x86_64) (Tcp/Ip) for replication.  (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:17    ANR0408I Session 26187 started for server SM283
> (Linux/x86_64) (Tcp/Ip) for replication.  (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:17    ANR0408I Session 26188 started for server SM283
> (Linux/x86_64) (Tcp/Ip) for replication.  (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:17    ANR0408I Session 26189 started for server SM283
> (Linux/x86_64) (Tcp/Ip) for replication.  (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:17    ANR0408I Session 26190 started for server SM283
> (Linux/x86_64) (Tcp/Ip) for replication.  (SESSION: 26128, PROCESS: 494)
> 05/13/2019 15:23:17    ANR0408I Session 26191 started for server SM283
> (Linux/x86_64) (Tcp/Ip) for replication.  (SESSION: 26128, PROCESS: 494)
>
> 05/13/2019 15:24:57    ANR0483W Session 26187 for node SM283
> (Linux/x86_64) terminated - forced by administrator. (SESSION: 26128,
> PROCESS: 494)
>
> on the target server we observe at that time:
>
> 13.05.2019 15:25:51 ANR8213E Socket 34 aborted due to send error; error
> 104.
> 13.05.2019 15:25:51 ANR3178E A communication error occurred during
> session 65294 with replication server TSM.
> 13.05.2019 15:25:51 ANR0479W Session 65294 for server TSM (Windows)
> terminated - connection with server severed.
> 13.05.2019 15:25:51 ANR8213E Socket 34 aborted due to send error; error 32.
>
> => Any idea why this replication aborts?
>
> => Why is there a "socket aborted" error?
>
>
> well, we already opened an SR case and sent lots of logs and traces. As IBM
> suspects a network problem, both servers now use a cross-link connection
> with nothing but NICs/GBICs, plugs and wires in between.
>
> thanks & best
>
> Bjørn
>
> --
>
> --------------------------------------------------------------------------------------------------
> Bjørn Nachtwey
>
> Arbeitsgruppe "IT-Infrastruktur"
> Tel.: +49 551 201-2181, E-Mail:bjoern.nacht...@gwdg.de
>
> --------------------------------------------------------------------------------------------------
> Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen (GWDG)
> Am Faßberg 11, 37077 Göttingen, URL:http://www.gwdg.de
> Tel.: +49 551 201-1510, Fax: +49 551 201-2150, E-Mail:g...@gwdg.de
> Service-Hotline: Tel.: +49 551 201-1523, E-Mail:supp...@gwdg.de
> Managing Director: Prof. Dr. Ramin Yahyapour
> Chairman of the Supervisory Board: Prof. Dr. Christian Griesinger
> Registered office: Göttingen
> Court of registration: Göttingen, commercial register no. B 598
>
> --------------------------------------------------------------------------------------------------
> Certified according to ISO 9001
>
> --------------------------------------------------------------------------------------------------
>
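
One more note on the two error numbers in the quoted ANR8213E lines from the
target server. Assuming they are plain Linux errno values (the target shows up
as Linux/x86_64 in the source log, but that they are errno values is my
assumption), 104 is ECONNRESET and 32 is EPIPE. In other words, the peer or
something on the path reset the connection, and a later send hit the already
dead socket. A minimal sketch for pulling these out of an activity log; the
lookup table and the function name are made up for illustration and cover
only these two codes:

```python
import re

# Linux errno values only; other platforms number these differently.
LINUX_ERRNO = {
    32: ("EPIPE", "Broken pipe"),
    104: ("ECONNRESET", "Connection reset by peer"),
}

# \s+ lets the error number sit on the next line when the log was wrapped.
SEND_ERROR = re.compile(
    r"ANR8213E Socket (\d+) aborted due to send error;\s*error\s+(\d+)"
)


def decode_send_errors(log_text):
    """Return (socket, errno, name, meaning) for each ANR8213E send error."""
    decoded = []
    for sock, err in SEND_ERROR.findall(log_text):
        name, meaning = LINUX_ERRNO.get(
            int(err), ("unknown", "not in this table")
        )
        decoded.append((int(sock), int(err), name, meaning))
    return decoded
```

Fed the two ANR8213E lines from the mail (without the "> " quote prefix), this
yields one ECONNRESET and one EPIPE entry for socket 34. That fits a severed
connection, though it does not say which side or which component cut it.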
