rubbberrabbit opened a new issue, #21019:
URL: https://github.com/apache/incubator-mxnet/issues/21019

   ## Description
   Hello, we try to use keras as the front-end to run Mxnet, but find several 
Mxnet crashes, we are not should if there is a real bug trigger by those model, 
so we collected the Execution stack information when Mxnet crashes, most of 
them are related to libmxnet.so which is hard to compile in debug mode. Here is 
the list of the Mxnet version and Execution stack information of our models.
   
   Further, the triggering-crash models and replay script is provided in 
https://drive.google.com/drive/folders/1he3I-1PKGI01t09E2FAin0_mUnmu2-oz?usp=sharing
   
   ### Error Message
   Some of stack informations are shown below
   
![image-20220504220358522](https://user-images.githubusercontent.com/38725110/167261868-b68150ca-2519-4d67-8e59-6e42b09e79e1.png)
   
![image-20220504220358524](https://user-images.githubusercontent.com/38725110/167262002-d7fb99a4-c938-4880-b821-5902b8ced9df.png)
   
![image-20220504222003296](https://user-images.githubusercontent.com/38725110/167262007-75d30b05-9199-4964-a1f7-88977ec0ab5f.png)
   
   
   
   
   ## To Reproduce
   
   
   ### Steps to reproduce
   (Paste the commands you ran that produced the error.)
   
   1. Download the scripts and models from the cloud links
   2. Chang the path in /scripts/bugs_replay.conf into the path of model folder
   3. Run mxnet_test.py in the corresponding environment
    
   ## What have you tried to solve it?
   
   To analysis the Execution stack information in libmxnet.so, we try to 
compile Mxnet with choice Debug=1 in config.mk but face a error report of 
"relocation trcuncated to fit" in several different environments. we assume 
that is because too much redundant code is added when compiling with DEBUG 
mode. 
   
   ## Environment
   
   <details>
   <summary>Environment Information</summary>
    Mxnet 1.5.1 Keras-Mxnet 2.2.4.2 CUDA 10.1 python 3.6.12
   
    Mxnet 1.4.1 Keras-Mxnet 2.2.4.2 CUDA 10.0 python 3.6.12 
    Mxnet 1.3.1 Keras-Mxnet 2.2.4.2 CUDA 9.0 python 3.6.12 
    
   
   
   </details>
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to