thomas loubrieu created SDAP-225: ------------------------------------ Summary: max number of parrallel ingestion jobs in kubernetes Key: SDAP-225 URL: https://issues.apache.org/jira/browse/SDAP-225 Project: Apache Science Data Analytics Platform Issue Type: Bug Components: nexus Reporter: thomas loubrieu
While doing the ingestion on the podaac dev cluster (master node podaac-devk8s) and configuring the number of parallel ingestion jobs to 200, the cluster became unresponsive and we had to restart 2 of the 3 nodes. We need to understand why this happened whereas kubernetes would be supposed to keep the job in a pending status until enought resources are available. Apparently the cpu or memory were doing ok. Might be the network, or the number of NFS mount points. This need to be re-produced in DEV environment after the deployment is made and working in TEST environment. -- This message was sent by Atlassian Jira (v8.3.4#803005)