thomas loubrieu created SDAP-225:
------------------------------------
Summary: max number of parrallel ingestion jobs in kubernetes
Key: SDAP-225
URL: https://issues.apache.org/jira/browse/SDAP-225
Project: Apache Science Data Analytics Platform
Issue Type: Bug
Components: nexus
Reporter: thomas loubrieu
While doing the ingestion on the podaac dev cluster (master node podaac-devk8s)
and configuring the number of parallel ingestion jobs to 200, the cluster
became unresponsive and we had to restart 2 of the 3 nodes.
We need to understand why this happened whereas kubernetes would be supposed to
keep the job in a pending status until enought resources are available.
Apparently the cpu or memory were doing ok. Might be the network, or the number
of NFS mount points.
This need to be re-produced in DEV environment after the deployment is made and
working in TEST environment.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)