[jira] [Created] (YUNIKORN-722) Refactor YuniKorn core's queue-level resource metrics

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-722:
-

 Summary: Refactor YuniKorn core's queue-level resource metrics
 Key: YUNIKORN-722
 URL: https://issues.apache.org/jira/browse/YUNIKORN-722
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - common
Reporter: Chenya Zhang
Assignee: Chenya Zhang


To make YuniKorn core's queue resource metrics and their related functions more 
meaningful:
 * Refactor the related metrics operation functions and add any that are missing
 * Use meaningful metric names and help messages
 * Update inline comments and documentation



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@yunikorn.apache.org
For additional commands, e-mail: dev-h...@yunikorn.apache.org



[jira] [Created] (YUNIKORN-721) Refactor YuniKorn core's queue-level application metrics

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-721:
-

 Summary: Refactor YuniKorn core's queue-level application metrics
 Key: YUNIKORN-721
 URL: https://issues.apache.org/jira/browse/YUNIKORN-721
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - common
Reporter: Chenya Zhang
Assignee: Chenya Zhang


To make YuniKorn core's queue "app_metrics" and related functions more 
meaningful:
 * Differentiate queue-level metrics from scheduler metrics, e.g. by using 
"IncQueueTotalApplicationsAccepted"
 * Refactor related functions
 * Update inline comments and documentation
 * Use meaningful metric names and help messages






[jira] [Created] (YUNIKORN-720) Add and improve queue metrics throughout the scheduling cycle

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-720:
-

 Summary: Add and improve queue metrics throughout the scheduling cycle
 Key: YUNIKORN-720
 URL: https://issues.apache.org/jira/browse/YUNIKORN-720
 Project: Apache YuniKorn
  Issue Type: Improvement
  Components: core - common
Reporter: Chenya Zhang
Assignee: Chenya Zhang


Add and improve queue metrics throughout the scheduling cycle






[jira] [Created] (YUNIKORN-719) Refactor YuniKorn core's scheduler metrics for sorting latency

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-719:
-

 Summary: Refactor YuniKorn core's scheduler metrics for sorting latency
 Key: YUNIKORN-719
 URL: https://issues.apache.org/jira/browse/YUNIKORN-719
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - scheduler
Reporter: Chenya Zhang
Assignee: Chenya Zhang


Defining "nodeSortingLatency", "appSortingLatency", and "queueSortingLatency" 
separately to initialize scheduler metrics is redundant.
 * They can be combined into a single "sortingLatency" metric
 * The combined metric can use a "prometheus.CounterVec", with a Prometheus 
label distinguishing each case






[jira] [Created] (YUNIKORN-718) Redefine YuniKorn core's scheduler metrics names and help messages for usability and clarity

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-718:
-

 Summary: Redefine YuniKorn core's scheduler metrics names and help messages for usability and clarity
 Key: YUNIKORN-718
 URL: https://issues.apache.org/jira/browse/YUNIKORN-718
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - scheduler
Reporter: Chenya Zhang
Assignee: Chenya Zhang


Some issues observed:
 * Metric names are not meaningful
 * Help messages lack detail
 * Prometheus labels are not consistent

These need to be improved for usability and clarity.






[jira] [Created] (YUNIKORN-717) Refactor YuniKorn core's scheduler metrics for node status

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-717:
-

 Summary: Refactor YuniKorn core's scheduler metrics for node status
 Key: YUNIKORN-717
 URL: https://issues.apache.org/jira/browse/YUNIKORN-717
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - scheduler
Reporter: Chenya Zhang
Assignee: Chenya Zhang


Defining "totalNodeActive" and "totalNodeFailed" separately to initialize 
scheduler metrics is redundant.
 * They can be combined into a single "nodeStatus" metric
 * The combined metric can use a "prometheus.CounterVec", with a Prometheus 
label distinguishing each case






[jira] [Created] (YUNIKORN-716) Refactor YuniKorn core's scheduler metrics for application running and completed

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-716:
-

 Summary: Refactor YuniKorn core's scheduler metrics for application running and completed
 Key: YUNIKORN-716
 URL: https://issues.apache.org/jira/browse/YUNIKORN-716
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - scheduler
Reporter: Chenya Zhang
Assignee: Chenya Zhang


Defining "totalApplicationRunning" and "totalApplicationCompleted" separately 
to initialize scheduler metrics is redundant.
 * They can be combined into a single "application" metric
 * The combined metric can use a "prometheus.CounterVec", with a Prometheus 
label distinguishing each case






[jira] [Created] (YUNIKORN-715) Refactor YuniKorn core's scheduler metrics for container allocation

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-715:
-

 Summary: Refactor YuniKorn core's scheduler metrics for container allocation
 Key: YUNIKORN-715
 URL: https://issues.apache.org/jira/browse/YUNIKORN-715
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - scheduler
Reporter: Chenya Zhang
Assignee: Chenya Zhang


Defining "allocatedContainers", "rejectedContainers", "schedulingErrors" and 
"releasedContainers" separately to initialize scheduler metrics is redundant.
 * They can be combined into a single "containerAllocation" metric
 * The combined metric can use a "prometheus.CounterVec", with a Prometheus 
label distinguishing each case






[jira] [Created] (YUNIKORN-714) Refactor YuniKorn core's scheduler metrics for application submission

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-714:
-

 Summary: Refactor YuniKorn core's scheduler metrics for application submission
 Key: YUNIKORN-714
 URL: https://issues.apache.org/jira/browse/YUNIKORN-714
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - scheduler
Reporter: Chenya Zhang
Assignee: Chenya Zhang


Defining "totalApplicationsAccepted" and "totalApplicationsRejected" separately 
to initialize scheduler metrics is redundant.

They can be combined into a single "applicationSubmissions" metric that uses a 
"prometheus.CounterVec", with a Prometheus label distinguishing the two cases.






[jira] [Created] (YUNIKORN-713) Align YuniKorn core's scheduler metrics with queue metrics for total applications accepted

2021-06-21 Thread Chenya Zhang (Jira)
Chenya Zhang created YUNIKORN-713:
-

 Summary: Align YuniKorn core's scheduler metrics with queue metrics for total applications accepted
 Key: YUNIKORN-713
 URL: https://issues.apache.org/jira/browse/YUNIKORN-713
 Project: Apache YuniKorn
  Issue Type: Sub-task
  Components: core - scheduler
Reporter: Chenya Zhang
Assignee: Chenya Zhang


The metrics naming and operations in the code are sometimes confusing because 
similar concepts are not named consistently.
 * Replace "totalApplicationsAdded" with "totalApplicationsAccepted"
 * Update related metrics operation functions






[jira] [Created] (YUNIKORN-711) Add license file to the release repo

2021-06-21 Thread Chaoran Yu (Jira)
Chaoran Yu created YUNIKORN-711:
---

 Summary: Add license file to the release repo
 Key: YUNIKORN-711
 URL: https://issues.apache.org/jira/browse/YUNIKORN-711
 Project: Apache YuniKorn
  Issue Type: Improvement
  Components: release
Reporter: Chaoran Yu


Add Apache 2.0 license file to the release repo. The file should reside at the 
project root directory.






[jira] [Created] (YUNIKORN-712) Add license file to the site repo

2021-06-21 Thread Chaoran Yu (Jira)
Chaoran Yu created YUNIKORN-712:
---

 Summary: Add license file to the site repo
 Key: YUNIKORN-712
 URL: https://issues.apache.org/jira/browse/YUNIKORN-712
 Project: Apache YuniKorn
  Issue Type: Improvement
  Components: website
Reporter: Chaoran Yu


Add Apache 2.0 license file to the site repo. The file should reside at the 
project root directory.






[jira] [Reopened] (YUNIKORN-3) Add scheduling metrics throughout the scheduling cycle

2021-06-21 Thread Chenya Zhang (Jira)


 [ https://issues.apache.org/jira/browse/YUNIKORN-3?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chenya Zhang reopened YUNIKORN-3:
-

The scheduling metrics need further improvement. Creating subtasks for that work.

> Add scheduling metrics throughout the scheduling cycle
> --
>
> Key: YUNIKORN-3
> URL: https://issues.apache.org/jira/browse/YUNIKORN-3
> Project: Apache YuniKorn
> Issue Type: Improvement
> Components: core - common
> Reporter: Wilfred Spiegelenburg
> Assignee: Chenya Zhang
> Priority: Critical
> Fix For: 0.11
>
>
> The current metrics collection is limited to a small number of collection 
> points.
> We need to add metric collection throughout the process.
> See PR89 for details.


