[ 
https://issues.apache.org/jira/browse/YARN-5983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhankun Tang updated YARN-5983:
-------------------------------
    Description: 
As various big data workloads run on YARN, CPU will eventually no longer
scale, and heterogeneous systems will become more important. ML/DL has been a
rising star in recent years, and applications focused on these areas have to
utilize GPU or FPGA to boost performance. Hardware vendors such as Intel also
invest in such hardware. It is most likely that FPGA will become as popular
in data centers as CPU in the near future.

So it would be great for YARN, as a resource management and scheduling system,
to evolve to support this. This JIRA proposes making FPGA a first-class
citizen. The changes roughly include:
1. FPGA resource detection and heartbeat
2. Scheduler changes (YARN-3926 involved)
3. FPGA-related preparation and isolation before container launch
We know that YARN-3926 is trying to extend the current resource model, but we
can still leave some FPGA-related discussion here.

  was:
As various big data workloads run on YARN, CPU will eventually no longer
scale, and heterogeneous systems will become more important. ML/DL has been a
rising star in recent years, and applications focused on these areas have to
utilize GPU or FPGA to boost performance. Hardware vendors such as Intel also
invest in such hardware. It is most likely that FPGA will become as popular
in data centers as CPU in the near future.

So it would be great for YARN, as a resource management and scheduling system,
to evolve to support this. This JIRA proposes making FPGA a first-class
citizen. The changes roughly include:
1. FPGA resource detection and heartbeat
2. Scheduler changes
3. FPGA-related preparation and isolation before container launch
We know that YARN-3926 is trying to extend the current resource model, but we
can still leave some FPGA-related discussion here.


> [Umbrella] Support for FPGA as a Resource in YARN
> -------------------------------------------------
>
>                 Key: YARN-5983
>                 URL: https://issues.apache.org/jira/browse/YARN-5983
>             Project: Hadoop YARN
>          Issue Type: New Feature
>          Components: yarn
>            Reporter: Zhankun Tang
>            Assignee: Zhankun Tang
>         Attachments: YARN-5983-Support-FPGA-resource-on-NM-side_v1.pdf, 
> YARN-5983-implementation-notes.pdf, YARN-5983_end-to-end_test_report.pdf
>
>
> As various big data workloads run on YARN, CPU will eventually no longer
> scale, and heterogeneous systems will become more important. ML/DL has been
> a rising star in recent years, and applications focused on these areas have
> to utilize GPU or FPGA to boost performance. Hardware vendors such as Intel
> also invest in such hardware. It is most likely that FPGA will become as
> popular in data centers as CPU in the near future.
> So it would be great for YARN, as a resource management and scheduling
> system, to evolve to support this. This JIRA proposes making FPGA a
> first-class citizen. The changes roughly include:
> 1. FPGA resource detection and heartbeat
> 2. Scheduler changes (YARN-3926 involved)
> 3. FPGA-related preparation and isolation before container launch
> We know that YARN-3926 is trying to extend the current resource model, but
> we can still leave some FPGA-related discussion here.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
