zhangzhaoqi created SINGA-506:
---------------------------------

             Summary: add autograd operators for NLP models
                 Key: SINGA-506
                 URL: https://issues.apache.org/jira/browse/SINGA-506
             Project: Singa
          Issue Type: New Feature
            Reporter: zhangzhaoqi


*We are going to support these three NLP models, called, Bidirectional 
Attention Flow, BERT-Squad and GPT-2.*

*Totally, there are still 21 operators that we need to add as following,*
ArgMax
Cast
CategoryMapper
Ceil
Compress
ConstantOfShape
Dropout
Gather
Hardmax
Identity
NonZero
ReduceMax
ReduceMean
ReduceSum
Scan
Shape
Slice
Split
Squeeze
Tile
Transpose

 

*For details, these 21 operators belong to these three models separately:*

*Bidirectional Attention Flow:*
ArgMax
Cast
CategoryMapper
Ceil
Compress
ConstantOfShape
Dropout
Gather
Hardmax
ReduceMax
ReduceSum
Scan
Shape
Slice
Squeeze
Transpose


*BERT-Squad:*
Slice
Squeeze
Shape
Identity
Gather
ReduceMean
Cast
Tile
Transpose
Split

*GPT-2:*
ConstantOfShape
Slice
Shape
Gather
ReduceMean
NonZero
Cast
Transpose
Split

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to