Commit

[LinalgExt] Implement PartialReductionOpInterface for OnlineAttentionOp (#19684)
Groverkss authored Jan 14, 2025
1 parent 3c963dd commit 6fd0fd0
Showing 3 changed files with 462 additions and 15 deletions.
@@ -649,7 +649,12 @@ def IREELinalgExt_OnlineAttentionOp : IREELinalgExt_PureOp<"online_attention",
 ["getIterationDomain",
 "getLoopIteratorTypes",
 "getResultTilePosition",
-"getTiledImplementation"]>]> {
+"getTiledImplementation"]>,
+DeclareOpInterfaceMethods<PartialReductionOpInterface,
+["generateInitialTensorForPartialReduction",
+"tileToPartialReduction",
+"mergeReductions",
+"getPartialResultTilePosition"]>]> {
 let summary = "Online Attention operator";
 let description = [{
 Traditional scaled dot product attention computes:
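
For orientation, the four PartialReductionOpInterface methods declared in this hunk correspond roughly to creating an identity partial result, computing a partial result per slice of the reduction dimension, merging partial results, and locating a partial result tile. The Python sketch below illustrates only the arithmetic of the first three for a single query row, assuming the standard online-softmax formulation (running max, running sum, unnormalized accumulator). All function names are hypothetical, the softmax scale is assumed to be already folded into `q`, and none of this is taken from the IREE implementation.

```python
import math


def initial_partial(dim):
    # Hypothetical identity element: max = -inf, sum = 0, accumulator = 0.
    return float("-inf"), 0.0, [0.0] * dim


def partial_attention(q, keys, values):
    # Partial result for one slice of the key/value sequence.
    # Returns (slice max, sum of exp(score - max), unnormalized weighted values).
    scores = [sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    m = max(scores)
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    acc = [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]
    return m, total, acc


def merge_partials(a, b):
    # Combine two partial results by rescaling both to a common max.
    m_a, s_a, acc_a = a
    m_b, s_b, acc_b = b
    m = max(m_a, m_b)
    scale_a = math.exp(m_a - m)
    scale_b = math.exp(m_b - m)
    s = s_a * scale_a + s_b * scale_b
    acc = [x * scale_a + y * scale_b for x, y in zip(acc_a, acc_b)]
    return m, s, acc


def finalize(partial):
    # Normalize the accumulator by the running sum to get the attention output.
    _, s, acc = partial
    return [x / s for x in acc]


def reference_attention(q, keys, values):
    # Unsplit softmax(q . K^T) V for one query row, used for comparison.
    return finalize(partial_attention(q, keys, values))
```

Splitting the keys and values into slices, calling `partial_attention` on each, folding the results with `merge_partials` (optionally starting from `initial_partial`), and then calling `finalize` should agree with `reference_attention` up to floating-point rounding.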