Skip to content

Conversation

@e-nugmanova
Copy link
Contributor

@e-nugmanova e-nugmanova commented May 28, 2024

Details:

  • relax restriction on L from query and attention_mask being equal

Tickets:

@e-nugmanova e-nugmanova requested a review from a team as a code owner May 28, 2024 12:54
@github-actions github-actions bot added the category: Core OpenVINO Core (aka ngraph) label May 28, 2024
@e-nugmanova e-nugmanova changed the title [SDPA] relax restriction on L from query and attention_master being equal [SDPA] relax restriction on L from query and attention_mask being equal May 28, 2024
Copy link
Contributor

@mitruska mitruska left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, but would be great to cover this change with corresponding tests.
The SDPA spec clarification can be handled as a follow up.

@p-durandin p-durandin added this pull request to the merge queue Jun 3, 2024
Merged via the queue into openvinotoolkit:master with commit fc0b82c Jun 3, 2024
allnes pushed a commit to allnes/openvino that referenced this pull request Jun 27, 2024
…al (openvinotoolkit#24745)

### Details:
 - *relax restriction on L from query and attention_mask being equal*

### Tickets:
 - *CVS-129000*
github-merge-queue bot pushed a commit that referenced this pull request Jul 1, 2024
### Details:
- Update attention mask expected shape, allow for broadcast (aligned
with shape_infer, decomposition, and CPU plugin)
 - Fix existing batch broadcast example
 - Update `scale` input shape info (allow for single element 1D tensor)
 - related PRs: 
  #24745
 #25213
 

### Tickets:
 - 143009
github-merge-queue bot pushed a commit that referenced this pull request Jul 1, 2024
… input broadcast (#25213)

### Details:
- More ScaledDotProductAttention tests to cover attention mask input
broadcast
 - related PRs: 
 #24745
 #25213
 

### Tickets:
 - 143009
AsyaPronina pushed a commit to AsyaPronina/openvino that referenced this pull request Jul 1, 2024
…it#25208)

### Details:
- Update attention mask expected shape, allow for broadcast (aligned
with shape_infer, decomposition, and CPU plugin)
 - Fix existing batch broadcast example
 - Update `scale` input shape info (allow for single element 1D tensor)
 - related PRs: 
  openvinotoolkit#24745
 openvinotoolkit#25213
 

### Tickets:
 - 143009
AsyaPronina pushed a commit to AsyaPronina/openvino that referenced this pull request Jul 1, 2024
… input broadcast (openvinotoolkit#25213)

### Details:
- More ScaledDotProductAttention tests to cover attention mask input
broadcast
 - related PRs: 
 openvinotoolkit#24745
 openvinotoolkit#25213
 

### Tickets:
 - 143009
AsyaPronina pushed a commit to AsyaPronina/openvino that referenced this pull request Jul 1, 2024
…it#25208)

### Details:
- Update attention mask expected shape, allow for broadcast (aligned
with shape_infer, decomposition, and CPU plugin)
 - Fix existing batch broadcast example
 - Update `scale` input shape info (allow for single element 1D tensor)
 - related PRs: 
  openvinotoolkit#24745
 openvinotoolkit#25213
 

### Tickets:
 - 143009
AsyaPronina pushed a commit to AsyaPronina/openvino that referenced this pull request Jul 1, 2024
… input broadcast (openvinotoolkit#25213)

### Details:
- More ScaledDotProductAttention tests to cover attention mask input
broadcast
 - related PRs: 
 openvinotoolkit#24745
 openvinotoolkit#25213
 

### Tickets:
 - 143009
@e-nugmanova e-nugmanova deleted the sdpa_shape branch May 11, 2025 12:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: Core OpenVINO Core (aka ngraph)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants