Streamlining of Scaled Dot-Product Attention #901
Merged
auphelia merged 22 commits into Xilinx:dev from May 27, 2025
Trying ideas and bug fixes for streamlining the scaled dot-product attention operator. Related to issue/discussion #878
- `MoveScalarMulPastMatMul` for two-input join-node matmuls
- `Absorb1BitMulIntoMatMul` and `Absorb1BitMulIntoConv`: test for the presence of weight initializers
- `InferShapes` fails after `FoldTransposeIntoQuantInit`
- Avoid `MoveScalarAddPastMatMul` by preferring `AbsorbSignBiasIntoMultiThreshold`
- `FoldQuantWeights` transformation currently propagating shapes backwards and maybe generating the inverse of the scale factor
- `AbsorbAddIntoMultiThreshold` transformation assuming an input and initializer order which might not always hold true
- Fix (and include?) the `MoveLinearPastEltwiseAdd` transformation, which does not correctly propagate the shapes (seems to be fixed by fixing one of the other issues; was probably caused by faulty rewiring of the graph in `FoldQuantWeights`; transformation seems not to be required anymore, maybe reopen)
- Suggest Brevitas to change all the quantizers to signed quantizers to be FINN compatible
- Suggest Brevitas to change the order of the quantizer and the transpose of the key matrix to make detecting the pattern easier and treat all three inputs the same
- `RemoveIdentityOps` handling for fork-node producer
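The first item above rests on a simple algebraic fact: scalar multiplications on either input of a MatMul commute with it, so both scales can be streamlined past the MatMul and fused into a single scalar multiplication at the output. A minimal NumPy sketch of the identity (an illustration only, not the actual FINN transformation code):

```python
import numpy as np

# (a * A) @ (b * B) == (a * b) * (A @ B)
# This is why moving a scalar Mul past a two-input join-node MatMul
# is a valid streamlining step: the scales a and b are just scalars,
# so they factor out of the matrix product.

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 8))   # e.g. query-side input
B = rng.standard_normal((8, 4))   # e.g. key-side input
a, b = 0.5, 3.0                   # scalar scale factors, e.g. quantizer scales

before = (a * A) @ (b * B)        # scalar Muls upstream of the MatMul
after = (a * b) * (A @ B)         # single fused scalar Mul downstream

assert np.allclose(before, after)
```

The same reasoning does not extend to non-scalar (per-channel) scales on both inputs, since those do not in general factor out of the product, which is one reason the transformation has to check that the Mul inputs really are scalars.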