@akote123 akote123 commented Jan 17, 2025

SVE intrinsic code is added to improve the performance of the SDDMMCOO Op when broadcasting is disabled and reduce_size=1.

| Input shapes, 32C (ms) | No. calls | Default | With change |
| --- | --- | --- | --- |
| [240398, 128], [240398, 128] | 10 | 94.093 | 62.38 |
| [24039, 128], [24039, 128] | 10 | 14.069 | 12.015 |
| [1000, 200], [1000, 128] | 10 | 3.96 | 3.54 |
| [1000, 2096], [1000, 4096] | 10 | 14.1 | 10.9 |

Results were benchmarked on a c7g.8xlarge machine.
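For readers without the diff at hand, the case described above (no broadcasting, reduce_size=1) amounts to an elementwise operation between one feature row per edge endpoint. The following is a minimal sketch of that idea, not the code from this PR: the function names `MulRow` and `SddmmCooMul`, the choice of multiplication as the binary op, and the float32 row-major layout are all assumptions made for illustration. The SVE path is guarded so the sketch still compiles on machines without SVE.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

#if defined(__ARM_FEATURE_SVE)
#include <arm_sve.h>
#endif

// Hypothetical helper: out[k] = lhs[k] * rhs[k] for k in [0, dim).
inline void MulRow(const float* lhs, const float* rhs, float* out,
                   int64_t dim) {
#if defined(__ARM_FEATURE_SVE)
  // Number of 32-bit lanes in one SVE vector on this machine.
  const int64_t step = static_cast<int64_t>(svcntw());
  for (int64_t k = 0; k < dim; k += step) {
    // The predicate enables only lanes with k + lane < dim,
    // so no scalar tail loop is needed.
    const svbool_t pg = svwhilelt_b32(k, dim);
    const svfloat32_t a = svld1_f32(pg, lhs + k);
    const svfloat32_t b = svld1_f32(pg, rhs + k);
    svst1_f32(pg, out + k, svmul_f32_x(pg, a, b));
  }
#else
  // Scalar fallback for targets without SVE.
  for (int64_t k = 0; k < dim; ++k) out[k] = lhs[k] * rhs[k];
#endif
}

// Hypothetical SDDMM over a COO graph with no broadcasting and
// reduce_size == 1: for every edge e = (row[e], col[e]),
// out[e] = lhs[row[e]] * rhs[col[e]], elementwise over `dim` features.
std::vector<float> SddmmCooMul(const std::vector<int64_t>& row,
                               const std::vector<int64_t>& col,
                               const std::vector<float>& lhs,
                               const std::vector<float>& rhs,
                               int64_t dim) {
  assert(row.size() == col.size());
  std::vector<float> out(row.size() * static_cast<std::size_t>(dim));
  for (std::size_t e = 0; e < row.size(); ++e) {
    MulRow(lhs.data() + row[e] * dim, rhs.data() + col[e] * dim,
           out.data() + e * static_cast<std::size_t>(dim), dim);
  }
  return out;
}
```

Because both operands have the same feature length and each output element depends on exactly one element from each side, the inner loop has no index remapping and no reduction, which is what makes it a natural fit for predicated vector loads and stores.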

@Rhett-Ying @frozenbugs @tpatejko

@abhijain1204fujitsu

@Rhett-Ying, @frozenbugs, @tpatejko, could you please review this PR?

SVE intrinsic code is added to improve the performance of the SDDMMCOO Op when broadcasting is disabled and reduce_size=1
@akote123
Author

CC : @frozenbugs @Rhett-Ying

@choudhary-devang

Hi @Rhett-Ying, @drivanov, could you please review this PR?
Thank you.

@Rhett-Ying Rhett-Ying requested a review from classicsong May 29, 2025 08:36