@choudhary-devang commented Jan 17, 2025

Description

Optimized the GSpMM kernel for the Arm platform using libxsmm. The kernel previously used a naive implementation on this platform, and the libxsmm-based version gives a substantial performance gain.

Results achieved

[Image: Screenshot 2024-12-03 105627]

The optimized kernel achieved almost a 15x performance gain.

@choudhary-devang
Author

Hi @zheng-da, @zhenliangqiu, @mli, @vmiheer, could you please review this PR? Thank you.

@choudhary-devang
Author

Hi @Rhett-Ying, @jermainewang, @mufeili, @BarclayII, could you please review this PR? Thank you.

@abhijain1204fujitsu

Hi @Rhett-Ying, @jermainewang, @mufeili, @BarclayII,
Could you kindly review this PR and share your feedback?

@abhijain1204fujitsu

@Rhett-Ying, could you kindly review this PR?

@choudhary-devang
Author

Hi @Rhett-Ying, @drivanov, could you please review this PR?
Thank you.

@Rhett-Ying requested a review from classicsong May 29, 2025 08:36