Skip to content

Conversation

@avbokovoy
Copy link

Cleaned-up branch history for easier review. This branch contains merged bwd and fwd pass optimizations and targeting relatively up-to-date upstream. Current limitations:

BWD:

  1. Only 128 and 160 support for embedding_size
  2. no vbe
  3. no ssd
  4. no gwd

FWD:

  1. Only L % 4 == 0

avbokovoy and others added 30 commits October 4, 2024 11:18
This reverts commit cb8cfc9.

This is not due to missing files but broken #include paths
* Rename common header

* Delete redundant files related to fwd pass

* Add hip bwd kernel template generation

* Remove old instantiation code, clean-up leftovers (except debug prints)
Copy link

@xinyazhang xinyazhang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Need more clean up imo.

kEmbeddingDim,
kWeighDecayMode
)
}}

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I like this. No indents look weird at first glance but much more readable than nested inner loop.


{%- if weighted %}
vals[_j*kMaxVecsPerThread + i] = weights_slice;
//{%- if weighted %}

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consider removing them if they are not useful anymore.
If it's some upcoming feature, we can add them back later.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done

{%- endif %}
{%- endmacro %}

{#-/*

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is one question. Should we keep the comment in generated source files?

(Just a concern, not a blocking factor)

@avbokovoy
Copy link
Author

Need more clean up imo.

Definitely. I did a bit, but still more to come

Copy link

@xinyazhang xinyazhang left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@amathews-amd amathews-amd mentioned this pull request Oct 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants