Fix: Use attn_implementation='eager' for MPS compatibility #78
Description:
When running the colqwen2 model on an MPS device (Apple Silicon), the default attention implementation causes compatibility issues.
The fix sets attn_implementation="eager" whenever the device is MPS, allowing stable execution.
Steps to Reproduce:
1. Run the model on a MacBook Pro with an MPS device:

```python
# Initialize RAGMultiModalModel
model = RAGMultiModalModel.from_pretrained(
    "vidore/colqwen2-v0.1",
    device="mps",
)
```
2. Observe that the default attention implementation raises an error:

```
IndexError: Dimension out of range (expected to be in range of [-3, 2], but got 3)
```
Proposed Fix:
Update the from_pretrained method to set attn_implementation="eager" when running on MPS:

```python
attn_implementation = (
    "eager"
    if device == "mps" or (isinstance(device, torch.device) and device.type == "mps")
    else None
)
```
File: byaldi/colpali.py
Location: Inside the from_pretrained method.
Rationale:
• MPS devices currently face issues with the default attention implementation.
• "eager" mode provides a stable alternative.
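The device check in the proposed fix can be sketched as a small standalone helper (the name `pick_attn_implementation` is hypothetical, introduced here only for illustration). It accepts either a plain device string or a `torch.device`-like object with a `.type` attribute, mirroring the conditional above:

```python
def pick_attn_implementation(device):
    """Return "eager" on MPS to work around the IndexError above, else None.

    Accepts either a plain string ("mps", "cuda", ...) or an object with a
    .type attribute such as torch.device. Returning None leaves the library's
    default attention implementation in place on non-MPS devices.
    """
    # torch.device exposes the backend name via .type; plain strings are used as-is
    device_type = getattr(device, "type", device)
    return "eager" if device_type == "mps" else None
```

The returned value would then be forwarded to the underlying `from_pretrained` call, e.g. `attn_implementation=pick_attn_implementation(device)`, so non-MPS users keep the default behavior.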