RPC: How to offload specific tensors / layers? #15020

Answered by rgerganov
Mushoz asked this question in Q&A

Yes, you can offload specific tensors to RPC with -ot and the RPC device name, e.g. -ot 'blk.1*=RPC[localhost:50052]'. I'd appreciate it if you shared your results with this approach, so we can document it.
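
To preview which tensors a given -ot pattern would select before launching anything, a minimal Python sketch like the one below may help. It rests on an assumption not stated in the answer: that the part before `=` is treated as a regular expression searched anywhere in the tensor name. The tensor names and the helpers `build_override` and `matching_tensors` are hypothetical, made up for illustration; they are not part of llama.cpp.

```python
import re


def build_override(pattern: str, endpoint: str) -> str:
    """Format one -ot value as PATTERN=RPC[host:port] (shape taken from the answer)."""
    return f"{pattern}=RPC[{endpoint}]"


def matching_tensors(pattern: str, names: list[str]) -> list[str]:
    """Return the names the pattern hits, ASSUMING regex search semantics."""
    rx = re.compile(pattern)
    return [name for name in names if rx.search(name)]


# Hypothetical tensor names in a "blk.<layer>.<part>" style.
TENSORS = [
    "token_embd.weight",
    "blk.0.attn_q.weight",
    "blk.1.attn_q.weight",
    "blk.1.ffn_down.weight",
    "blk.10.ffn_up.weight",
    "output.weight",
]

loose = "blk.1*"        # pattern from the answer
strict = r"blk\.1\."    # escaped dots: intended to hit layer 1 only

print(build_override(loose, "localhost:50052"))
print(matching_tensors(loose, TENSORS))
print(matching_tensors(strict, TENSORS))
```

If the regex assumption holds, `blk.1*` reads as "blk, any character, zero or more 1s", so it would select every `blk.` tensor in this list rather than only layer 1; escaping the dots narrows the match. If the pattern is interpreted differently, only the matching rule in `matching_tensors` needs to change.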

Answer selected by Mushoz