模型预测问题

按照步骤使用p-tuning v2微调chatglm2-6b，微调之后切换v0.1，并在predict_pt中使用微调模型路径，运行后报如下错误：
    raise RuntimeError(f"Error(s) in loading state_dict for {model.__class__.__name__}:\n\t{error_msg}")
RuntimeError: Error(s) in loading state_dict for ChatGLMForConditionalGeneration:
	size mismatch for transformer.prefix_encoder.embedding.weight: copying a param with shape torch.Size([16, 14336]) from checkpoint, the shape in current model is torch.Size([16, 4096]).
	size mismatch for transformer.prefix_encoder.trans.0.weight: copying a param with shape torch.Size([4096, 14336]) from checkpoint, the shape in current model is torch.Size([4096, 4096]).
	size mismatch for transformer.prefix_encoder.trans.2.weight: copying a param with shape torch.Size([14336, 4096]) from checkpoint, the shape in current model is torch.Size([229376, 4096]).
	size mismatch for transformer.prefix_encoder.trans.2.bias: copying a param with shape torch.Size([14336]) from checkpoint, the shape in current model is torch.Size([229376]).
	You may consider adding `ignore_mismatched_sizes=True` in the model `from_pretrained` method.

Process finished with exit code 1

希望作者能更新一下预测步骤。

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

模型预测问题 #96

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

模型预测问题 #96

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions