Skip to content

[Bad Case]: 微调MiniCPM 4.1-8B时,其loss显著高于其他模型 #327

@moonmengmeng

Description

@moonmengmeng

Description / 描述

如图第一步loss为500多,在其他模型刚开始的时候都是3-4左右

Image

Case Explaination / 案例解释

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions