Bug when computing the SFT loss #48

@shyoulala

Description

When computing the SFT loss, the labels and logits do not appear to be shifted. Am I misunderstanding something?
It should be new_logits = logits[:, :-1, :]
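
For reference, here is a minimal sketch of the shift in question, written in NumPy rather than taken from this repository's code. The function name shifted_cross_entropy and the ignore index of -100 are assumptions for illustration only. The idea is that the logits at position t predict the token at position t+1, so the last logit is dropped and the first label is dropped before computing the loss.

```python
import numpy as np

IGNORE_INDEX = -100  # assumed mask value for prompt/padding tokens (hypothetical here)


def shifted_cross_entropy(logits, labels, ignore_index=IGNORE_INDEX):
    """Mean next-token cross-entropy with the causal shift applied.

    logits: float array of shape (batch, seq_len, vocab)
    labels: int array of shape (batch, seq_len)
    """
    # Position t predicts token t+1: drop the last logit and the first label.
    new_logits = logits[:, :-1, :]
    new_labels = labels[:, 1:]

    # Numerically stable log-softmax over the vocabulary axis.
    centered = new_logits - new_logits.max(axis=-1, keepdims=True)
    log_probs = centered - np.log(np.exp(centered).sum(axis=-1, keepdims=True))

    # Masked positions contribute nothing to the loss.
    mask = new_labels != ignore_index
    safe_labels = np.where(mask, new_labels, 0)
    token_log_probs = np.take_along_axis(
        log_probs, safe_labels[..., None], axis=-1
    )[..., 0]

    return float(-(token_log_probs * mask).sum() / max(mask.sum(), 1))
```

If the loss is computed on logits and labels of the same length without this shift, each position is scored against its own input token instead of the next one. Some model classes apply the shift internally when labels are passed to the forward call, so it is worth checking which case applies before adding a second shift.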

image
