Skip to content

why not use lr_mult, decay_mult like {1, 1, 2, 0}? #60

@ujsyehao

Description

@ujsyehao

In alexnet network, it uses lr_mult/decay_mult param {1, 1, 2, 0},
In squeezenet, it doesn't set param, so caffe uses its default value, lr_mult and decay_mult is default set to 1. so its param {1, 1, 1, 1}
As we all know, we should not add weight decay to bias. So why you use default lr_mult and decay_mult?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions