
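A large batch_size and the nested for loop over multiple target classes make the ATSS algorithm slow and leave GPU utilization low. How can performance and efficiency be improved? #102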

@chenrui17


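As the title says: with batch_size = 15 and 6 target classes, the ATSS algorithm is called 90 times in a row. Profiling shows that this part has low GPU utilization and accounts for roughly 30% of training time. How could it be optimized to improve performance and GPU utilization?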
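One possible direction, sketched here only as an illustration and not as this repository's implementation: instead of calling the assigner once per image and per class, pad the ground-truth boxes of every image to a common count and run a single batched assignment. The sketch below assumes PyTorch, axis-aligned `xyxy` boxes, and a single feature level (the original ATSS selects top-k candidates per pyramid level); `batched_iou`, `batched_atss_assign`, `gt_valid`, and `topk` are hypothetical names introduced for this example.

```python
import torch


def batched_iou(anchors, gt_boxes):
    """Pairwise IoU. anchors: (A, 4), gt_boxes: (B, G, 4), xyxy. Returns (B, G, A)."""
    a = anchors[None, None, :, :]   # (1, 1, A, 4)
    g = gt_boxes[:, :, None, :]     # (B, G, 1, 4)
    lt = torch.maximum(a[..., :2], g[..., :2])
    rb = torch.minimum(a[..., 2:], g[..., 2:])
    wh = (rb - lt).clamp(min=0)
    inter = wh[..., 0] * wh[..., 1]
    area_a = (a[..., 2] - a[..., 0]) * (a[..., 3] - a[..., 1])
    area_g = (g[..., 2] - g[..., 0]) * (g[..., 3] - g[..., 1])
    return inter / (area_a + area_g - inter).clamp(min=1e-9)


def batched_atss_assign(anchors, gt_boxes, gt_valid, topk=9):
    """Assumed simplification of ATSS: one feature level, all images at once.

    anchors:  (A, 4) shared by every image
    gt_boxes: (B, G, 4) padded to G boxes per image
    gt_valid: (B, G) bool, False for padding rows
    Returns (B, A) index of the matched gt per anchor, -1 for background.
    """
    num_anchors = anchors.shape[0]
    iou = batched_iou(anchors, gt_boxes)                      # (B, G, A)

    # Centre distance between every gt and every anchor.
    a_ctr = (anchors[:, :2] + anchors[:, 2:]) / 2             # (A, 2)
    g_ctr = (gt_boxes[..., :2] + gt_boxes[..., 2:]) / 2       # (B, G, 2)
    dist = (g_ctr[:, :, None, :] - a_ctr[None, None]).norm(dim=-1)

    # The k closest anchors per gt are the candidates (needs k >= 2 for std).
    k = min(topk, num_anchors)
    cand_idx = dist.topk(k, dim=-1, largest=False).indices    # (B, G, k)
    cand_iou = iou.gather(-1, cand_idx)
    thr = cand_iou.mean(dim=-1) + cand_iou.std(dim=-1)        # (B, G)
    is_cand = torch.zeros_like(iou, dtype=torch.bool).scatter_(-1, cand_idx, True)

    # The anchor centre must lie inside the gt box.
    cx, cy = a_ctr[None, None, :, 0], a_ctr[None, None, :, 1]
    inside = (
        (cx > gt_boxes[..., None, 0]) & (cy > gt_boxes[..., None, 1])
        & (cx < gt_boxes[..., None, 2]) & (cy < gt_boxes[..., None, 3])
    )

    pos = is_cand & (iou >= thr[..., None]) & inside & gt_valid[..., None]

    # An anchor claimed by several gts keeps the one with the highest IoU.
    masked = torch.where(pos, iou, torch.full_like(iou, -1.0))
    best_iou, best_gt = masked.max(dim=1)                     # (B, A)
    best_gt[best_iou < 0] = -1
    return best_gt
```

With this layout the class loop can also be dropped: keep the class labels in a padded `(B, G)` tensor and look them up with the returned gt index, so one call covers all images and classes. Whether this actually removes the reported 30% overhead would have to be confirmed by profiling.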
