Skip to content

Different convolutions within same TCN layer #14

@FabianB98

Description

@FabianB98

Hi,

I'm currently trying to implement the network architecture described in the paper "User-Driven Fine-Tuning for Beat Tracking" by Pinto et al., 2021. Within this network architecture, the authors propose the usage of a TCN where each TCN layer uses two separate sets of dilated convolutions, where one of the dilated convolutions has a dilation of twice that of the first dilated convolution. In figure 2 of that paper, they depict their TCN layout as follows:

Bildschirmfoto vom 2024-05-22 14-24-49

As you can see, there are two dilated convolutions per TCN layer: "Dilated Convolution 1" with a dilation of dr1, and "Dilated Convolution 2" with a dilation of dr2 = 2 * dr1. The results of these dilations are then concatenated before the activation function, dropout and a 1x1 convolution (as a way of keeping the dimensionality equal throughout the TCN layers) are applied.

From what I could find so far, it appears as if this package only supports a single dilation rate within each TCN layer, which leads me to believe that it is not possible to implement this architecture using this Python package. Is my understanding of this correct? Or am I simply missing something (potentially obvious) and it is possible to implement the proposed network architecture with this package?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions