Skip to content

Best practices on size of random training subset for policytree #141

@njawadekar

Description

@njawadekar

I notice much variation in the variables that are selected by the policytree, depending on the size of the random training subset that is specified, in this step:

Fit a depth 2 tree on a random training subset.

n <- 250
train <- sample(1:n, 200)

opt.tree <- policy_tree(X[train, ], Gamma.matrix[train, ], depth = 2)
opt.tree

Are there recommended best practices regarding the size (or % of the original sample) for the random training subset when setting up the policytree?

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions