[Bug] Dynamic Filtering Data Precision Error

```
def check_reward_nonzero_std(args, samples: list[Sample], **kwargs):
    rewards = [sample.get_reward_value(args) for sample in samples]
    keep = torch.tensor(rewards, dtype=torch.float).std() > 0.0
    return DynamicFilterOutput(
        keep=keep,
        reason=None if keep else f"zero_std_{round(rewards[0], 1)}",
    )
```

There will be a floating-point precision issue, when the reward is Non-0/1 cases.

For example,
```
torch.tensor([0.1]*16, dtype=torch.float).std() > 0.0
>tensor(True)

torch.tensor([0.25]*16, dtype=torch.float).std() > 0.0
>tensor(False)

torch.tensor([0.1]*16, dtype=torch.float64).std() > 0.0
>tensor(False)

torch.tensor([0.1]*1024, dtype=torch.float64).std() > 0.0
>tensor(True)
```

Suggest using higher precision and a small epsilon for comparision

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] Dynamic Filtering Data Precision Error #570

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Bug] Dynamic Filtering Data Precision Error #570

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions