Problem Description
1. The Hadamard transform is performed at float64 in prior art, while ours runs at bfloat16.
2. The weight transform could be conducted in place; there is no need to re-run the transform in each iteration of AR tuning.
3. Handle shared layers, e.g. MoE experts and fused QKV.
4. Use real randomness.
5. Fuse into AR block-wise tuning; otherwise RAM usage is high.
Reproduction Steps
~
Environment Information
~
Error Logs
Additional Context
No response