Import ERROR in FP8 quantization

I am trying to quantize a model to FP8 following the sample script at [https://github.com/OpenPPL/ppq/blob/master/ppq/samples/FP8/fp8_sample.py](https://github.com/OpenPPL/ppq/blob/master/ppq/samples/FP8/fp8_sample.py), but I am receiving the following error:
``` bash
ImportError: /local/mnt/workspace/tvura/PPQ/penv/lib/python3.10/site-packages/ppq-0.6.6-py3.10.egg/ppq/csrc/build/PPQ_Cuda_Impls.so: cannot open shared object file: No such file or directory
``` 

In this snippet,

```python
pipeline = PFL.Pipeline([
    ParameterQuantizePass(),
    RuntimeCalibrationPass(),
    LearnedStepSizePass(
        steps=1000, is_scale_trainable=False, 
        lr=1e-4, block_size=4, collecting_device='cuda'),
    ParameterBakingPass()
])

with ENABLE_CUDA_KERNEL():
    # Invoke the pipeline to perform quantization
    pipeline.optimize(
        graph=graph, dataloader=dataset, verbose=True, 
        calib_steps=32, collate_fn=collate_fn, executor=executor)

    # Run the quantization error analysis
    graphwise_error_analyse(
        graph=graph, running_device='cuda', 
        dataloader=dataset, collate_fn=collate_fn)
``` 
the error is raised at the call to `pipeline.optimize()`.

I'm using the latest version of ppq (0.6.6), and I am able to perform INT8 quantization and export the model without problems.
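
To narrow this down, a quick check of whether the compiled library is actually present in the installed package might help. The following is only a minimal diagnostic sketch: the relative path `csrc/build/PPQ_Cuda_Impls.so` is taken from the error message above, and the helper names `find_package_dir` and `kernel_library_exists` are hypothetical, not part of ppq.

```python
import importlib.util
from pathlib import Path
from typing import Optional

# Relative location of the shared library, copied from the ImportError message.
# Assumption: the library is expected at this path inside the installed package.
KERNEL_RELATIVE_PATH = Path("csrc") / "build" / "PPQ_Cuda_Impls.so"


def find_package_dir(package_name: str = "ppq") -> Optional[Path]:
    """Return the directory of an installed package, or None if it is not found.

    For a top-level package, find_spec locates it without importing it.
    """
    spec = importlib.util.find_spec(package_name)
    if spec is None or not spec.submodule_search_locations:
        return None
    return Path(next(iter(spec.submodule_search_locations)))


def kernel_library_exists(package_dir: Path) -> bool:
    """Check whether the shared library file exists under the package directory."""
    return (package_dir / KERNEL_RELATIVE_PATH).is_file()


if __name__ == "__main__":
    package_dir = find_package_dir("ppq")
    if package_dir is None:
        print("ppq is not installed in this environment")
    else:
        print("package directory:", package_dir)
        print("shared library present:", kernel_library_exists(package_dir))
```

If the second line prints `False`, the file named in the error message is indeed missing from the install, which would be consistent with the `ImportError` above.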

