Command used: OMP_NUM_THREADS=1 torchrun --nproc_per_node
Export.py --ckpt_dir <llama_dir>
--tokenizer_path <llama_tokenizer_dir>/tokenizer.model
--export_path <path_to_store_exported_llama7b>
--fused_qkv 1 --fused_kvcache 1 --auto_causal 1
--quantized_cache 1 --dynamic_batching 1
When exporting the model, the following warnings are reported. What causes them? What impact do they have? Do they need to be fixed? Thanks.
[W shape_type_inference.cpp:1920] Warning: The shape inference of pmx::SwiGLU type is missing, so it may result in wrong shape inference for the exported graph. Please consider adding it in symbolic function. (function UpdateReliable)
[W shape_type_inference.cpp:1920] Warning: The shape inference of pmx::RowParallelLinear type is missing, so it may result in wrong shape inference for the exported graph. Please consider adding it in symbolic function. (function UpdateReliable)