In lib/Target/LLVM/NVVM/Target.cpp, NVPTXSerializer compiles PTX to binary through one of two flows, selected by MLIR_ENABLE_NVPTXCOMPILER.
When MLIR is built with -DMLIR_ENABLE_NVPTXCOMPILER=ON, the flow does not check whether the target is gpu::CompilationTarget::Fatbin and compiles PTX directly to a cubin, which is inconsistent with the other flow.
Is this expected?
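
For reference, here is a simplified model of the difference as I understand it. This is not the actual MLIR code: the enum and both functions are stand-ins I made up for illustration, and the string results only label which kind of binary each flow would produce. I am assuming the flow without MLIR_ENABLE_NVPTXCOMPILER produces a fatbin when the target is Fatbin.

```cpp
#include <string>

// Hypothetical stand-in for gpu::CompilationTarget (not the real MLIR enum).
enum class CompilationTarget { Offload, Assembly, Binary, Fatbin };

// Models the flow used when MLIR_ENABLE_NVPTXCOMPILER is off:
// it looks at the target and (by assumption) emits a fatbin when asked to.
std::string compileWithoutNVPTXCompiler(CompilationTarget target) {
  bool createFatbin = (target == CompilationTarget::Fatbin);
  return createFatbin ? "fatbin" : "cubin";
}

// Models the flow used when MLIR_ENABLE_NVPTXCOMPILER=ON, as described above:
// the target is never consulted, so the result is always a cubin.
std::string compileWithNVPTXCompiler(CompilationTarget /*target*/) {
  return "cubin";
}
```

With this model, requesting CompilationTarget::Fatbin gives a fatbin from the first function but a cubin from the second, which is the inconsistency I am asking about.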