First, thanks to the original author. Since my deployment environment is Windows, I modified the original repo to run on Windows 10/11. Inference environment: CUDA 11.8 + CUDNN 8.9 + TensorRT 8.6.1.6 ...