基于预热感知噪声复用的高效自适应扩散模型推理方法

杨迎春; 陈静波; 彭雪梅; 纪东; 刘松冉

doi:10.7544/issn1000-1239.202550290

基于预热感知噪声复用的高效自适应扩散模型推理方法

Wander: Warmup-Aware Noise Reuse for Efficient Adaptive Diffusion Inference

摘要

摘要: 扩散模型在图像生成领域取得了显著成就，但在图像生成时仍然面临计算成本高、推理延迟长和生成质量差的挑战．本研究在前人研究的基础上，提出了一种基于预热感知噪声复用的高效自适应扩散模型推理方法，用于根据输入条件优化扩散模型的性能，解决了推理速度慢和生成质量差的问题．具体而言，我们的预热感知方法能够根据输入文本提示的语义复杂度动态调整扩散过程初始阶段的预热时间步的数量，以确保高效的初始化．随后，我们的噪声复用方法动态决定在迭代细化阶段噪声复用的时间步数量，减少不必要的计算开销．如图 1所示，我们在4个NVIDIA V100 GPUs上实现了2.66倍的加速，与DistriFusion相比，我们的推理速度提高了12.8%，生成质量提升了5.03倍．这些进步表明，我们提出的方法在平衡推理效率与生成质量方面具有显著的效果，为实时图像生成应用提供了一种可扩展的解决方案．

Abstract: The diffusion models have made significant achievements in the field of image generation, but they still face some challenges such as high computational cost, long inference latency, and poor image generation quality during image generation. Based on previous research, this study proposes an efficient adaptive diffusion model inference method based on the warmup-aware and noise reuse methods, which can optimize the performance of the diffusion models according to input conditions and solve the problems of slow inference speed and poor image generation quality. Specifically, our warmup-aware method can dynamically adjust the number of warmup time steps in the initial stage of the diffusion process according to the semantic complexity of the input text prompts to ensure efficient initialization. Subsequently, our noise reuse method dynamically determines the number of time steps for noise reuse in the iterative refinement stage, and reduces unnecessary computational overhead. As shown in Figure 1, We achieved 2.66 times acceleration on 4 NVIDIA V100 GPUs, and compared with DistriFusion, our inference speed increased by 12.8% and the image generation quality improved by 5.03 times. These improvements indicate that our proposed method has significant effects in balancing inference efficiency and image generation quality, and provides a scalable solution for real-time image generation applications．

HTML全文

参考文献(0)

施引文献

资源附件(0)