Transformer based Pluralistic Image Completion with Reduced Information Loss
They quantize 256^3 RGB values to a small number (such as 512) of quantized color values. The indices of quantized pixels are used as tokens for the inputs and prediction targets of the transformer.
In addition, pluralistic images can be produced when the content of masked regions is predicted and sampled in an autoregressive manner.
全栈爱好者,欢迎交流学习

浙公网安备 33010602011771号