Transformer based Pluralistic Image Completion with Reduced Information Loss

They quantize 256^3 RGB values to a small number (such as 512) of quantized color values. The indices of quantized pixels are used as tokens for the inputs and prediction targets of the transformer.
In addition, pluralistic images can be produced when the content of masked regions is predicted and sampled in an autoregressive manner.

posted @ 2024-08-13 11:13  Trkly  阅读(17)  评论(0)    收藏  举报