👨💻 Style Transfer and Synthesis (1/3): Style Transfer in Image Synthesis
# Artificial Intelligence # Style Tranfer # Image Synthesis · 258 words · 2 min · Pbulished On: January 1, 1 (Last updated on: June 23, 2023)
There are multiple papers in the area of style transfer and image inversion or reconstruction. Here are some papers I read and would like to share with you:
Progressive GAN
- Paper URL: ICLR 2018 - PROGRESSIVE GROWING OF GANS FOR IMPROVED QUALITY, STABILITY, AND VARIATION |
- Code URL: https://git.cklau.cc/terenceliu/gans-models/-/tree/main/PGGAN |
Key point: They propose a novel training process of GAN model: progressive training, i.e. from small model to big model/from low resolution to high resolution
Some Hightlights
- The training (see Figure A) is from left to right,
- start from feature map, the model produce a (3x4x4) output from the generator $G_1$ and as the input of the discriminator $D_1$
- the second process is to upsample the feature map from $4 \times 4$ to $8 \times 8$ and produce a ($3 \times 8 \times 8$) output from the generator $G_2$ and as the input of the discriminator $D_2$
- continue with the process, for the last part of the progression, the model output the $1024 \times 1024$ image from the generator and as the input of the discriminator $D_m$
- To avoid the influence/damage of the transition from low resolution to high resolution, they fade in the new layer smoothly (see Figure B):
- they treat the layers that operate on the higher resolution like a residual block, whose weight $\alpha$ increases linearly from $0$ to $1$.
- By adjusting the weight of convolution-based output and upsampling-based output to control the final output: $(\alpha \cdot O_{\text{convolution layer}} + (1 - \alpha) \cdot O_{\text{upsample layer}})$
- Minibatch Standard Deviation
- Equalize the Learning Rate
- Pixelwised Noramlization