hw3

MLDS2018SPRING/hw3

3-0. Requirements

tensorflow-gpu==1.6.0
numpy==1.14.3
scipy==1.1.0
matplotlib==2.2.2
opencv-python==3.4.0.12

3-1. Image Generation

Run bash to Generate Images

bash run_gan.sh

./samples/gan_original.png

Test on Baseline Model

cd gan-baseline
python3.6 baseline.py --input ../samples/gan_original.png

./gan-baseline/baseline_result_gan.png

Compare Our Model (WGAN_GP) with WGAN (50 epochs)

See more details for WGAN_GP, WGAN.

WGAN_GP	WGAN

Training Tips for Improvement

Here's a link to the document of tips and tricks to make GANs work

Tip 1: Normalize the inputs

Normalize the images between -1 and 1
Tanh as the last layer of the generator output

Tip 3: Use a spherical Z

Don't sample from a Uniform distribution
Sample from a gaussian distribution
When doing interpolations, do the interpolation via a great circle, rather than a straight line from point A to point B
Tom White's Sampling Generative Networks ref code https://github.com/dribnet/plat has more details

Tip 4: BatchNorm

Construct different mini-batches for real and fake, i.e. each mini-batch needs to contain only all real images or all generated images.
When batchnorm is not an option use instance normalization (for each sample, subtract mean and divide by standard deviation).

Tip 5: Avoid Sparse Gradients: ReLU, MaxPool

The stability of the GAN game suffers if you have sparse gradients
LeakyReLU = good (in both G and D)
For Downsampling, use: Average Pooling, Conv2d + stride
For Upsampling, use: PixelShuffle, ConvTranspose2d + stride
- PixelShuffle: https://arxiv.org/abs/1609.05158

Tip 14: Train discriminator more (sometimes)

Especially when you have noise
Hard to find a schedule of number of D iterations vs G iterations

Without Tip 1: Normalize the inputs

Normalize the images between 0 and 1
Sigmoid as the last layer of the generator output
See more details for WGAN_GP Without Tip 1

With Tip 1, 3, 4, 5, 14	Without Tip 1

Without Tip 3: Use a spherical Z

Change sampled Z from np.random.normal(0, np.exp(-1 / np.pi)) to np.random.uniform(-1, 1)
See more details for WGAN_GP Without Tip 3

With Tip 1, 3, 4, 5, 14	Without Tip 3

Without Tip 14: Train discriminator more (sometimes)

Change self.d_iter, self.g_iter from (2, 1) to (1, 1)
See more details for WGAN_GP Without Tip 14

With Tip 1, 3, 4, 5, 14	Without Tip 14

3-2. Text-to-Image Generation

Run bash to Generate Images

bash run_cgan.sh ./AnimeDataset/testing_tags.txt

Testing Tags	./samples/cgan_original.png
blue hair blue eyes blue hair green eyes blue hair red eyes green hair blue eyes green hair red eyes

Test on Baseline Model

cd gan-baseline
python3.6 baseline.py --input ../samples/cgan_original.png

Testing Tags	./gan-baseline/baseline_result_cgan.png
blue hair blue eyes blue hair green eyes blue hair red eyes green hair blue eyes green hair red eyes

3-3. Style Transfer

See more details for Style Transfer

Name		Name	Last commit message	Last commit date
parent directory ..
gan-baseline		gan-baseline
hw3_1		hw3_1
hw3_2		hw3_2
hw3_3		hw3_3
samples		samples
README.md		README.md
reprot.pdf		reprot.pdf
run_cgan.sh		run_cgan.sh
run_gan.sh		run_gan.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hw3

hw3

README.md

MLDS2018SPRING/hw3

3-0. Requirements

3-1. Image Generation

Run bash to Generate Images

Test on Baseline Model

Compare Our Model (WGAN_GP) with WGAN (50 epochs)

Training Tips for Improvement

Tip 1: Normalize the inputs

Tip 3: Use a spherical Z

Tip 4: BatchNorm

Tip 5: Avoid Sparse Gradients: ReLU, MaxPool

Tip 14: Train discriminator more (sometimes)

Without Tip 1: Normalize the inputs

Without Tip 3: Use a spherical Z

Without Tip 14: Train discriminator more (sometimes)

3-2. Text-to-Image Generation

Run bash to Generate Images

Test on Baseline Model

3-3. Style Transfer

Files

hw3

Directory actions

More options

Directory actions

More options

Latest commit

History

hw3

Folders and files

parent directory

README.md

MLDS2018SPRING/hw3

3-0. Requirements

3-1. Image Generation

Run bash to Generate Images

Test on Baseline Model

Compare Our Model (WGAN_GP) with WGAN (50 epochs)

Training Tips for Improvement

Tip 1: Normalize the inputs

Tip 3: Use a spherical Z

Tip 4: BatchNorm

Tip 5: Avoid Sparse Gradients: ReLU, MaxPool

Tip 14: Train discriminator more (sometimes)

Without Tip 1: Normalize the inputs

Without Tip 3: Use a spherical Z

Without Tip 14: Train discriminator more (sometimes)

3-2. Text-to-Image Generation

Run bash to Generate Images

Test on Baseline Model

3-3. Style Transfer