Cat Image Generator with GANs

Architecture

Generator

The generator creates synthetic images from random noise vectors. It uses a series of transposed convolution layers to progressively upsample from the latent space to a 32×32 RGB image.

Input: Random noise vector of size 100
Architecture:
- Linear layer to reshape noise (100 → 4×4×512)
- Four upsampling blocks with transposed convolutions
- Final convolutional layer with tanh activation to produce normalized RGB output
Output: 32×32×3 RGB image with pixel values normalized between -1 and 1
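
The layer list above can be sketched in PyTorch roughly as follows. This is a minimal illustration, not the notebook's actual code: the channel widths (256, 128, 64), the BatchNorm/ReLU layers, and the kernel sizes are assumptions. Going from 4×4 to 32×32 with stride-2 transposed convolutions takes three doublings, so the sketch uses three upsampling blocks; the real model may arrange its blocks differently.

```python
import torch
from torch import nn


class Generator(nn.Module):
    """DCGAN-style sketch: noise (N, 100) -> image (N, 3, 32, 32) in [-1, 1]."""

    def __init__(self, latent_dim: int = 100):
        super().__init__()
        # Linear projection of the noise vector, reshaped later to 512 x 4 x 4.
        self.project = nn.Linear(latent_dim, 512 * 4 * 4)

        def up(in_ch, out_ch):
            # kernel 4, stride 2, padding 1 doubles height and width:
            # (H - 1) * 2 - 2 * 1 + 4 = 2 * H
            return nn.Sequential(
                nn.ConvTranspose2d(in_ch, out_ch, 4, 2, 1, bias=False),
                nn.BatchNorm2d(out_ch),  # assumption: not stated in this README
                nn.ReLU(inplace=True),
            )

        self.net = nn.Sequential(
            up(512, 256),  # 4x4 -> 8x8
            up(256, 128),  # 8x8 -> 16x16
            up(128, 64),   # 16x16 -> 32x32
            nn.Conv2d(64, 3, kernel_size=3, padding=1),  # keeps 32x32
            nn.Tanh(),     # squashes pixel values into [-1, 1]
        )

    def forward(self, z):
        x = self.project(z).view(-1, 512, 4, 4)
        return self.net(x)
```

Calling `Generator()(torch.randn(2, 100))` returns a tensor of shape `(2, 3, 32, 32)`.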

Discriminator

The discriminator evaluates whether an image is real or generated. It uses convolutional layers to progressively downsample the image to a single scalar output.

Input: 32×32×3 RGB image
Architecture:
- Three convolutional blocks with LeakyReLU activations
- Flatten layer followed by linear layer to produce a single value
- Sigmoid activation to output probability
Output: Scalar value between 0 and 1 (1 = real, 0 = fake)
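
A matching discriminator could look like the sketch below. The channel widths, the LeakyReLU slope of 0.2, and the kernel/stride choices are assumptions made for illustration; only the overall shape (three convolutional blocks, flatten, linear, sigmoid) comes from the description above.

```python
import torch
from torch import nn


class Discriminator(nn.Module):
    """Sketch: image (N, 3, 32, 32) -> probability of being real (N, 1)."""

    def __init__(self):
        super().__init__()

        def down(in_ch, out_ch):
            # kernel 4, stride 2, padding 1 halves height and width.
            return nn.Sequential(
                nn.Conv2d(in_ch, out_ch, 4, 2, 1),
                nn.LeakyReLU(0.2, inplace=True),
            )

        self.features = nn.Sequential(
            down(3, 64),     # 32x32 -> 16x16
            down(64, 128),   # 16x16 -> 8x8
            down(128, 256),  # 8x8 -> 4x4
        )
        self.classify = nn.Sequential(
            nn.Flatten(),
            nn.Linear(256 * 4 * 4, 1),
            nn.Sigmoid(),    # probability: 1 = real, 0 = fake
        )

    def forward(self, x):
        return self.classify(self.features(x))
```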

Training Process

Data Preparation:
- The CIFAR-10 dataset is loaded and filtered to extract only cat images (class 3)
- Images are normalized to range [-1, 1]
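
The filtering and normalization steps might be sketched like this. The helper name `select_and_normalize` is hypothetical, and the example runs on random stand-in data so that it needs no download; with torchvision, the `data` and `targets` attributes of a `torchvision.datasets.CIFAR10` instance have the same layout and can be passed in directly.

```python
import numpy as np
import torch

CAT_CLASS = 3  # CIFAR-10 label index for "cat"


def select_and_normalize(data, targets, class_idx=CAT_CLASS):
    """Keep the images of one class and scale uint8 pixels to [-1, 1].

    data:    uint8 array of shape (N, 32, 32, 3)
    targets: sequence of N integer labels
    """
    mask = np.asarray(targets) == class_idx
    images = torch.from_numpy(data[mask]).float()  # (M, 32, 32, 3), values 0..255
    images = images.permute(0, 3, 1, 2)            # channels first: (M, 3, 32, 32)
    return images / 127.5 - 1.0                    # 0 -> -1, 255 -> 1


# Random stand-in data with the same layout as CIFAR-10.
rng = np.random.default_rng(0)
fake_data = rng.integers(0, 256, size=(10, 32, 32, 3), dtype=np.uint8)
fake_targets = [3, 5, 3, 0, 1, 3, 9, 2, 3, 7]

cats = select_and_normalize(fake_data, fake_targets)
print(cats.shape)  # torch.Size([4, 3, 32, 32])
```
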
Adversarial Training:
- For each epoch, batches of real images are sampled
- The discriminator is trained to classify real images as real (1) and generated images as fake (0)
- The generator is trained to produce images that the discriminator classifies as real
- Loss functions: Binary Cross Entropy for both networks
Training Parameters:
- Batch size: 64
- Learning rate: 0.0002
- Adam optimizer with betas=(0.5, 0.999)
- Epochs: 100
- Latent dimension: 100
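
One adversarial update with these settings can be sketched as below. The function name `train_step` and the tiny stand-in networks are hypothetical, chosen only to keep the example self-contained and fast on CPU; the loss, optimizer, learning rate, betas, batch size, and latent dimension follow the lists above.

```python
import torch
from torch import nn


def train_step(G, D, opt_g, opt_d, real, latent_dim=100):
    """One adversarial update with BCE loss; returns (d_loss, g_loss) as floats."""
    bce = nn.BCELoss()
    n = real.size(0)
    ones, zeros = torch.ones(n, 1), torch.zeros(n, 1)

    # Discriminator update: real images -> 1, generated images -> 0.
    fake = G(torch.randn(n, latent_dim))
    d_loss = bce(D(real), ones) + bce(D(fake.detach()), zeros)
    opt_d.zero_grad()
    d_loss.backward()
    opt_d.step()

    # Generator update: push D to label the generated batch as real.
    g_loss = bce(D(fake), ones)
    opt_g.zero_grad()
    g_loss.backward()
    opt_g.step()
    return d_loss.item(), g_loss.item()


# Tiny stand-in networks (not the architecture described in this README).
G = nn.Sequential(nn.Linear(100, 3 * 32 * 32), nn.Tanh(), nn.Unflatten(1, (3, 32, 32)))
D = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 1), nn.Sigmoid())
opt_g = torch.optim.Adam(G.parameters(), lr=0.0002, betas=(0.5, 0.999))
opt_d = torch.optim.Adam(D.parameters(), lr=0.0002, betas=(0.5, 0.999))

real = torch.rand(64, 3, 32, 32) * 2 - 1  # placeholder batch in [-1, 1]
d_loss, g_loss = train_step(G, D, opt_g, opt_d, real)
```

The generated batch is detached during the discriminator update so that only the discriminator's weights receive gradients from that loss.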

Results

The training shows characteristic GAN behavior:

- Initially both networks have high losses as they learn the data distribution
- The discriminator loss gradually decreases as it gets better at distinguishing real from fake
- The generator loss initially increases then fluctuates as it tries to fool the increasingly effective discriminator

The generated images progressively improve in quality throughout training. The final model produces plausible cat-like images with recognizable features.

Usage

Requirements

- PyTorch
- torchvision
- matplotlib

Training

# Train the model
python train_gan.py

Generate Images

# Generate new cat images using the trained model
python generate_images.py
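
The contents of the generation script are not shown here, so the following is only a sketch of what sampling from a trained generator typically involves: draw noise, run the generator without gradients, and map the tanh output from [-1, 1] back to [0, 1] for display. The helper name `sample_images` and the stand-in generator are hypothetical.

```python
import torch
from torch import nn


@torch.no_grad()
def sample_images(generator, n=16, latent_dim=100):
    """Sample n images and rescale them from [-1, 1] to [0, 1]."""
    generator.eval()  # use running BatchNorm statistics, if the model has any
    z = torch.randn(n, latent_dim)
    images = generator(z)
    return (images + 1) / 2


# Stand-in generator; in practice this would be the trained model.
generator = nn.Sequential(
    nn.Linear(100, 3 * 32 * 32), nn.Tanh(), nn.Unflatten(1, (3, 32, 32))
)
images = sample_images(generator, n=16)
```

The resulting tensor can be written to disk as a grid with `torchvision.utils.save_image(images, "cats.png", nrow=4)`, where the file name is just an example.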

Future Improvements

Architecture Enhancements:
- Add residual connections for more stable training
- Implement progressive growing for higher resolution images
Training Stability:
- Apply spectral normalization, or switch to a Wasserstein loss with gradient penalty (WGAN-GP), for more stable training
- Experiment with different learning rates and batch sizes
Image Quality:
- Train on a larger, higher-resolution dataset of cat images
- Implement conditional GAN for more controlled generation
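
As an illustration of the spectral normalization idea, PyTorch provides `torch.nn.utils.spectral_norm`, which can wrap the discriminator's layers. The sketch below is an assumption about how that might be applied here, with hypothetical names and channel widths; it is not part of the current model.

```python
import torch
from torch import nn
from torch.nn.utils import spectral_norm


def sn_conv(in_ch, out_ch):
    # spectral_norm rescales the weight so its largest singular value
    # (estimated by power iteration) stays close to 1.
    return nn.Sequential(
        spectral_norm(nn.Conv2d(in_ch, out_ch, 4, 2, 1)),
        nn.LeakyReLU(0.2, inplace=True),
    )


sn_discriminator = nn.Sequential(
    sn_conv(3, 64),     # 32x32 -> 16x16
    sn_conv(64, 128),   # 16x16 -> 8x8
    sn_conv(128, 256),  # 8x8 -> 4x4
    nn.Flatten(),
    spectral_norm(nn.Linear(256 * 4 * 4, 1)),
    nn.Sigmoid(),       # kept so the existing BCE loss still applies
)

scores = sn_discriminator(torch.randn(8, 3, 32, 32))
```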
