Yana Khare — Updated On July 7th, 2023
Artificial Intelligence GANs Generative AI Image News Use Cases

Introduction

Researchers from Google, the Max Planck Institute of Informatics, and MIT CSAIL have released a new AI technique. It allows users to manipulate images in seconds with just a click and drag. The new DragGAN is an AI editing tool that leverages a pre-trained GAN (Generative Adversarial Network) to synthesize ideas that precisely follow user input while remaining on the manifold of realistic images.

What is DragGAN?

DragGAN is an interactive approach for intuitive point-based image editing far more powerful than Photoshop’s Warp tool. Unlike Photoshop, which merely smushes pixels around, DragGAN uses AI to regenerate the underlying object. With DragGAN, users can rotate images as if they were 3D, change the dimensions of cars, manipulate smiles into frowns, and adjust reflections on lakes. Moreover, they can change the direction someone faces.

Working of DragGAN AI

General Framework and Optimisation of Latent Codes

What sets DragGAN apart from other approaches is its general framework which does not rely on domain-specific modeling or auxiliary networks. To achieve this, the researchers used an optimization of latent codes that incrementally moved multiple handle points toward their target locations alongside a point-tracking procedure to trace the trajectory of the handle points faithfully. Both components use the discriminative quality of intermediate feature maps of the GAN to yield pixel-precise image deformations and interactive performance.

Outperforming SOTA in GAN-Based Manipulation

According to the researchers, DragGAN by Google outperforms the state-of-the-art (SOTA) in GAN-based manipulation. Furthermore, it opens new directions for powerful image editing using generative priors. They look to extend point-based editing to 3D generative models in the coming months.

Importance of GAN Models

This new technique shows that GAN models are more impactful than pretty pictures generated from diffusion models, such as those used in tools like DALLE.2, Stable Diffusion, and Midjourney. While there are apparent reasons why diffusion models are gaining popularity for image synthesis, GANs saw the same rage and sparked interest three years after Ian Goodfellow proposed them. GAN uses two neural networks—a generator and a discriminator—to generate new and synthesized data instances.

Precise Control Over Image Manipulation

When editing images of diverse subjects, users can “deform an image with precise control over where pixels go. Thus manipulating the pose, shape, expression, and layout,” explains the researchers.

You can find code for DragGAN AI here.

Source: Dragganaitool.com

How to Use DragGAN AI?

To utilize the DragGAN AI tool for photo editing, follow these straightforward steps:

  1. Access the DragGAN website – https://dragganaitool.com/
  2. Upload the image you want to modify.
  3. Drag a point on the image to the desired location.
  4. Upon releasing the point, DragGAN will instantly adapt the image according to your modifications.
  5. Repeat the process by dragging additional points and fine-tuning the image as required.
  6. Once you’ve finished editing, simply click the “Save” button to preserve the edited image.

Features of DragGAN AI Image Editing Tool

Here are the features of DragGAN AI image editing tool:

  • Point-based editing: DragGAN allows users to adjust images by directly dragging and dropping points on the screen. This ensures accurate and realistic modifications beyond traditional editing software capabilities.
  • 3D model integration: DragGAN generates a 3D model of the image, enabling users to alter the posture, shape, emotion, and arrangement of objects within the image while maintaining coherence and realism.
  • User-friendly interface: DragGAN prioritizes a straightforward, user-friendly experience for seasoned editors and newcomers to AI-powered picture editing tools. The interface simplifies editing, empowering users to achieve their desired results easily.
  • Potential for revolutionizing image editing: DragGAN has the potential to redefine image editing due to its unique features. By combining point-based editing and 3D modeling, DragGAN expands the horizons of artistic expression, offering new pathways for creativity and transforming how we approach picture editing.
Source: Learn and Burn

Also Read: How to Become an AI Video Editor?

Tips to Use DragGAN AI

Maximize your experience with DragGAN using these helpful tips:

  1. Start with high-quality images: Opt for well-captured, high-resolution photographs with good lighting. Clear and detailed original images enhance the accuracy and quality of DragGAN’s modifications.
  2. Exercise precision in dragging points: Take your time and make precise adjustments when selecting and modifying points in DragGAN. This ensures more precise and realistic transformations. Small tweaks can make a significant difference.
  3. Explore different settings: Experiment with DragGAN’s various options and parameters to achieve your desired results. Play with different combinations to find the ideal settings for your specific image alteration needs.
  4. Embrace the creative process: Enjoy the freedom and artistic expression that DragGAN offers. Let your imagination run wild, and have fun exploring the possibilities. You can create captivating photographs that captivate your friends and family with practice.

Conclusion

This tool is a game-changing tool for editors! DragGAN simplifies the editing process while offering advanced features, revolutionizing the way you work. We hope this article provided valuable insights into the capabilities of DragGAN. Share your thoughts in the comment section below. Stay connected with us at Analytics Vidhya Blogs to stay updated on the latest advancements in the field of generative AI.

Frequently Asked Questions

Q1. Is DragGAN AI tool free?

A. DragGAN is an easy-to-use photo editing tool that is free to use. 

Q2. What is Google DragGAN?

A. DragGAN is a cutting-edge GAN created by Google researchers. GANs comprise two key elements: a generator and a discriminator. The generator produces lifelike images, while the discriminator assesses whether an image is genuine or artificially generated.

Q3. How do I access DragGAN?

A. After installing DragGAN, use the provided command to launch the editor tool via Gradio. Gradio operates on port 7860 (http://localhost:7860). To access it in your browser without specifying the port, you can open it through your firewall or set up an Nginx reverse proxy.

Q4. What is DragGAN website?

A. This is the link to the DragGAN AI tool – https://dragganaitool.com/ 

About the Author

Yana Khare

Our Top Authors

Download Analytics Vidhya App for the Latest blog/Article