Skip to main navigation Skip to search Skip to main content

Video Modification in Drone and Satellite Imagery

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The ability to create and detect synthetic video is becoming critically important to scene understanding. Techniques for synthetic manipulation and augmentation of data increase diversity within available datasets, while not requiring laborious labeling efforts. That is, the ability to create synthetic video can enable augmentation of small realistic datasets on which to further train Artificial Intelligence and Machine Learning (AI/ML) algorithms. Thus, it may be desirable to add, remove, or modify vehicles in satellite and overhead video. In our previous work, we leveraged generative adversarial networks (GANs) to transform cars into trucks (and vice versa) in static images. We utilized an attention-based masking approach that assists the network in transformation of the object and not background. In addition, we demonstrated the benefits of numerous data augmentation procedures, including presenting a new artificial dataset of vehicles from an aerial perspective and introducing novel augmentation techniques appropriate for our network architectures. This work extends the applied techniques from still imagery to video. We employ a few different architectures: (1) a fully dynamic 3D convolutional discriminator network with static generators, (2) a fully dynamic 3D convolutional discriminator and generator network, and (3) an architecture that computes "warp" between frames for input to a static generator. Additionally, to help enforce consistency, we experiment with an interframe classifier that verifies whether two frames belong to the same video sequence or not. We run experiments on a real-world dataset, presenting promising results in terms of FID, KID, and metrics developed from a classifier trained on our dataset.

Original languageEnglish
Title of host publicationDisruptive Technologies in Information Sciences VIII
EditorsMisty Blowers, Bryant T. Wysocki
PublisherSPIE
ISBN (Electronic)9781510674349
DOIs
StatePublished - 2024
EventDisruptive Technologies in Information Sciences VIII 2024 - National Harbor, United States
Duration: Apr 22 2024Apr 25 2024

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume13058

Conference

ConferenceDisruptive Technologies in Information Sciences VIII 2024
Country/TerritoryUnited States
CityNational Harbor
Period04/22/2404/25/24

Keywords

  • GAN
  • deep learning
  • image translation
  • inpainting
  • video transformation

Fingerprint

Dive into the research topics of 'Video Modification in Drone and Satellite Imagery'. Together they form a unique fingerprint.

Cite this