I. Introduction
In this study, our objective is to generate a video that is consistent in both temporal and spatial aspects based on text input. Additionally, we aim to enhance the diversity of the generated videos to cater to user preferences.
In this study, our objective is to generate a video that is consistent in both temporal and spatial aspects based on text input. Additionally, we aim to enhance the diversity of the generated videos to cater to user preferences.
2020 International Symposium ELMAR
Published: 2020
IEEE Transactions on Multimedia
Published: 2019
A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.
© Copyright 2024 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.