Image Captioning with Style Using Generative Adversarial Networks
DOI: http://dx.doi.org/10.30630/joiv.6.1.709