Frome, Andrea; Corrado, Greg S.; Shlens, Jonathon; Bengio, Samy; Dean, Jeffrey; Ranzato, Marc'Aurelio; Mikolov, Tomas (2013-12-05). “DeViSE: a deep visual-semantic embedding model”. Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2 (Lake Tahoe, Nevada: Curran Associates Inc.): 2121–2129. https://dl.acm.org/doi/10.5555/2999792.2999849.