Mostrando 5 resultados de: 5
Filtros aplicados
Publisher
Pattern Recognition Letters(1)
Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020(1)
Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022(1)
Proceedings of the IEEE International Conference on Computer Vision(1)
Proceedings of the International Conference on Document Analysis and Recognition, ICDAR(1)
Área temáticas
Métodos informáticos especiales(4)
Funcionamiento de bibliotecas y archivos(2)
Imprenta y actividades conexas(2)
Biblioteconomía y Documentación informatica(1)
Ciencias de la computación(1)
Origen
scopus(5)
Fine-grained image classification and retrieval by combining visual and locally pooled textual features
Conference ObjectAbstract: Text contained in an image carries high-level semantics that can be exploited to achieve richer imagPalabras claves:Autores:Dey S., Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.Fuentes:scopusMultimodal grid features and cell pointers for scene text visual question answering
ArticleAbstract: This paper presents a new model for the task of scene text visual question answering. In this task qPalabras claves:41A05, 41A10, 65D05, 65D17, deep learning, MSC, Multi-modal learning, Scene text, Visual question answeringAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusICDAR 2019 competition on scene text visual question answering
Conference ObjectAbstract: This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (STPalabras claves:Scene text, Scene understanding, Vision and language, Visual question answeringAutores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Mathew M., Rusiñol M., Tito R., Valveny E.Fuentes:scopusLet there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Conference ObjectAbstract: Explaining an image with missing or non-existent objects is known as object bias (hallucination) inPalabras claves:Vision and LanguagesAutores:Furkan Biten A., Karatzas D., Lluís Álvarez GómezFuentes:scopusScene text visual question answering
Conference ObjectAbstract: Current visual question answering datasets do not consider the rich semantic information conveyed byPalabras claves:Autores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopus