Mostrando 10 resultados de: 10
Publisher
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)(2)
Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021(2)
Pattern Recognition(1)
Pattern Recognition Letters(1)
Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020(1)
Área temáticas
Métodos informáticos especiales(7)
Biblioteconomía y Documentación informatica(3)
Funcionamiento de bibliotecas y archivos(3)
Imprenta y actividades conexas(2)
Programación informática, programas, datos, seguridad(2)
Área de conocimiento
Ciencias de la computación(10)
Visión por computadora(7)
Aprendizaje automático(2)
Computadora(1)
Inteligencia artificial(1)
Origen
scopus(10)
Fine-grained image classification and retrieval by combining visual and locally pooled textual features
Conference ObjectAbstract: Text contained in an image carries high-level semantics that can be exploited to achieve richer imagPalabras claves:Autores:Dey S., Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.Fuentes:scopusMulti-modal reasoning graph for scene-text based fine-grained image classification and retrieval
Conference ObjectAbstract: Scene text instances found in natural images carry explicit semantic information that can provide imPalabras claves:Autores:Dey S., Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.Fuentes:scopusMultimodal grid features and cell pointers for scene text visual question answering
ArticleAbstract: This paper presents a new model for the task of scene text visual question answering. In this task qPalabras claves:41A05, 41A10, 65D05, 65D17, deep learning, MSC, Multi-modal learning, Scene text, Visual question answeringAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusICDAR 2019 competition on scene text visual question answering
Conference ObjectAbstract: This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (STPalabras claves:Scene text, Scene understanding, Vision and language, Visual question answeringAutores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Mathew M., Rusiñol M., Tito R., Valveny E.Fuentes:scopusMUST-VQA: MUltilingual Scene-Text VQA
Conference ObjectAbstract: In this paper, we present a framework for Multilingual Scene Text Visual Question Answering that deaPalabras claves:Multilingual models, Power of language models, Scene text, Translation robustness, Visual question answering, Zero-shot transferAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A., Vivoli E.Fuentes:scopusIs An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Conference ObjectAbstract: The task of image-text matching aims to map representations from different modalities into a commonPalabras claves:Vision and LanguagesAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.Fuentes:scopusScene text visual question answering
Conference ObjectAbstract: Current visual question answering datasets do not consider the rich semantic information conveyed byPalabras claves:Autores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusSingle shot scene text retrieval
Conference ObjectAbstract: Textual information found in scene images provides high level semantic information about the image aPalabras claves:convolutional neural networks, Image retrieval, PHOC, Region proposals networks, Scene text, Word spottingAutores:Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M.Fuentes:scopusReal-time Lexicon-free Scene Text Retrieval
ArticleAbstract: In this work, we address the task of scene text retrieval: given a text query, the system returns alPalabras claves:convolutional neural networks, Image retrieval, PHOC, Region proposal networks, Scene text detection, Scene text recognition, Word spottingAutores:Dey S., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusStacMR: Scene-text aware cross-modal retrieval
Conference ObjectAbstract: Recent models for cross-modal retrieval have benefited from an increasingly rich understanding of viPalabras claves:Autores:Karatzas D., Larlus D., Lluís Álvarez Gómez, Mafla A., Rezende R.S.Fuentes:scopus