Mostrando 4 resultados de: 4
Filtros aplicados
Publisher
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)(2)
Pattern Recognition Letters(1)
Proceedings of the International Conference on Document Analysis and Recognition, ICDAR(1)
Área temáticas
Métodos informáticos especiales(3)
Funcionamiento de bibliotecas y archivos(2)
Biblioteconomía y Documentación informatica(1)
Imprenta y actividades conexas(1)
Área de conocimiento
Ciencias de la computación(4)
Visión por computadora(3)
Inteligencia artificial(1)
Origen
scopus(4)
Multimodal grid features and cell pointers for scene text visual question answering
ArticleAbstract: This paper presents a new model for the task of scene text visual question answering. In this task qPalabras claves:41A05, 41A10, 65D05, 65D17, deep learning, MSC, Multi-modal learning, Scene text, Visual question answeringAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusICDAR 2019 competition on scene text visual question answering
Conference ObjectAbstract: This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (STPalabras claves:Scene text, Scene understanding, Vision and language, Visual question answeringAutores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Mathew M., Rusiñol M., Tito R., Valveny E.Fuentes:scopusMUST-VQA: MUltilingual Scene-Text VQA
Conference ObjectAbstract: In this paper, we present a framework for Multilingual Scene Text Visual Question Answering that deaPalabras claves:Multilingual models, Power of language models, Scene text, Translation robustness, Visual question answering, Zero-shot transferAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A., Vivoli E.Fuentes:scopusSingle shot scene text retrieval
Conference ObjectAbstract: Textual information found in scene images provides high level semantic information about the image aPalabras claves:convolutional neural networks, Image retrieval, PHOC, Region proposals networks, Scene text, Word spottingAutores:Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M.Fuentes:scopus