Mostrando 7 resultados de: 7
Filtros aplicados
Publisher
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)(1)
Pattern Recognition(1)
Pattern Recognition Letters(1)
Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021(1)
Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022(1)
Área temáticas
Biblioteconomía y Documentación informatica(2)
Funcionamiento de bibliotecas y archivos(2)
Imprenta y actividades conexas(2)
Artes(1)
Filosofía y teoría(1)
Área de conocimiento
Ciencias de la computación(7)
Visión por computadora(4)
Aprendizaje automático(2)
Inteligencia artificial(1)
Semiótica(1)
Origen
scopus(7)
Multi-modal reasoning graph for scene-text based fine-grained image classification and retrieval
Conference ObjectAbstract: Scene text instances found in natural images carry explicit semantic information that can provide imPalabras claves:Autores:Dey S., Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.Fuentes:scopusMultimodal grid features and cell pointers for scene text visual question answering
ArticleAbstract: This paper presents a new model for the task of scene text visual question answering. In this task qPalabras claves:41A05, 41A10, 65D05, 65D17, deep learning, MSC, Multi-modal learning, Scene text, Visual question answeringAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusICDAR 2019 competition on scene text visual question answering
Conference ObjectAbstract: This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (STPalabras claves:Scene text, Scene understanding, Vision and language, Visual question answeringAutores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Mathew M., Rusiñol M., Tito R., Valveny E.Fuentes:scopusMUST-VQA: MUltilingual Scene-Text VQA
Conference ObjectAbstract: In this paper, we present a framework for Multilingual Scene Text Visual Question Answering that deaPalabras claves:Multilingual models, Power of language models, Scene text, Translation robustness, Visual question answering, Zero-shot transferAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A., Vivoli E.Fuentes:scopusIs An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Conference ObjectAbstract: The task of image-text matching aims to map representations from different modalities into a commonPalabras claves:Vision and LanguagesAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.Fuentes:scopusScene text visual question answering
Conference ObjectAbstract: Current visual question answering datasets do not consider the rich semantic information conveyed byPalabras claves:Autores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusReal-time Lexicon-free Scene Text Retrieval
ArticleAbstract: In this work, we address the task of scene text retrieval: given a text query, the system returns alPalabras claves:convolutional neural networks, Image retrieval, PHOC, Region proposal networks, Scene text detection, Scene text recognition, Word spottingAutores:Dey S., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopus