Mostrando 5 resultados de: 5
Publisher
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)(1)
Pattern Recognition(1)
Pattern Recognition Letters(1)
Proceedings of the IEEE International Conference on Computer Vision(1)
Proceedings of the International Conference on Document Analysis and Recognition, ICDAR(1)
Área temáticas
Métodos informáticos especiales(4)
Funcionamiento de bibliotecas y archivos(3)
Biblioteconomía y Documentación informatica(2)
Imprenta y actividades conexas(2)
Instrumentos de precisión y otros dispositivos(1)
Origen
scopus(5)
Multimodal grid features and cell pointers for scene text visual question answering
ArticleAbstract: This paper presents a new model for the task of scene text visual question answering. In this task qPalabras claves:41A05, 41A10, 65D05, 65D17, deep learning, MSC, Multi-modal learning, Scene text, Visual question answeringAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusOCR-IDL: OCR Annotations for Industry Document Library Dataset
Conference ObjectAbstract: Pretraining has proven successful in Document Intelligence tasks where deluge of documents are usedPalabras claves:Autores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Tito R., Valveny E.Fuentes:scopusICDAR 2019 competition on scene text visual question answering
Conference ObjectAbstract: This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (STPalabras claves:Scene text, Scene understanding, Vision and language, Visual question answeringAutores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Mathew M., Rusiñol M., Tito R., Valveny E.Fuentes:scopusScene text visual question answering
Conference ObjectAbstract: Current visual question answering datasets do not consider the rich semantic information conveyed byPalabras claves:Autores:Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusReal-time Lexicon-free Scene Text Retrieval
ArticleAbstract: In this work, we address the task of scene text retrieval: given a text query, the system returns alPalabras claves:convolutional neural networks, Image retrieval, PHOC, Region proposal networks, Scene text detection, Scene text recognition, Word spottingAutores:Dey S., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopus