Good news, everyone! context driven entity-aware captioning for news images
Type: Conference Object
Abstract: Current image captioning systems perform at a merely descriptive level, essentially enumerating the...
Keywords: Document Analysis, Vision + Language
Authors: Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Rusiñol M.
Source: scopus

Exploring hate speech detection in multimodal publications
Type: Conference Object
Abstract: In this work we target the problem of hate speech detection in multimodal publications formed by a t...
Authors: Gibert J., Gómez R., Karatzas D., Lluís Álvarez Gómez
Source: scopus

Multi-modal reasoning graph for scene-text based fine-grained image classification and retrieval
Type: Conference Object
Abstract: Scene text instances found in natural images carry explicit semantic information that can provide im...
Authors: Dey S., Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.
Source: scopus

Object proposals for text extraction in the wild
Type: Conference Object
Abstract: Object Proposals is a recent computer vision technique receiving increasing interest from the resear...
Authors: Karatzas D., Lluís Álvarez Gómez
Source: scopus

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Type: Conference Object
Abstract: Low resource Handwritten Text Recognition (HTR) is a hard problem due to the scarce annotated data a...
Keywords: Document Analysis
Authors: Dey S., Fornes A., Furkan Biten A., Karatzas D., Kessentini Y., Llados J., Lluís Álvarez Gómez, Souibgui M.A.
Source: scopus

Learning to Rank Words: Optimizing Ranking Metrics for Word Spotting
Type: Conference Object
Abstract: In this paper, we explore and evaluate the use of ranking-based objective functions for learning sim...
Keywords: Ranking loss, Smooth-AP, Smooth-nDCG, Word spotting
Authors: Llados J., Lluís Álvarez Gómez, Molina A., Ramos-Terrades O., Riba P.
Source: scopus

Learning to learn from web data through deep semantic embeddings
Type: Conference Object
Abstract: In this paper we propose to learn a multimodal image and text embedding from Web and Social Media da...
Keywords: Multimodal embeddings, Multimodal retrieval, Self-supervised learning, Text embeddings, Webly supervised learning
Authors: Gibert J., Gómez R., Karatzas D., Lluís Álvarez Gómez
Source: scopus

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Type: Conference Object
Abstract: The task of image-text matching aims to map representations from different modalities into a common...
Keywords: Vision and Languages
Authors: Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.
Source: scopus

LSDE: Levenshtein Space Deep Embedding for Query-by-String Word Spotting
Type: Conference Object
Abstract: In this paper we present the LSDE string representation and its application to handwritten word spot...
Keywords: CNNs, Deep embeddings, Handwritten Keyword Spotting, Query by string
Authors: Karatzas D., Lluís Álvarez Gómez, Rusiñol M.
Source: scopus

Self-supervised learning from web data for multimodal retrieval
Type: Book Part
Abstract: Self-supervised learning from multimodal image and text data allows deep neural networks to learn po...
Keywords: Multimodal embedding, Multimodal retrieval, Self-supervised learning, Text embeddings, Webly supervised learning
Authors: Gibert J., Gómez R., Karatzas D., Lluís Álvarez Gómez
Source: scopus