Mostrando 10 resultados de: 10
Filtros aplicados
Publisher
Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2022(2)
Proceedings of the International Conference on Document Analysis and Recognition, ICDAR(2)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)(1)
Multimodal Scene Understanding: Algorithms, Applications and Deep Learning(1)
Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020(1)
Área temáticas
Métodos informáticos especiales(4)
Funcionamiento de bibliotecas y archivos(3)
Ciencias de la computación(2)
Medios documentales, educativos, informativos; periodismo(2)
Programación informática, programas, datos, seguridad(2)
Origen
scopus(10)
Good news, everyone! context driven entity-aware captioning for news images
Conference ObjectAbstract: Current image captioning systems perform at a merely descriptive level, essentially enumerating thePalabras claves:Document Analysis, Vision + LanguageAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Rusiñol M.Fuentes:scopusExploring hate speech detection in multimodal publications
Conference ObjectAbstract: In this work we target the problem of hate speech detection in multimodal publications formed by a tPalabras claves:Autores:Gibert J., Gómez R., Karatzas D., Lluís Álvarez GómezFuentes:scopusMulti-modal reasoning graph for scene-text based fine-grained image classification and retrieval
Conference ObjectAbstract: Scene text instances found in natural images carry explicit semantic information that can provide imPalabras claves:Autores:Dey S., Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.Fuentes:scopusObject proposals for text extraction in the wild
Conference ObjectAbstract: Object Proposals is a recent computer vision technique receiving increasing interest from the researPalabras claves:Autores:Karatzas D., Lluís Álvarez GómezFuentes:scopusOne-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Conference ObjectAbstract: Low resource Handwritten Text Recognition (HTR) is a hard problem due to the scarce annotated data aPalabras claves:Document AnalysisAutores:Dey S., Fornes A., Furkan Biten A., Karatzas D., Kessentini Y., Llados J., Lluís Álvarez Gómez, Souibgui M.A.Fuentes:scopusLearning to learn from web data through deep semantic embeddings
Conference ObjectAbstract: In this paper we propose to learn a multimodal image and text embedding from Web and Social Media daPalabras claves:Multimodal embeddings, Multimodal retrieval, Self-supervised learning, Text embeddings, Webly supervised learningAutores:Gibert J., Gómez R., Karatzas D., Lluís Álvarez GómezFuentes:scopusIs An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching
Conference ObjectAbstract: The task of image-text matching aims to map representations from different modalities into a commonPalabras claves:Vision and LanguagesAutores:Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Mafla A.Fuentes:scopusLSDE: Levenshtein Space Deep Embedding for Query-by-String Word Spotting
Conference ObjectAbstract: In this paper we present the LSDE string representation and its application to handwritten word spotPalabras claves:CNNs, Deep embeddings, Handwritten Keyword Spotting, Query by stringAutores:Karatzas D., Lluís Álvarez Gómez, Rusiñol M.Fuentes:scopusSelf-supervised learning from web data for multimodal retrieval
Book PartAbstract: Self-supervised learning from multimodal image and text data allows deep neural networks to learn poPalabras claves:Multimodal embedding, Multimodal retrieval, Self-supervised learning, Text embeddings, Webly supervised learningAutores:Gibert J., Gómez R., Karatzas D., Lluís Álvarez GómezFuentes:scopusSelf-supervised learning of visual features through embedding images into text topic spaces
Conference ObjectAbstract: End-to-end training from scratch of current deep architectures for new computer vision problems woulPalabras claves:Autores:Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Patel Y., Rusiñol M.Fuentes:scopus