Mostrando 6 resultados de: 6
Publisher
Pattern Recognition(2)
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)(1)
Multimodal Scene Understanding: Algorithms, Applications and Deep Learning(1)
Proceedings - 13th IAPR International Workshop on Document Analysis Systems, DAS 2018(1)
Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021(1)
Área temáticas
Métodos informáticos especiales(3)
Biblioteconomía y Documentación informatica(2)
Funcionamiento de bibliotecas y archivos(2)
Física aplicada(1)
Área de conocimiento
Ciencias de la computación(5)
Visión por computadora(4)
Aprendizaje automático(2)
Análisis de datos(1)
Origen
scopus(6)
Cutting Sayre's Knot: Reading Scene Text without Segmentation. Application to Utility Meters
Conference ObjectAbstract: In this paper we present a segmentation-free system for reading text in natural scenes. A CNN architPalabras claves:cnn, End-to-end Systems, Robust reading, Utility MetersAutores:Karatzas D., Lluís Álvarez Gómez, Rusiñol M.Fuentes:scopusImproving patch-based scene text script identification with ensembles of conjoined networks
ArticleAbstract: This paper focuses on the problem of script identification in scene text images. Facing this problemPalabras claves:convolutional neural networks, Ensemble of conjoined networks, Multi-language OCR, Scene text understanding, script identificationAutores:Karatzas D., Lluís Álvarez Gómez, Nicolaou A.Fuentes:scopusLearning to learn from web data through deep semantic embeddings
Conference ObjectAbstract: In this paper we propose to learn a multimodal image and text embedding from Web and Social Media daPalabras claves:Multimodal embeddings, Multimodal retrieval, Self-supervised learning, Text embeddings, Webly supervised learningAutores:Gibert J., Gómez R., Karatzas D., Lluís Álvarez GómezFuentes:scopusSelf-supervised learning from web data for multimodal retrieval
Book PartAbstract: Self-supervised learning from multimodal image and text data allows deep neural networks to learn poPalabras claves:Multimodal embedding, Multimodal retrieval, Self-supervised learning, Text embeddings, Webly supervised learningAutores:Gibert J., Gómez R., Karatzas D., Lluís Álvarez GómezFuentes:scopusReal-time Lexicon-free Scene Text Retrieval
ArticleAbstract: In this work, we address the task of scene text retrieval: given a text query, the system returns alPalabras claves:convolutional neural networks, Image retrieval, PHOC, Region proposal networks, Scene text detection, Scene text recognition, Word spottingAutores:Dey S., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.Fuentes:scopusStacMR: Scene-text aware cross-modal retrieval
Conference ObjectAbstract: Recent models for cross-modal retrieval have benefited from an increasingly rich understanding of viPalabras claves:Autores:Karatzas D., Larlus D., Lluís Álvarez Gómez, Mafla A., Rezende R.S.Fuentes:scopus