Documents by Karatzas D. | REDI

Showing 6 of 6 results

Applied filters

Subject area: "Computer programming, programs, data, security"

Publication subtype

Conference Object (3)

Publisher

Pattern Recognition (2)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (1)

Multimodal Scene Understanding: Algorithms, Applications and Deep Learning (1)

Proceedings - 13th IAPR International Workshop on Document Analysis Systems, DAS 2018 (1)

Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 (1)

Subject areas

Special computer methods (3)

Library and information sciences (2)

Operations of libraries and archives (2)

Applied physics (1)

Knowledge area

Computer science (5)

Computer vision (4)

Machine learning (2)

Data analysis (1)

Keywords

Multimodal retrieval (2)

Self-supervised learning (2)

Text embeddings (2)

Webly supervised learning (2)

Convolutional neural networks (2)

Cutting Sayre's Knot: Reading Scene Text without Segmentation. Application to Utility Meters

Conference Object

Abstract: In this paper we present a segmentation-free system for reading text in natural scenes. A CNN archit

Keywords:

CNN, End-to-end systems, Robust reading, Utility meters

Karatzas D., Lluís Álvarez Gómez, Rusiñol M.

Improving patch-based scene text script identification with ensembles of conjoined networks

Abstract: This paper focuses on the problem of script identification in scene text images. Facing this problem

Keywords:

Convolutional neural networks, Ensemble of conjoined networks, Multi-language OCR, Scene text understanding, Script identification

Karatzas D., Lluís Álvarez Gómez, Nicolaou A.

Learning to learn from web data through deep semantic embeddings

Conference Object

Abstract: In this paper we propose to learn a multimodal image and text embedding from Web and Social Media da

Keywords:

Multimodal embeddings, Multimodal retrieval, Self-supervised learning, Text embeddings, Webly supervised learning

Gibert J., Gómez R., Karatzas D., Lluís Álvarez Gómez

Self-supervised learning from web data for multimodal retrieval

Abstract: Self-supervised learning from multimodal image and text data allows deep neural networks to learn po

Keywords:

Multimodal embedding, Multimodal retrieval, Self-supervised learning, Text embeddings, Webly supervised learning

Gibert J., Gómez R., Karatzas D., Lluís Álvarez Gómez

Real-time Lexicon-free Scene Text Retrieval

Abstract: In this work, we address the task of scene text retrieval: given a text query, the system returns al

Keywords:

Convolutional neural networks, Image retrieval, PHOC, Region proposal networks, Scene text detection, Scene text recognition, Word spotting

Dey S., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.

StacMR: Scene-text aware cross-modal retrieval

Conference Object

Abstract: Recent models for cross-modal retrieval have benefited from an increasingly rich understanding of vi

Keywords:

Karatzas D., Larlus D., Lluís Álvarez Gómez, Mafla A., Rezende R.S.