Documentos de Rusiñol M. | REDI

Regresar

Mostrando 10 resultados de: 14

Subtipo de publicación

Conference Object(12)

Publisher

Proceedings of the International Conference on Document Analysis and Recognition, ICDAR(3)

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)(2)

Proceedings - 13th IAPR International Workshop on Document Analysis Systems, DAS 2018(2)

ICMR 2019 - Proceedings of the 2019 ACM International Conference on Multimedia Retrieval(1)

Pattern Recognition(1)

Área temáticas

Métodos informáticos especiales(9)

Funcionamiento de bibliotecas y archivos(5)

Ciencias de la computación(3)

Biblioteconomía y Documentación informatica(2)

Imprenta y actividades conexas(2)

Área de conocimiento

Ciencias de la computación(10)

Visión por computadora(9)

Aprendizaje automático(3)

Comunicación(1)

Objetivos de Desarrollo Sostenible

ODS 4: Educación de calidad(13)

ODS 9: Industria, innovación e infraestructura(13)

ODS 17: Alianzas para lograr los objetivos(11)

ODS 11: Ciudades y comunidades sostenibles(1)

ODS 16: Paz, justicia e instituciones sólidas(1)

Año de Publicación

Origen

Palabras Claves

Image retrieval(2)

Robust reading(2)

Scene text detection(2)

ICDAR 2019 competition on scene text visual question answering

Conference Object

Abstract: This paper presents final results of ICDAR 2019 Scene Text Visual Question Answering competition (ST

Palabras claves:

Scene text, Scene understanding, Vision and language, Visual question answering

Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Mathew M., Rusiñol M., Tito R., Valveny E.

Cutting Sayre's Knot: Reading Scene Text without Segmentation. Application to Utility Meters

Conference Object

Abstract: In this paper we present a segmentation-free system for reading text in natural scenes. A CNN archit

Palabras claves:

cnn, End-to-end Systems, Robust reading, Utility Meters

Karatzas D., Lluís Álvarez Gómez, Rusiñol M.

Dynamic lexicon generation for natural scene images

Conference Object

Abstract: Many scene text understanding methods approach the endto-end recognition problem from a word-spottin

Palabras claves:

cnn, Lexicon generation, Photo OCR, Scene text, Scene understanding, Topic modeling

Karatzas D., Lluís Álvarez Gómez, Patel Y., Rusiñol M.

Good news, everyone! context driven entity-aware captioning for news images

Conference Object

Abstract: Current image captioning systems perform at a merely descriptive level, essentially enumerating the

Palabras claves:

Document Analysis, Vision + Language

Furkan Biten A., Karatzas D., Lluís Álvarez Gómez, Rusiñol M.

LSDE: Levenshtein Space Deep Embedding for Query-by-String Word Spotting

Conference Object

Abstract: In this paper we present the LSDE string representation and its application to handwritten word spot

Palabras claves:

CNNs, Deep embeddings, Handwritten Keyword Spotting, Query by string

Karatzas D., Lluís Álvarez Gómez, Rusiñol M.

The robust reading competition annotation and evaluation platform

Conference Object

Abstract: The ICDAR Robust Reading Competition (RRC), initiated in 2003 and re-established in 2011, has become

Palabras claves:

data annotation, ground truthing, ONLINE PLATFORM, Performance evaluation, Robust reading

Karatzas D., Lluís Álvarez Gómez, Nicolaou A., Rusiñol M.

Scene text visual question answering

Conference Object

Abstract: Current visual question answering datasets do not consider the rich semantic information conveyed by

Palabras claves:

Furkan Biten A., Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Mafla A., Rusiñol M., Tito R., Valveny E.

Selective style transfer for text

Conference Object

Abstract: This paper explores the possibilities of image style transfer applied to text maintaining the origin

Palabras claves:

data augmentation, Scene text detection, Style transfer, Text style transfer

Furkan Biten A., Gibert J., Gómez R., Karatzas D., Lluís Álvarez Gómez, Rusiñol M.

Self-supervised learning of visual features through embedding images into text topic spaces

Conference Object

Abstract: End-to-end training from scratch of current deep architectures for new computer vision problems woul

Palabras claves:

Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Patel Y., Rusiñol M.

Self-supervised visual representations for cross-modal retrieval

Conference Object

Abstract: Cross-modal retrieval methods have been significantly improved in last years with the use of deep ne

Palabras claves:

Cross-modal retrieval, Self-supervised learning, Visual representations

Jawahar C.V., Karatzas D., Lluís Álvarez Gómez, Patel Y., Rusiñol M.