Skip to main content

Showing 1–15 of 15 results for author: Otto, C

  1. arXiv:2407.01074  [pdf, other

    cs.CV cs.GR

    Multimodal Conditional 3D Face Geometry Generation

    Authors: Christopher Otto, Prashanth Chandran, Sebastian Weiss, Markus Gross, Gaspard Zoss, Derek Bradley

    Abstract: We present a new method for multimodal conditional 3D face geometry generation that allows user-friendly control over the output identity and expression via a number of different conditioning signals. Within a single model, we demonstrate 3D faces generated from artistic sketches, 2D face landmarks, Canny edges, FLAME face model parameters, portrait photos, or text prompts. Our approach is based o… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2310.19580  [pdf, other

    cs.CV cs.GR

    A Perceptual Shape Loss for Monocular 3D Face Reconstruction

    Authors: Christopher Otto, Prashanth Chandran, Gaspard Zoss, Markus Gross, Paulo Gotardo, Derek Bradley

    Abstract: Monocular 3D face reconstruction is a wide-spread topic, and existing approaches tackle the problem either through fast neural network inference or offline iterative reconstruction of face geometry. In either case carefully-designed energy functions are minimized, commonly including loss terms like a photometric loss, a landmark reprojection loss, and others. In this work we propose a new loss fun… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to PG 2023. Project page: https://studios.disneyresearch.com/2023/10/09/a-perceptual-shape-loss-for-monocular-3d-face-reconstruction/ Video: https://www.youtube.com/watch?v=RYdyoIZEuUI

    Journal ref: Computer Graphics Forum, vol. 42, no. 7, 2023

  3. Predicting Knowledge Gain for MOOC Video Consumption

    Authors: Christian Otto, Markos Stamatakis, Anett Hoppe, Ralph Ewerth

    Abstract: Informal learning on the Web using search engines as well as more structured learning on MOOC platforms have become very popular in recent years. As a result of the vast amount of available learning resources, intelligent retrieval and recommendation methods are indispensable -- this is true also for MOOC videos. However, the automatic assessment of this content with regard to predicting (potentia… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: 13 pages, 1 figure, 3 tables

    Journal ref: AIED 2022. Lecture Notes in Computer Science, vol 13356, pp. 458-462

  4. arXiv:2205.01989  [pdf, other

    cs.CL cs.AI cs.CV cs.MM cs.SI

    MM-Claims: A Dataset for Multimodal Claim Detection in Social Media

    Authors: Gullal S. Cheema, Sherzod Hakimov, Abdul Sittar, Eric Müller-Budack, Christian Otto, Ralph Ewerth

    Abstract: In recent years, the problem of misinformation on the web has become widespread across languages, countries, and various social media platforms. Although there has been much work on automated fake news detection, the role of images and their variety are not well explored. In this paper, we investigate the roles of image and text at an earlier stage of the fake news detection pipeline, called claim… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: Accepted to Findings of NAACL 2022

  5. SaL-Lightning Dataset: Search and Eye Gaze Behavior, Resource Interactions and Knowledge Gain during Web Search

    Authors: Christian Otto, Markus Rokicki, Georg Pardi, Wolfgang Gritz, Daniel Hienert, Ran Yu, Johannes von Hoyer, Anett Hoppe, Stefan Dietze, Peter Holtz, Yvonne Kammerer, Ralph Ewerth

    Abstract: The emerging research field Search as Learning investigates how the Web facilitates learning through modern information retrieval systems. SAL research requires significant amounts of data that capture both search behavior of users and their acquired knowledge in order to obtain conclusive insights or train supervised machine learning models. However, the creation of such datasets is costly and re… ▽ More

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: To be published at the 2022 ACM SIGIR Conference on Human Information Interaction and Retrieval (CHIIR '22)

  6. arXiv:2106.06244  [pdf, other

    cs.IR

    Predicting Knowledge Gain during Web Search based on Multimedia Resource Consumption

    Authors: Christian Otto, Ran Yu, Georg Pardi, Johannes von Hoyer, Markus Rokicki, Anett Hoppe, Peter Holtz, Yvonne Kammerer, Stefan Dietze, Ralph Ewerth

    Abstract: In informal learning scenarios the popularity of multimedia content, such as video tutorials or lectures, has significantly increased. Yet, the users' interactions, navigation behavior, and consequently learning outcome, have not been researched extensively. Related work in this field, also called search as learning, has focused on behavioral or text resource features to predict learning outcome a… ▽ More

    Submitted 11 June, 2021; originally announced June 2021.

    Comments: 13 pages, 2 figures, 2 tables

  7. Investigating Correlations of Automatically Extracted Multimodal Features and Lecture Video Quality

    Authors: Jianwei Shi, Christian Otto, Anett Hoppe, Peter Holtz, Ralph Ewerth

    Abstract: Ranking and recommendation of multimedia content such as videos is usually realized with respect to the relevance to a user query. However, for lecture videos and MOOCs (Massive Open Online Courses) it is not only required to retrieve relevant videos, but particularly to find lecture videos of high quality that facilitate learning, for instance, independent of the video's or speaker's popularity.… ▽ More

    Submitted 28 May, 2020; originally announced May 2020.

    ACM Class: H.5.1

    Journal ref: SALMM '19: Proceedings of the 1st International Workshop on Search as Learning with Multimedia Information, co-located with ACM Multimedia 2019

  8. Visual Summarization of Scholarly Videos using Word Embeddings and Keyphrase Extraction

    Authors: Hang Zhou, Christian Otto, Ralph Ewerth

    Abstract: Effective learning with audiovisual content depends on many factors. Besides the quality of the learning resource's content, it is essential to discover the most relevant and suitable video in order to support the learning process most effectively. Video summarization techniques facilitate this goal by providing a quick overview over the content. It is especially useful for longer recordings such… ▽ More

    Submitted 25 November, 2019; originally announced December 2019.

    Comments: 12 pages, 5 figures

  9. Understanding, Categorizing and Predicting Semantic Image-Text Relations

    Authors: Christian Otto, Matthias Springstein, Avishek Anand, Ralph Ewerth

    Abstract: Two modalities are often used to convey information in a complementary and beneficial manner, e.g., in online news, videos, educational resources, or scientific publications. The automatic understanding of semantic correlations between text and associated images as well as their interplay has a great potential for enhanced multimodal web search and recommender systems. However, automatic understan… ▽ More

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: 8 pages, 8 Figures, 5 tables

    Journal ref: In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). ACM, New York, NY, USA, 168-176

  10. Bridging the Gap Between Computational Photography and Visual Recognition

    Authors: Rosaura G. VidalMata, Sreya Banerjee, Brandon RichardWebster, Michael Albright, Pedro Davalos, Scott McCloskey, Ben Miller, Asong Tambo, Sushobhan Ghosh, Sudarshan Nagesh, Ye Yuan, Yueyu Hu, Junru Wu, Wenhan Yang, Xiaoshuai Zhang, Jiaying Liu, Zhangyang Wang, Hwann-Tzong Chen, Tzu-Wei Huang, Wen-Chi Chin, Yi-Chun Li, Mahmoud Lababidi, Charles Otto, Walter J. Scheirer

    Abstract: What is the current state-of-the-art for image restoration and enhancement applied to degraded images acquired under less than ideal circumstances? Can the application of such algorithms as a pre-processing step to improve image interpretability for manual analysis or automatic visual recognition to classify scene content? While there have been important advances in the area of computational photo… ▽ More

    Submitted 19 February, 2020; v1 submitted 27 January, 2019; originally announced January 2019.

    Comments: CVPR Prize Challenge: http://www.ug2challenge.org

  11. arXiv:1901.07878  [pdf, other

    cs.LG cs.CL cs.CV cs.IR stat.ML

    "Is this an example image?" -- Predicting the Relative Abstractness Level of Image and Text

    Authors: Christian Otto, Sebastian Holzki, Ralph Ewerth

    Abstract: Successful multimodal search and retrieval requires the automatic understanding of semantic cross-modal relations, which, however, is still an open research problem. Previous work has suggested the metrics cross-modal mutual information and semantic correlation to model and predict cross-modal semantic relations of image and text. In this paper, we present an approach to predict the (cross-modal)… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: 14 pages, 6 figures, accepted at ECIR2019

  12. arXiv:1806.07309  [pdf, other

    cs.DL cs.IR

    Recommending Scientific Videos based on Metadata Enrichment using Linked Open Data

    Authors: Justyna Medrek, Christian Otto, Ralph Ewerth

    Abstract: The amount of available videos in the Web has significantly increased not only for entertainment etc., but also to convey educational or scientific information in an effective way. There are several web portals that offer access to the latter kind of video material. One of them is the TIB AV-Portal of the Leibniz Information Centre for Science and Technology (TIB), which hosts scientific and educa… ▽ More

    Submitted 3 April, 2019; v1 submitted 19 June, 2018; originally announced June 2018.

    Comments: 6 pages, 2 figures

    MSC Class: 68U35

  13. Face Clustering: Representation and Pairwise Constraints

    Authors: Yichun Shi, Charles Otto, Anil K. Jain

    Abstract: Clustering face images according to their identity has two important applications: (i) grouping a collection of face images when no external labels are associated with images, and (ii) indexing for efficient large scale face retrieval. The clustering problem is composed of two key parts: face representation and choice of similarity for grouping faces. We first propose a representation based on Res… ▽ More

    Submitted 26 July, 2018; v1 submitted 15 June, 2017; originally announced June 2017.

    Comments: This second version is the same as TIFS version. Some experiment results are different from v1 because we correct the protocols

    Journal ref: IEEE Transactions on Information Forensics and Security ( Volume: 13, Issue: 7, July 2018 )

  14. arXiv:1604.00989  [pdf, other

    cs.CV

    Clustering Millions of Faces by Identity

    Authors: Charles Otto, Dayong Wang, Anil K. Jain

    Abstract: In this work, we attempt to address the following problem: Given a large number of unlabeled face images, cluster them into the individual identities present in this data. We consider this a relevant problem in different application scenarios ranging from social media to law enforcement. In large-scale scenarios the number of faces in the collection can be of the order of hundreds of million, whil… ▽ More

    Submitted 4 April, 2016; originally announced April 2016.

    Report number: MSU-CSE-16-3

  15. arXiv:1507.07242  [pdf, ps, other

    cs.CV

    Face Search at Scale: 80 Million Gallery

    Authors: Dayong Wang, Charles Otto, Anil K. Jain

    Abstract: Due to the prevalence of social media websites, one challenge facing computer vision researchers is to devise methods to process and search for persons of interest among the billions of shared photos on these websites. Facebook revealed in a 2013 white paper that its users have uploaded more than 250 billion photos, and are uploading 350 million new photos each day. Due to this humongous amount of… ▽ More

    Submitted 28 July, 2015; v1 submitted 26 July, 2015; originally announced July 2015.

    Comments: 14 pages, 16 figures

    Report number: MSU TECHNICAL REPORT MSU-CSE-15-11, JULY 24, 2015