Showing 1–18 of 18 results for author: Makarov, I

  1. arXiv:2407.02599  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    Meta 3D Gen

    Authors: Raphael Bensadoun, Tom Monnier, Yanir Kleiman, Filippos Kokkinos, Yawar Siddiqui, Mahendra Kariya, Omri Harosh, Roman Shapovalov, Benjamin Graham, Emilien Garreau, Animesh Karnewar, Ang Cao, Idan Azuri, Iurii Makarov, Eric-Tuan Le, Antoine Toisoul, David Novotny, Oran Gafni, Natalia Neverova, Andrea Vedaldi

    Abstract: We introduce Meta 3D Gen (3DGen), a new state-of-the-art, fast pipeline for text-to-3D asset generation. 3DGen offers 3D asset creation with high prompt fidelity and high-quality 3D shapes and textures in under a minute. It supports physically-based rendering (PBR), necessary for 3D asset relighting in real-world applications. Additionally, 3DGen supports generative retexturing of previously gener…

    Submitted 2 July, 2024; originally announced July 2024.

  2. arXiv:2406.17636  [pdf, other]

    cs.CV cs.AI

    Aligning Diffusion Models with Noise-Conditioned Perception

    Authors: Alexander Gambashidze, Anton Kulikov, Yuriy Sosnin, Ilya Makarov

    Abstract: Recent advancements in human preference optimization, initially developed for Language Models (LMs), have shown promise for text-to-image Diffusion Models, enhancing prompt alignment, visual appeal, and user preference. Unlike LMs, Diffusion Models typically optimize in pixel or VAE space, which does not align well with human perception, leading to slower and less efficient training during the pre…

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.03299  [pdf, other]

    cs.AI cs.CL

    The Good, the Bad, and the Hulk-like GPT: Analyzing Emotional Decisions of Large Language Models in Cooperation and Bargaining Games

    Authors: Mikhail Mozikov, Nikita Severin, Valeria Bodishtianu, Maria Glushanina, Mikhail Baklashkin, Andrey V. Savchenko, Ilya Makarov

    Abstract: Behavior study experiments are an important part of society modeling and understanding human interactions. In practice, many behavioral experiments encounter challenges related to internal and external validity, reproducibility, and social bias due to the complexity of social interactions and cooperation in human user studies. Recent advances in Large Language Models (LLMs) have provided researche…

    Submitted 5 June, 2024; originally announced June 2024.

    ACM Class: I.2.7; J.4

  4. arXiv:2404.00679  [pdf, other]

    cs.CV

    Weak-to-Strong 3D Object Detection with X-Ray Distillation

    Authors: Alexander Gambashidze, Aleksandr Dadukin, Maksim Golyadkin, Maria Razzhivina, Ilya Makarov

    Abstract: This paper addresses the critical challenges of sparsity and occlusion in LiDAR-based 3D object detection. Current methods often rely on supplementary modules or specific architectural designs, potentially limiting their applicability to new and evolving architectures. To our knowledge, we are the first to propose a versatile technique that seamlessly integrates into any existing framework for 3D…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Computer Vision and Pattern Recognition 2024

  5. arXiv:2403.13502  [pdf, other]

    cs.LG cs.CR eess.SY

    Adversarial Attacks and Defenses in Fault Detection and Diagnosis: A Comprehensive Benchmark on the Tennessee Eastman Process

    Authors: Vitaliy Pozdnyakov, Aleksandr Kovalenko, Ilya Makarov, Mikhail Drobyshevskiy, Kirill Lukyanov

    Abstract: Integrating machine learning into Automated Control Systems (ACS) enhances decision-making in industrial process management. One of the limitations to the widespread adoption of these technologies in industry is the vulnerability of neural networks to adversarial attacks. This study explores the threats in deploying deep learning models for fault diagnosis in ACS using the Tennessee Eastman Proces…

    Submitted 7 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    ACM Class: I.2.6; I.2.1

    Journal ref: IEEE Open Journal of the Industrial Electronics Society, 5 (2024) 428-440

  6. arXiv:2312.06467  [pdf, other]

    cs.LG eess.IV q-bio.NC

    Aligning brain functions boosts the decoding of visual semantics in novel subjects

    Authors: Alexis Thual, Yohann Benchetrit, Felix Geilert, Jérémy Rapin, Iurii Makarov, Hubert Banville, Jean-Rémi King

    Abstract: Deep learning is leading to major advances in the realm of brain decoding from functional Magnetic Resonance Imaging (fMRI). However, the large inter-subject variability in brain characteristics has limited most studies to train models on one subject at a time. Consequently, this approach hampers the training of deep learning models, which typically requires very large datasets. Here, we propose t…

    Submitted 11 December, 2023; originally announced December 2023.

  7. arXiv:2312.01092  [pdf, other]

    cs.SD cs.LG eess.AS

    A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-By-Humming Task

    Authors: Amantur Amatov, Dmitry Lamanov, Maksim Titov, Ivan Vovk, Ilya Makarov, Mikhail Kudinov

    Abstract: Query-by-Humming (QbH) is a task that involves finding the most relevant song based on a hummed or sung fragment. Despite recent successful commercial solutions, implementing QbH systems remains challenging due to the lack of high-quality datasets for training machine learning models. In this paper, we propose a deep learning data collection technique and introduce Covers and Hummings Aligned Data…

    Submitted 2 December, 2023; originally announced December 2023.

  8. arXiv:2311.06054  [pdf, ps, other]

    cs.CV cs.AI

    Refining the ONCE Benchmark with Hyperparameter Tuning

    Authors: Maksim Golyadkin, Alexander Gambashidze, Ildar Nurgaliev, Ilya Makarov

    Abstract: In response to the growing demand for 3D object detection in applications such as autonomous driving, robotics, and augmented reality, this work focuses on the evaluation of semi-supervised learning approaches for point cloud data. The point cloud representation provides reliable and consistent observations regardless of lighting conditions, thanks to advances in LiDAR sensors. Data annotation is…

    Submitted 10 November, 2023; originally announced November 2023.

  9. arXiv:2303.11898  [pdf, other]

    cs.CV cs.GR

    Real-time volumetric rendering of dynamic humans

    Authors: Ignacio Rocco, Iurii Makarov, Filippos Kokkinos, David Novotny, Benjamin Graham, Natalia Neverova, Andrea Vedaldi

    Abstract: We present a method for fast 3D reconstruction and real-time rendering of dynamic humans from monocular videos with accompanying parametric body fits. Our method can reconstruct a dynamic human in less than 3h using a single GPU, compared to recent state-of-the-art alternatives that take up to 72h. These speedups are obtained by using a lightweight deformation model solely based on linear blend sk…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Project page: https://real-time-humans.github.io/

  10. arXiv:2301.11280  [pdf, other]

    cs.CV cs.AI cs.LG

    Text-To-4D Dynamic Scene Generation

    Authors: Uriel Singer, Shelly Sheynin, Adam Polyak, Oron Ashual, Iurii Makarov, Filippos Kokkinos, Naman Goyal, Andrea Vedaldi, Devi Parikh, Justin Johnson, Yaniv Taigman

    Abstract: We present MAV3D (Make-A-Video3D), a method for generating three-dimensional dynamic scenes from text descriptions. Our approach uses a 4D dynamic Neural Radiance Field (NeRF), which is optimized for scene appearance, density, and motion consistency by querying a Text-to-Video (T2V) diffusion-based model. The dynamic video output generated from the provided text can be viewed from any camera locat…

    Submitted 26 January, 2023; originally announced January 2023.

  11. arXiv:2301.05029  [pdf, other]

    cs.LG

    Interaction models for remaining useful life estimation

    Authors: Dmitry Zhevnenko, Mikhail Kazantsev, Ilya Makarov

    Abstract: The paper deals with the problem of controlling the state of industrial devices according to the readings of their sensors. The current methods rely on one approach to feature extraction in which the prediction occurs. We proposed a technique to build a scalable model that combines multiple different feature extractor blocks. A new model based on sequential sensor space analysis achieves state-of-…

    Submitted 10 January, 2023; originally announced January 2023.

    Comments: submitted to Journal of Industrial Information Integration

    MSC Class: 68T07; ACM Class: C.3

  12. arXiv:2210.11164  [pdf, other]

    cs.AI cs.LG

    Graph Neural Networks with Trainable Adjacency Matrices for Fault Diagnosis on Multivariate Sensor Data

    Authors: Alexander Kovalenko, Vitaliy Pozdnyakov, Ilya Makarov

    Abstract: Timely detected anomalies in the chemical technological processes, as well as the earliest detection of the cause of the fault, significantly reduce the production cost in the industrial factories. Data on the state of the technological process and the operation of production equipment are received by a large number of different sensors. To better predict the behavior of the process and equipment,…

    Submitted 20 October, 2022; originally announced October 2022.

  13. arXiv:2209.01191  [pdf, other]

    physics.ao-ph cs.LG

    Long-term hail risk assessment with deep neural networks

    Authors: Ivan Lukyanenko, Mikhail Mozikov, Yury Maximov, Ilya Makarov

    Abstract: Hail risk assessment is necessary to estimate and reduce damage to crops, orchards, and infrastructure. Also, it helps to estimate and reduce consequent losses for businesses and, particularly, insurance companies. But hail forecasting is challenging. Data used for designing models for this purpose are three-dimensional geospatial time series. Hail is a very local event with respect to the resoluti…

    Submitted 31 August, 2022; originally announced September 2022.

  14. arXiv:2208.08879  [pdf, other]

    cs.LG cs.AI

    SensorSCAN: Self-Supervised Learning and Deep Clustering for Fault Diagnosis in Chemical Processes

    Authors: Maksim Golyadkin, Vitaliy Pozdnyakov, Leonid Zhukov, Ilya Makarov

    Abstract: Modern industrial facilities generate large volumes of raw sensor data during the production process. This data is used to monitor and control the processes and can be analyzed to detect and predict process abnormalities. Typically, the data has to be annotated by experts in order to be used in predictive modeling. However, manual annotation of large amounts of data can be difficult in industrial…

    Submitted 2 November, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

  15. Dealing with Sparse Rewards Using Graph Neural Networks

    Authors: Matvey Gerasyov, Ilya Makarov

    Abstract: Deep reinforcement learning in partially observable environments is a difficult task in itself, and can be further complicated by a sparse reward signal. Most tasks involving navigation in three-dimensional environments provide the agent with extremely limited information. Typically, the agent receives a visual observation input from the environment and is rewarded once at the end of the episode.…

    Submitted 15 October, 2023; v1 submitted 24 March, 2022; originally announced March 2022.

    Journal ref: IEEE Access, vol. 11, pp. 89180-89187, 2023

  16. arXiv:2108.08754  [pdf, other]

    cs.LG

    Temporal Graph Network Embedding with Causal Anonymous Walks Representations

    Authors: Ilya Makarov, Andrey Savchenko, Arseny Korovko, Leonid Sherstyuk, Nikita Severin, Aleksandr Mikheev, Dmitrii Babaev

    Abstract: Many tasks in graph machine learning, such as link prediction and node classification, are typically solved by using representation learning, in which each node or edge in the network is encoded via an embedding. Though there exist many network embeddings for static graphs, the task becomes much more complicated when the dynamic (i.e. temporal) network is analyzed. In this paper, we propose a…

    Submitted 24 August, 2021; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: 10 pages, 3 figures

  17. arXiv:2106.08048  [pdf, other]

    q-bio.PE cs.LG stat.AP

    Epidemic modelling of multiple virus strains: a case study of SARS-CoV-2 B.1.1.7 in Moscow

    Authors: Boris Tseytlin, Ilya Makarov

    Abstract: During a long-running pandemic a pathogen can mutate, producing new strains with different epidemiological parameters. Existing approaches to epidemic modelling only consider one virus strain. We have developed a modified SEIR model to simulate multiple virus strains within the same population. As a case study, we investigate the potential effects of SARS-CoV-2 strain B.1.1.7 on the city of Moscow…

    Submitted 16 June, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  18. arXiv:2106.08042  [pdf, other]

    cs.CV cs.IR cs.LG

    Hotel Recognition via Latent Image Embedding

    Authors: Boris Tseytlin, Ilya Makarov

    Abstract: We approach the problem of hotel recognition with deep metric learning. We overview the existing approaches and propose a modification to Contrastive loss called Contrastive-Triplet loss. We construct a robust pipeline for benchmarking metric learning models and perform experiments on Hotels-50K and CUB200 datasets. Contrastive-Triplet loss is shown to achieve better retrieval on Hotels-50K. We op…

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: IWANN 2021