Skip to main content

Showing 1–9 of 9 results for author: Ioffe, S

  1. arXiv:2211.09981  [pdf, other

    cs.LG cs.AI stat.ML

    Weighted Ensemble Self-Supervised Learning

    Authors: Yangjun Ruan, Saurabh Singh, Warren Morningstar, Alexander A. Alemi, Sergey Ioffe, Ian Fischer, Joshua V. Dillon

    Abstract: Ensembling has proven to be a powerful technique for boosting model performance, uncertainty estimation, and robustness in supervised learning. Advances in self-supervised learning (SSL) enable leveraging large unlabeled corpora for state-of-the-art few-shot and supervised learning performance. In this paper, we explore how ensemble methods can improve recent SSL techniques by developing a framewo… ▽ More

    Submitted 9 April, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted by ICLR 2023

  2. arXiv:1808.00447  [pdf, other

    cs.CV

    Towards a Semantic Perceptual Image Metric

    Authors: Troy Chinen, Johannes Ballé, Chunhui Gu, Sung Jin Hwang, Sergey Ioffe, Nick Johnston, Thomas Leung, David Minnen, Sean O'Malley, Charles Rosenberg, George Toderici

    Abstract: We present a full reference, perceptual image metric based on VGG-16, an artificial neural network trained on object classification. We fit the metric to a new database based on 140k unique images annotated with ground truth by human raters who received minimal instruction. The resulting metric shows competitive performance on TID 2013, a database widely used to assess image quality assessments me… ▽ More

    Submitted 1 August, 2018; originally announced August 2018.

  3. arXiv:1703.07464  [pdf, other

    cs.CV

    No Fuss Distance Metric Learning using Proxies

    Authors: Yair Movshovitz-Attias, Alexander Toshev, Thomas K. Leung, Sergey Ioffe, Saurabh Singh

    Abstract: We address the problem of distance metric learning (DML), defined as learning a distance consistent with a notion of semantic similarity. Traditionally, for this problem supervision is expressed in the form of sets of points that follow an ordinal relationship -- an anchor point $x$ is similar to a set of positive points $Y$, and dissimilar to a set of negative points $Z$, and a loss defined over… ▽ More

    Submitted 1 August, 2017; v1 submitted 21 March, 2017; originally announced March 2017.

    Comments: To be presented in ICCV 2017

  4. arXiv:1702.03275  [pdf, ps, other

    cs.LG

    Batch Renormalization: Towards Reducing Minibatch Dependence in Batch-Normalized Models

    Authors: Sergey Ioffe

    Abstract: Batch Normalization is quite effective at accelerating and improving the training of deep models. However, its effectiveness diminishes when the training minibatches are small, or do not consist of independent samples. We hypothesize that this is due to the dependence of model layer inputs on all the examples in the minibatch, and different activations being produced between training and inference… ▽ More

    Submitted 30 March, 2017; v1 submitted 10 February, 2017; originally announced February 2017.

  5. arXiv:1602.07261  [pdf, other

    cs.CV

    Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning

    Authors: Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, Alex Alemi

    Abstract: Very deep convolutional networks have been central to the largest advances in image recognition performance in recent years. One example is the Inception architecture that has been shown to achieve very good performance at relatively low computational cost. Recently, the introduction of residual connections in conjunction with a more traditional architecture has yielded state-of-the-art performanc… ▽ More

    Submitted 23 August, 2016; v1 submitted 23 February, 2016; originally announced February 2016.

  6. arXiv:1512.00567  [pdf, other

    cs.CV

    Rethinking the Inception Architecture for Computer Vision

    Authors: Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, Zbigniew Wojna

    Abstract: Convolutional networks are at the core of most state-of-the-art computer vision solutions for a wide variety of tasks. Since 2014 very deep convolutional networks started to become mainstream, yielding substantial gains in various benchmarks. Although increased model size and computational cost tend to translate to immediate quality gains for most tasks (as long as enough labeled data is provided… ▽ More

    Submitted 11 December, 2015; v1 submitted 1 December, 2015; originally announced December 2015.

  7. arXiv:1502.03167  [pdf, ps, other

    cs.LG

    Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

    Authors: Sergey Ioffe, Christian Szegedy

    Abstract: Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covar… ▽ More

    Submitted 2 March, 2015; v1 submitted 10 February, 2015; originally announced February 2015.

  8. arXiv:1412.1441  [pdf, other

    cs.CV

    Scalable, High-Quality Object Detection

    Authors: Christian Szegedy, Scott Reed, Dumitru Erhan, Dragomir Anguelov, Sergey Ioffe

    Abstract: Current high-quality object detection approaches use the scheme of salience-based object proposal methods followed by post-classification using deep convolutional features. This spurred recent research in improving object proposal methods. However, domain agnostic proposal generation has the principal drawback that the proposals come unranked or with very weak ranking, making it hard to trade-off… ▽ More

    Submitted 8 December, 2015; v1 submitted 3 December, 2014; originally announced December 2014.

  9. arXiv:1312.4894  [pdf, other

    cs.CV

    Deep Convolutional Ranking for Multilabel Image Annotation

    Authors: Yunchao Gong, Yangqing Jia, Thomas Leung, Alexander Toshev, Sergey Ioffe

    Abstract: Multilabel image annotation is one of the most important challenges in computer vision with many real-world applications. While existing work usually use conventional visual features for multilabel annotation, features based on Deep Neural Networks have shown potential to significantly boost performance. In this work, we propose to leverage the advantage of such features and analyze key components… ▽ More

    Submitted 14 April, 2014; v1 submitted 17 December, 2013; originally announced December 2013.