Skip to main content

Showing 1–30 of 30 results for author: Pinto, F

  1. arXiv:2407.20105  [pdf, other

    cs.LG cs.CR

    Strong Copyright Protection for Language Models via Adaptive Model Fusion

    Authors: Javier Abad, Konstantin Donhauser, Francesco Pinto, Fanny Yang

    Abstract: The risk of language models unintentionally reproducing copyrighted material from their training data has led to the development of various protective measures. In this paper, we propose model fusion as an effective solution to safeguard against copyright infringement. In particular, we introduce Copyright-Protecting Fusion (CP-Fuse), an algorithm that adaptively combines language models to minimi… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  2. arXiv:2407.08707  [pdf, other

    cs.CV cs.LG

    Extracting Training Data from Document-Based VQA Models

    Authors: Francesco Pinto, Nathalie Rauschmayr, Florian Tramèr, Philip Torr, Federico Tombari

    Abstract: Vision-Language Models (VLMs) have made remarkable progress in document-based Visual Question Answering (i.e., responding to queries about the contents of an input document provided as an image). In this work, we show these models can memorize responses for training samples and regurgitate them even when the relevant visual information has been removed. This includes Personal Identifiable Informat… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: ICML 2024

    ACM Class: I.2.7; I.2.10; K.4.1

  3. arXiv:2405.13922  [pdf, other

    cs.LG stat.ML

    Towards Certification of Uncertainty Calibration under Adversarial Attacks

    Authors: Cornelius Emde, Francesco Pinto, Thomas Lukasiewicz, Philip H. S. Torr, Adel Bibi

    Abstract: Since neural classifiers are known to be sensitive to adversarial perturbations that alter their accuracy, \textit{certification methods} have been developed to provide provable guarantees on the insensitivity of their predictions to such perturbations. Furthermore, in safety-critical applications, the frequentist interpretation of the confidence of a classifier (also known as model calibration) c… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 11 pages main paper, appendix included

  4. arXiv:2404.17882  [pdf, other

    cs.DS

    Directed Isoperimetry and Monotonicity Testing: A Dynamical Approach

    Authors: Renato Ferreira Pinto Jr

    Abstract: This paper explores the connection between classical isoperimetric inequalities, their directed analogues, and monotonicity testing. We study the setting of real-valued functions $f : [0,1]^d \to \mathbb{R}$ on the solid unit cube, where the goal is to test with respect to the $L^p$ distance. Our goals are twofold: to further understand the relationship between classical and directed isoperimetry,… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 83 pages

  5. arXiv:2403.13941  [pdf, ps, other

    cs.RO eess.SY

    Sensory Glove-Based Surgical Robot User Interface

    Authors: Leonardo Borgioli, Ki-Hwan Oh, Alberto Mangano, Alvaro Ducas, Luciano Ambrosini, Federico Pinto, Paula A Lopez, Jessica Cassiani, Milos Zefran, Liaohai Chen, Pier Cristoforo Giulianotti

    Abstract: Robotic surgery has reached a high level of maturity and has become an integral part of standard surgical care. However, existing surgeon consoles are bulky and take up valuable space in the operating room, present challenges for surgical team coordination, and their proprietary nature makes it difficult to take advantage of recent technological advances, especially in virtual and augmented realit… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 6 pages, 5 figures, 7 tables, submitted to International Conference on Intelligent Robots and Systems (IROS)2024

  6. arXiv:2403.12693  [pdf, other

    cs.CV

    As Firm As Their Foundations: Can open-sourced foundation models be used to create adversarial examples for downstream tasks?

    Authors: Anjun Hu, Jindong Gu, Francesco Pinto, Konstantinos Kamnitsas, Philip Torr

    Abstract: Foundation models pre-trained on web-scale vision-language data, such as CLIP, are widely used as cornerstones of powerful machine learning systems. While pre-training offers clear advantages for downstream learning, it also endows downstream models with shared adversarial vulnerabilities that can be easily identified through the open-sourced foundation model. In this work, we expose such vulnerab… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  7. arXiv:2311.14247  [pdf, other

    cs.DS

    Distribution Testing with a Confused Collector

    Authors: Renato Ferreira Pinto Jr., Nathaniel Harms

    Abstract: We are interested in testing properties of distributions with systematically mislabeled samples. Our goal is to make decisions about unknown probability distributions, using a sample that has been collected by a confused collector, such as a machine-learning classifier that has not learned to distinguish all elements of the domain. The confused collector holds an unknown clustering of the domain a… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

    Comments: 64 pages. Full version of paper to appear at ITCS 2024. arXiv admin note: text overlap with arXiv:2304.01374

  8. arXiv:2307.02193  [pdf, other

    cs.DS

    Directed Poincaré Inequalities and $L^1$ Monotonicity Testing of Lipschitz Functions

    Authors: Renato Ferreira Pinto Jr

    Abstract: We study the connection between directed isoperimetric inequalities and monotonicity testing. In recent years, this connection has unlocked breakthroughs for testing monotonicity of functions defined on discrete domains. Inspired the rich history of isoperimetric inequalities in continuous settings, we propose that studying the relationship between directed isoperimetry and monotonicity in such se… ▽ More

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 35 pages including 5 page appendix. To appear at RANDOM 2023

  9. arXiv:2306.03962  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    PILLAR: How to make semi-private learning more effective

    Authors: Francesco Pinto, Yaxi Hu, Fanny Yang, Amartya Sanyal

    Abstract: In Semi-Supervised Semi-Private (SP) learning, the learner has access to both public unlabelled and private labelled data. We propose a computationally efficient algorithm that, under mild assumptions on the data, provably achieves significantly lower private labelled sample complexity and can be efficiently run on real-world datasets. For this purpose, we leverage the features extracted by networ… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

  10. arXiv:2304.01374  [pdf, other

    cs.DS cs.CC

    Distribution Testing Under the Parity Trace

    Authors: Renato Ferreira Pinto Jr., Nathaniel Harms

    Abstract: Distribution testing is a fundamental statistical task with many applications, but we are interested in a variety of problems where systematic mislabelings of the sample prevent us from applying the existing theory. To apply distribution testing to these problems, we introduce distribution testing under the parity trace, where the algorithm receives an ordered sample $S$ that reveals only the leas… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 132 pages

  11. arXiv:2212.11237  [pdf, other

    cs.CV

    Not Just Pretty Pictures: Toward Interventional Data Augmentation Using Text-to-Image Generators

    Authors: Jianhao Yuan, Francesco Pinto, Adam Davies, Philip Torr

    Abstract: Neural image classifiers are known to undergo severe performance degradation when exposed to inputs that are sampled from environmental conditions that differ from their training data. Given the recent progress in Text-to-Image (T2I) generation, a natural question is how modern T2I generators can be used to simulate arbitrary interventions over such environmental factors in order to augment traini… ▽ More

    Submitted 3 June, 2024; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: 29 pages, 16 figures

    Journal ref: ICML 2024

  12. arXiv:2207.11347  [pdf, other

    cs.CV cs.LG

    An Impartial Take to the CNN vs Transformer Robustness Contest

    Authors: Francesco Pinto, Philip H. S. Torr, Puneet K. Dokania

    Abstract: Following the surge of popularity of Transformers in Computer Vision, several studies have attempted to determine whether they could be more robust to distribution shifts and provide better uncertainty estimates than Convolutional Neural Networks (CNNs). The almost unanimous conclusion is that they are, and it is often conjectured more or less explicitly that the reason of this supposed superiorit… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Journal ref: ECCV 2022

  13. arXiv:2207.06211  [pdf, other

    cs.CV

    Sample-dependent Adaptive Temperature Scaling for Improved Calibration

    Authors: Tom Joy, Francesco Pinto, Ser-Nam Lim, Philip H. S. Torr, Puneet K. Dokania

    Abstract: It is now well known that neural networks can be wrong with high confidence in their predictions, leading to poor calibration. The most common post-hoc approach to compensate for this is to perform temperature scaling, which adjusts the confidences of the predictions on any input by scaling the logits by a fixed value. Whilst this approach typically improves the average calibration across the whol… ▽ More

    Submitted 22 July, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

  14. arXiv:2206.14502  [pdf, other

    cs.LG cs.CV

    RegMixup: Mixup as a Regularizer Can Surprisingly Improve Accuracy and Out Distribution Robustness

    Authors: Francesco Pinto, Harry Yang, Ser-Nam Lim, Philip H. S. Torr, Puneet K. Dokania

    Abstract: We show that the effectiveness of the well celebrated Mixup [Zhang et al., 2018] can be further improved if instead of using it as the sole learning objective, it is utilized as an additional regularizer to the standard cross-entropy loss. This simple change not only provides much improved accuracy but also significantly improves the quality of the predictive uncertainty estimation of Mixup in mos… ▽ More

    Submitted 6 February, 2023; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: 22 pages, 18 figures

    ACM Class: I.4.0; I.2.6

  15. arXiv:2202.09271  [pdf, other

    cs.RO cs.AI

    Enhanced Behavioral Cloning with Environmental Losses for Self-Driving Vehicles

    Authors: Nelson Fernandez Pinto, Thomas Gilles

    Abstract: Learned path planners have attracted research interest due to their ability to model human driving behavior and rapid inference. Recent works on behavioral cloning show that simple imitation of expert observations is not sufficient to handle complex driving scenarios. Besides, predictions that land outside drivable areas can lead to potentially dangerous situations. This paper proposes a set of lo… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

  16. arXiv:2112.07086  [pdf, ps, other

    cs.IT

    Study of Linear Precoding and Power Allocation for Large Multiple-Antenna Systems with Coarsely Quantized Signals

    Authors: S. F. Pinto, R. C. de Lamare

    Abstract: This work studies coarse quantization-aware BD (${\scriptstyle\mathrm{CQA-BD}}$) and coarse quantization-aware RBD (${\scriptstyle\mathrm{CQA-RBD}}$) precoding algorithms for large-scale MU-MIMO systems with coarsely quantized signals and proposes the coarse-quantization most advantageous allocation strategy (${\scriptstyle\mathrm{CQA-MAAS}}$) power allocation algorithm for linearly-precoded MU-MI… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 7 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2107.03969

  17. arXiv:2106.11447  [pdf, other

    eess.IV cs.CV cs.LG

    Encoder-Decoder Architectures for Clinically Relevant Coronary Artery Segmentation

    Authors: João Lourenço Silva, Miguel Nobre Menezes, Tiago Rodrigues, Beatriz Silva, Fausto J. Pinto, Arlindo L. Oliveira

    Abstract: Coronary X-ray angiography is a crucial clinical procedure for the diagnosis and treatment of coronary artery disease, which accounts for roughly 16% of global deaths every year. However, the images acquired in these procedures have low resolution and poor contrast, making lesion detection and assessment challenging. Accurate coronary artery segmentation not only helps mitigate these problems, but… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  18. arXiv:2103.02969  [pdf, other

    eess.IV cs.CV cs.LG

    Automated Detection of Coronary Artery Stenosis in X-ray Angiography using Deep Neural Networks

    Authors: Dinis L. Rodrigues, Miguel Nobre Menezes, Fausto J. Pinto, Arlindo L. Oliveira

    Abstract: Coronary artery disease leading up to stenosis, the partial or total blocking of coronary arteries, is a severe condition that affects millions of patients each year. Automated identification and classification of stenosis severity from minimally invasive procedures would be of great clinical value, but existing methods do not match the accuracy of experienced cardiologists, due to the complexity… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 10 pages, 4 Figures

  19. arXiv:2012.12450  [pdf, other

    cs.LG stat.ML

    Towards Automated Satellite Conjunction Management with Bayesian Deep Learning

    Authors: Francesco Pinto, Giacomo Acciarini, Sascha Metz, Sarah Boufelja, Sylvester Kaczmarek, Klaus Merz, José A. Martinez-Heras, Francesca Letizia, Christopher Bridges, Atılım Güneş Baydin

    Abstract: After decades of space travel, low Earth orbit is a junkyard of discarded rocket bodies, dead satellites, and millions of pieces of debris from collisions and explosions. Objects in high enough altitudes do not re-enter and burn up in the atmosphere, but stay in orbit around Earth for a long time. With a speed of 28,000 km/h, collisions in these orbits can generate fragments and potentially trigge… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

    Comments: 7 pages, 2 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

    Journal ref: AI for Earth Sciences Workshop at NeurIPS 2020, Vancouver, Canada

  20. arXiv:2012.10260  [pdf, other

    cs.LG physics.app-ph

    Spacecraft Collision Risk Assessment with Probabilistic Programming

    Authors: Giacomo Acciarini, Francesco Pinto, Sascha Metz, Sarah Boufelja, Sylvester Kaczmarek, Klaus Merz, José A. Martinez-Heras, Francesca Letizia, Christopher Bridges, Atılım Güneş Baydin

    Abstract: Over 34,000 objects bigger than 10 cm in length are known to orbit Earth. Among them, only a small percentage are active satellites, while the rest of the population is made of dead satellites, rocket bodies, and debris that pose a collision threat to operational spacecraft. Furthermore, the predicted growth of the space sector and the planned launch of megaconstellations will add even more comple… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 8 pages, 2 figures

    MSC Class: 68T37; 68T05; 62P35 ACM Class: G.3; I.2.6; J.2

    Journal ref: Third Workshop on Machine Learning and the Physical Sciences (NeurIPS 2020), Vancouver, Canada

  21. arXiv:2012.03923  [pdf, ps, other

    cs.LG cs.CC cs.DS

    VC Dimension and Distribution-Free Sample-Based Testing

    Authors: Eric Blais, Renato Ferreira Pinto Jr., Nathaniel Harms

    Abstract: We consider the problem of determining which classes of functions can be tested more efficiently than they can be learned, in the distribution-free sample-based model that corresponds to the standard PAC learning setting. Our main result shows that while VC dimension by itself does not always provide tight bounds on the number of samples required to test a class of functions in this model, it can… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 44 pages

  22. arXiv:2007.07333  [pdf

    cs.SI cs.CY

    Individual Factors that Influence Effort and Contributions on Wikipedia

    Authors: Luiz F. Pinto, Carlos Denner dos Santos, Silvia Onoyama

    Abstract: In this work, we aim to analyze how attitude, self-efficacy, and altruism influence effort and active contributions on Wikipedia. We propose a new conceptual model based on the theory of planned behavior and findings from the literature on online communities. This model differs from other models that have been previously proposed by considering altruism in its various facets (identification, recip… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: Presented at AoM 2019 in Boston

  23. arXiv:2001.07209  [pdf, other

    cs.CL

    Text-based inference of moral sentiment change

    Authors: Jing Yi Xie, Renato Ferreira Pinto Jr., Graeme Hirst, Yang Xu

    Abstract: We present a text-based framework for investigating moral sentiment change of the public via longitudinal corpora. Our framework is based on the premise that language use can inform people's moral perception toward right or wrong, and we build our methodology by exploring moral biases learned from diachronic word embeddings. We demonstrate how a parameter-free model supports inference of historica… ▽ More

    Submitted 20 January, 2020; originally announced January 2020.

    Comments: In Proceedings of EMNLP 2019

  24. arXiv:2001.00278  [pdf, ps, other

    cs.LG stat.ML

    Motivic clustering schemes for directed graphs

    Authors: Facundo Mémoli, Guilherme Vituri F. Pinto

    Abstract: Motivated by the concept of network motifs we construct certain clustering methods (functors) which are parametrized by a given collection of motifs (or representers).

    Submitted 6 January, 2020; v1 submitted 1 January, 2020; originally announced January 2020.

    Comments: 23 pages

  25. arXiv:1908.04240  [pdf, other

    cs.LG stat.ML

    Automatic Model Monitoring for Data Streams

    Authors: Fábio Pinto, Marco O. P. Sampaio, Pedro Bizarro

    Abstract: Detecting concept drift is a well known problem that affects production systems. However, two important issues that are frequently not addressed in the literature are 1) the detection of drift when the labels are not immediately available; and 2) the automatic generation of explanations to identify possible causes for the drift. For example, a fraud detection model in online payments could show a… ▽ More

    Submitted 12 August, 2019; originally announced August 2019.

    Comments: 9 pages, 9 figures, 2 tables

    Journal ref: KDD-ADF-2019

  26. A Framework for Analyzing Fog-Cloud Computing Cooperation Applied to Information Processing of UAVs

    Authors: Milena F. Pinto, André L. M. Marcato, Aurélio G. Melo, Leonardo M. Honório, Cristina Urdiales

    Abstract: Unmanned aerial vehicles (UAVs) are a relatively new technology. Their application can often involve complex and unseen problems. For instance, they can work in a cooperative-based environment under the supervision of a ground station to speed up critical decision-making processes. However, the amount of information exchanged among the aircraft and ground station is limited by high distances, low… ▽ More

    Submitted 10 January, 2019; originally announced January 2019.

    Comments: Volume 2019, Article ID 7497924, 14 pages

    Journal ref: Wireless Communications and Mobile Computing, 2019

  27. arXiv:1805.00169  [pdf, ps, other

    eess.SP cs.LG stat.ML

    Multi-Step Knowledge-Aided Iterative ESPRIT for Direction Finding

    Authors: S. F. B. Pinto, R. C. de Lamare

    Abstract: In this work, we propose a subspace-based algorithm for DOA estimation which iteratively reduces the disturbance factors of the estimated data covariance matrix and incorporates prior knowledge which is gradually obtained on line. An analysis of the MSE of the reshaped data covariance matrix is carried out along with comparisons between computational complexities of the proposed and existing algor… ▽ More

    Submitted 30 April, 2018; originally announced May 2018.

    Comments: 7 figures. arXiv admin note: text overlap with arXiv:1703.10523

  28. arXiv:1706.09367  [pdf, other

    stat.ML cs.LG

    autoBagging: Learning to Rank Bagging Workflows with Metalearning

    Authors: Fábio Pinto, Vítor Cerqueira, Carlos Soares, João Mendes-Moreira

    Abstract: Machine Learning (ML) has been successfully applied to a wide range of domains and applications. One of the techniques behind most of these successful applications is Ensemble Learning (EL), the field of ML that gave birth to methods such as Random Forests or Boosting. The complexity of applying these techniques together with the market scarcity on ML experts, has created the need for systems that… ▽ More

    Submitted 28 June, 2017; originally announced June 2017.

  29. arXiv:1609.08583  [pdf, other

    cs.NI

    Survey of Inter-satellite Communication for Small Satellite Systems: Physical Layer to Network Layer View

    Authors: Radhika Radhakrishnan, William Edmonson, Fatemeh Afghah, R. Rodriguez-Osorio, Frank Pinto, Scott Burleigh

    Abstract: Small satellite systems enable whole new class of missions for navigation, communications, remote sensing and scientific research for both civilian and military purposes. As individual spacecraft are limited by the size, mass and power constraints, mass-produced small satellites in large constellations or clusters could be useful in many science missions such as gravity mapping, tracking of forest… ▽ More

    Submitted 27 September, 2016; v1 submitted 27 September, 2016; originally announced September 2016.

    Comments: 51 pages, 21 Figures, 11 Tables, accepted in IEEE Communications Surveys and Tutorials

  30. arXiv:1210.4919  [pdf

    cs.LG cs.CE stat.ML

    Latent Dirichlet Allocation Uncovers Spectral Characteristics of Drought Stressed Plants

    Authors: Mirwaes Wahabzada, Kristian Kersting, Christian Bauckhage, Christoph Roemer, Agim Ballvora, Francisco Pinto, Uwe Rascher, Jens Leon, Lutz Ploemer

    Abstract: Understanding the adaptation process of plants to drought stress is essential in improving management practices, breeding strategies as well as engineering viable crops for a sustainable agriculture in the coming decades. Hyper-spectral imaging provides a particularly promising approach to gain such understanding since it allows to discover non-destructively spectral characteristics of plants gove… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-852-862