Revisiting and advancing fast adversarial training through the lens of bi-level optimization

S Liu, Y Yao, J Jia, S Casper, N Baracaldo… - arXiv preprint arXiv …, 2024 - arxiv.org

We explore machine unlearning (MU) in the domain of large language models (LLMs),
referred to as LLM unlearning. This initiative aims to eliminate undesirable data influence …

被引用次数：49 相关文章所有 2 个版本

[PDF] mlr.press

On penalty-based bilevel gradient descent method

H Shen, T Chen - International Conference on Machine …, 2023 - proceedings.mlr.press

Bilevel optimization enjoys a wide range of applications in hyper-parameter optimization,
meta-learning and reinforcement learning. However, bilevel problems are difficult to solve …

被引用次数：31 相关文章所有 7 个版本

[PDF] neurips.cc

Advancing model pruning via bi-level optimization

Y Zhang, Y Yao, P Ram, P Zhao… - Advances in …, 2022 - proceedings.neurips.cc

The deployment constraints in practical applications necessitate the pruning of large-scale
deep learning models, ie, promoting their weight sparsity. As illustrated by the Lottery Ticket …

被引用次数：46 相关文章所有 8 个版本

[PDF] thecvf.com

Enhancing generalization of universal adversarial perturbation through gradient aggregation

X Liu, Y Zhong, Y Zhang, L Qin… - Proceedings of the …, 2023 - openaccess.thecvf.com

Deep neural networks are vulnerable to universal adversarial perturbation (UAP), an
instance-agnostic perturbation capable of fooling the target model for most samples …

被引用次数：13 相关文章所有 6 个版本

[PDF] thecvf.com

Robust mixture-of-expert training for convolutional neural networks

Y Zhang, R Cai, T Chen, G Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Sparsely-gated Mixture of Expert (MoE), an emerging deep model architecture, has
demonstrated a great promise to enable high-accuracy and ultra-efficient model inference …

被引用次数：9 相关文章所有 5 个版本

[PDF] thecvf.com

Text-visual prompting for efficient 2d temporal video grounding

Y Zhang, X Chen, J Jia, S Liu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

In this paper, we study the problem of temporal video grounding (TVG), which aims to predict
the starting/ending time points of moments described by a text sentence within a long …

被引用次数：18 相关文章所有 8 个版本

[PDF] arxiv.org

An Introduction to Bilevel Optimization: Foundations and applications in signal processing and machine learning

Y Zhang, P Khanduri, I Tsaknakis, Y Yao… - IEEE Signal …, 2024 - ieeexplore.ieee.org

Recently, bilevel optimization (BLO) has taken center stage in some very exciting
developments in the area of signal processing (SP) and machine learning (ML). Roughly …

被引用次数：18 相关文章所有 4 个版本

[PDF] neurips.cc

Simfbo: Towards simple, flexible and communication-efficient federated bilevel learning

Y Yang, P Xiao, K Ji - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Federated bilevel optimization (FBO) has shown great potential recently in machine learning
and edge computing due to the emerging nested optimization structure in meta-learning …

被引用次数：8 相关文章所有 7 个版本

[PDF] neurips.cc

Random sharpness-aware minimization

Y Liu, S Mai, M Cheng, X Chen… - Advances in Neural …, 2022 - proceedings.neurips.cc

Abstract Currently, Sharpness-Aware Minimization (SAM) is proposed to seek the
parameters that lie in a flat region to improve the generalization when training neural …

被引用次数：17 相关文章所有 5 个版本

[PDF] ieee.org

A survey on efficient methods for adversarial robustness

A Muhammad, SH Bae - IEEE Access, 2022 - ieeexplore.ieee.org

Deep learning has revolutionized computer vision with phenomenal success and
widespread applications. Despite impressive results in complex problems, neural networks …

被引用次数：20 相关文章所有 3 个版本