Skip to main navigation Skip to search Skip to main content

TUNING-FREE BILEVEL OPTIMIZATION: NEW ALGORITHMS AND CONVERGENCE ANALYSIS

  • Yifan Yang
  • , Hao Ban
  • , Minhui Huang
  • , Shiqian Ma
  • , Kaiyi Ji
  • SUNY Buffalo
  • Meta
  • Rice University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Bilevel optimization has recently attracted considerable attention due to its abundant applications in machine learning problems. However, existing methods rely on prior knowledge of problem parameters to determine stepsizes, resulting in significant effort in tuning stepsizes when these parameters are unknown. In this paper, we propose two novel tuning-free algorithms, D-TFBO and S-TFBO. D-TFBO employs a double-loop structure with stepsizes adaptively adjusted by the "inverse of cumulative gradient norms" strategy. S-TFBO features a simpler fully single-loop structure that updates three variables simultaneously with a theory-motivated joint design of adaptive stepsizes for all variables. We provide a comprehensive convergence analysis for both algorithms and show that D-TFBO and S-TFBO respectively require O(1/ϵ) and O(1/ϵ log4(1/ϵ)) iterations to find an ϵ-accurate stationary point, (nearly) matching their well-tuned counterparts using the information of problem parameters. Experiments on various problems show that our methods achieve performance comparable to existing well-tuned approaches, while being more robust to the selection of initial stepsizes. To the best of our knowledge, our methods are the first to completely eliminate the need for stepsize tuning, while achieving theoretical guarantees.

Original languageEnglish
Title of host publication13th International Conference on Learning Representations, ICLR 2025
PublisherInternational Conference on Learning Representations, ICLR
Pages17883-17933
Number of pages51
ISBN (Electronic)9798331320850
StatePublished - 2025
Event13th International Conference on Learning Representations, ICLR 2025 - Singapore, Singapore
Duration: Apr 24 2025Apr 28 2025

Publication series

Name13th International Conference on Learning Representations, ICLR 2025

Conference

Conference13th International Conference on Learning Representations, ICLR 2025
Country/TerritorySingapore
CitySingapore
Period04/24/2504/28/25

Fingerprint

Dive into the research topics of 'TUNING-FREE BILEVEL OPTIMIZATION: NEW ALGORITHMS AND CONVERGENCE ANALYSIS'. Together they form a unique fingerprint.

Cite this