Multi-Objective Hyperparameter Selection via Hypothesis Testing on Reliability Graphs

📅 2025-01-22

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

In high-stakes AI decision-making, jointly ensuring statistical reliability and optimizing performance during multi-objective hyperparameter selection remains challenging. Method: This paper proposes RG-PT—a novel framework that (i) introduces a Reliability Graph (RG), a directed acyclic graph modeling reliability dependencies among hyperparameters; and (ii) tightly integrates the Bradley–Terry pairwise comparison model with false discovery rate (FDR) control to enable parallel hypothesis testing across hyperparameters within the same reliability tier, overcoming efficiency and robustness limitations of conventional sequential Pareto testing. Results: Evaluated on multiple real-world tasks, RG-PT significantly improves Pareto front quality under identical reliability constraints, achieves higher calibration accuracy, boosts validation efficiency by 3.2×, and strictly controls FDR at the pre-specified threshold.

Technology Category

Application Category

📝 Abstract

In sensitive application domains, multi-objective hyperparameter selection can ensure the reliability of AI models prior to deployment, while optimizing auxiliary performance metrics. The state-of-the-art Pareto Testing (PT) method guarantees statistical reliability constraints by adopting a multiple hypothesis testing framework. In PT, hyperparameters are validated one at a time, following a data-driven order determined by expected reliability levels. This paper introduces a novel framework for multi-objective hyperparameter selection that captures the interdependencies among the reliability levels of different hyperparameter configurations using a directed acyclic graph (DAG), which is termed the reliability graph (RG). The RG is constructed based on prior information and data by using the Bradley-Terry model. The proposed approach, RG-based PT (RG-PT), leverages the RG to enable the efficient, parallel testing of multiple hyperparameters at the same reliability level. By integrating False Discovery Rate (FDR) control, RG-PT ensures robust statistical reliability guarantees and is shown via experiments across diverse domains to consistently yield superior solutions for multi-objective calibration problems.

Problem

Research questions and friction points this paper is trying to address.

AI decision-making

hyperparameter optimization

reliability and efficiency

Innovation

Methods, ideas, or system contributions that make the work stand out.

RG-PT Method

Bradley-Terry Model

False Discovery Rate (FDR)

🔎 Similar Papers

Hyperparameter Importance Analysis for Multi-Objective AutoML

2024-05-13European Conference on Artificial IntelligenceCitations: 1

2024-08-24arXiv.orgCitations: 0

Authors to Follow