Fast & Faithful Function Vectors

📅 2026-06-03
📈 Citations: 0
Influential: 0
📄 PDF

career value

203K/year
🤖 AI Summary
Function vectors (FVs) are employed in in-context learning to steer large language models, yet critical design choices in their construction—specifically attention head selection and steering mechanisms—have not been systematically investigated. This work proposes a gradient attribution method based on Layer-wise Relevance Propagation (LRP) to efficiently identify the attention heads most contributive to a given task and introduces a distributed steering strategy that replaces conventional aggregation approaches. By preserving steering efficiency while substantially improving model accuracy on instruction-following tasks, the proposed method offers a novel pathway for optimizing function vectors.
📝 Abstract
Function vectors (FVs) are task representations elicited during in-context learning that can be used to steer Large Language Models (LLMs). However, design choices in their formulation remain underexplored. In this work, we study the impact of varying FV definitions for instructions along two degrees of freedom: attention head selection and steering. For head selection, using gradient-based attributions with Layer-wise Relevance Propagation (LRP) substantially improves efficiency as well as accuracy. For FV steering, applying it in a distributed manner yields a higher accuracy compared to simple aggregation. Our code is publicly available.
Problem

Research questions and friction points this paper is trying to address.

Function Vectors
In-context Learning
Large Language Models
Attention Heads
Steering
Innovation

Methods, ideas, or system contributions that make the work stand out.

Function Vectors
Layer-wise Relevance Propagation
Attention Head Selection
Distributed Steering
In-Context Learning
🔎 Similar Papers
2024-01-16arXiv.orgCitations: 76