Dans ce document, nous présentons EasyEdit2, a framework designed to enable
plug-and-play adjustability for controlling Large Language Model (LLM)
comportements. EasyEdit2 prend en charge un large éventail d'interventions lors des tests,
y compris la sécurité, sentiment, personnalité, modèles de raisonnement, factualité, et
language features. Contrairement à son prédécesseur, EasyEdit2 features a new
architecture specifically designed for seamless model steering. It comprises
key modules such as the steering vector generator and the steering vector
applier, which enable automatic generation and application of steering vectors
to influence the model’s behavior without modifying its parameters. One of the
main advantages of EasyEdit2 is its ease of use-users do not need extensive
technical knowledge. With just a single example, they can effectively guide and
adjust the model’s responses, making precise control both accessible and
efficient. Empirically, we report model steering performance across different
LLM, demonstrating the effectiveness of these techniques. We have released the
source code on GitHub at https://github.com/zjunlp/EasyEdit along with a
demonstration notebook. En outre, we provide a demo video at
https://zjunlp.github.io/project/EasyEdit2/video for a quick introduction.
Cet article explore les excursions dans le temps et leurs implications.
Télécharger PDF:



