Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Value Profiles for Encoding Human Variation

Created by
  • Haebom

Author

Taylor Sorensen, Pushkar Mishra, Roma Patel, Michael Henry Tessler, Michiel Bakker, Georgina Evans, Iason Gabriel, Noah Goodman, Verena Rieser

Outline

Modeling human variability in rating tasks is crucial for personalization, multi-factorial model alignment, and computational social science. In this paper, we represent individuals using natural language value profiles, which are descriptions of underlying values compressed from contextual demonstrations, and propose a manipulable decoder model that estimates individual ratings from rater representations. We employ information-theoretic methods to measure the predictive information of rater representations and find that demonstrations contain the most information, followed by value profiles and then demographics. However, value profiles effectively compress useful information from demonstrations (preserving over 70% of information) and offer advantages in reviewability, interpretability, and manipulability. Furthermore, clustering value profiles to identify individuals with similar behavior better explains rater variability than demographic groupings, which are often the most predictive. Beyond test set performance, we demonstrate that decoder predictions vary with semantic profile differences, are well-calibrated, and can help account for instance-level discrepancies by simulating annotator populations. These results demonstrate that value profiles offer a novel and predictive way to explain individual variability beyond demographic or group information.

Takeaways, Limitations

Takeaways:
Value profiles provide useful information for predicting an individual's performance.
Value profiles effectively compress demonstrations, increasing information retention.
Value profiles offer advantages in terms of reviewability, interpretability, and manipulability.
Value profile clustering explains rater variability better than demographic grouping.
Decoder predictions vary across semantic profiles, helping to explain instance-level mismatches.
Limitations:
The specific Limitations is not specified in the paper. (Only the abstract is provided.)
👍