Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Created by
  • Haebom

Author

Vaishnavh Nagarajan, Chen Henry Wu, Charles Ding, Aditi Raghunathan

Outline

This paper designs minimal algorithmic tasks that abstract open, real-world tasks to quantitatively measure the creative limitations of existing language models. These tasks require implicit, open, and probabilistic planning steps, either discovering new connections in an abstract knowledge graph (e.g., puns, analogies, research) or constructing new patterns (e.g., mathematical problems or the design of novel proteins). We empirically and conceptually argue against the myopia of next-token learning and argue that multi-token approaches, such as teacherless training and diffusion models, are superior in generating diverse and original outputs. Furthermore, we find that seed conditioning, which injects noise into the input layer to induce randomness without compromising consistency, is as effective as temperature sampling in the output layer, and under some conditions, even superior. In conclusion, this study provides a principled, minimal test environment for analyzing open-ended creative capabilities and offers new arguments beyond next-token learning and temperature sampling.

Takeaways, Limitations

Takeaways:
We present a new set of algorithmic tasks to assess open-ended creative abilities.
We demonstrate the limitations of token learning and demonstrate the superiority of a multi-token approach.
We propose that seed-conditioning the input layer is an effective way to simultaneously achieve randomness and consistency.
It presents new perspectives and directions for research on the creativity of language models.
Limitations:
The presented algorithmic task may not perfectly reflect the complex creative tasks of the real world.
Due to limitations in the dataset and model used, further validation of generalizability is required.
Accessibility may be limited as only part of the code is disclosed.
👍