Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

Exchangeability in Neural Network and its Application to Dynamic Pruning

Created by
  • Haebom

Author

Pu (Luke), Yi, Tianlang Chen, Yifan Yang, Sara Achour

Outline

ExPrune is a general dynamic pruning optimization technique that enables partial computation of multi-grain rates for each input, without changing the model architecture or learning algorithm. It leverages the statistical property of exchangeability to identify relationships between some model parameters and intermediate values, and dynamically evaluates the network based on partial network evaluations to make pruning decisions. ExPrune has been applied to computer vision, graph models, and language models, demonstrating minimal computational effort and accuracy loss. It can also be used in conjunction with static-size pruning.

Takeaways, Limitations

Takeaways:
Applicable without modifying model architecture or learning algorithm.
Generalizable to a variety of model architectures and problem domains.
Reduced computational load (10.98% - 27.16%) and minimal accuracy loss (up to 1%).
Combined with static pruning, additional computational overhead reduction (10.24% - 14.39%) can be achieved.
Limitations:
The specific Limitations is not specified in this paper. (However, due to the nature of dynamic pruning, runtime overhead may occur, and additional computational costs may be incurred due to exchangeability analysis.)
👍