Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Enhancing AI System Resiliency: Formulation and Guarantee for LSTM Resilience Based on Control Theory

작성자
  • Haebom

Author

Sota Yoshihara (Graduate School of Mathematics, Nagoya University), Ryosuke Yamamoto (AISIN SOFTWARE Co., Ltd), Hiroyuki Kusumoto (Graduate School of Mathematics, Nagoya University), Masanari Shimura (Graduate School of Mathematics, Nagoya University)

Outline

This paper proposes a novel theoretical framework for ensuring and evaluating the resilience of Long Short-Term Memory (LSTM) networks in control systems. We introduce "recovery time" as a new resilience metric to quantify the time required for LSTMs to return to a steady state after anomalous inputs. By mathematically improving the theory of incremental input-state stability ($\delta$ISS) for LSTMs, we derive a practical, data-independent upper bound on the recovery time. This upper bound enables resilience-aware learning. Experimental validation on a simple model demonstrates the effectiveness of our resilience estimation and control method, strengthening the foundation for rigorous quality assurance in safety-critical AI applications.

Takeaways, Limitations

Takeaways:
We present a new metric, “recovery time,” that can quantitatively evaluate the resilience of LSTM networks.
Enables resilience-aware learning by providing an upper bound on data-independent recovery time.
We provide a rigorous theoretical foundation for quality assurance of LSTM networks in safety-critical AI applications.
Limitations:
The effectiveness of the proposed method has been verified only through experiments on simple models, and further research is needed to determine its scalability to complex real-world systems.
Further analysis is needed to determine whether the mathematical refinement of the $\delta$ISS theory can be applied to all types and situations of LSTM networks.
A more in-depth analysis of the accuracy and conservativeness of the upper bound on recovery time is needed.
👍