Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Reality Proxy: Fluid Interactions with Real-World Objects in MR via Abstract Representations

Created by
  • Haebom

Authors

Xiaoan Liu, Difan Jia, Xianhao Carton Liu, Mar Gonzalez-Franco, Chen Zhu-Tian

Outline

Interacting with real objects in mixed reality (MR) is often challenging when the objects are dense, distant, or partially occluded. This difficulty arises because interaction is performed directly on the physical objects, so input is tightly coupled to their physical constraints. The key insight of this paper is to decouple interaction from these constraints by introducing proxies, which are abstract representations of real objects. The Reality Proxy system seamlessly shifts the interaction target from the physical object to its proxy during selection. Beyond facilitating basic selection, Reality Proxy uses AI to enrich proxies with the semantic properties and hierarchical spatial relationships of the objects, enabling novel interactions such as skimming, attribute-based filtering, nested group navigation, and complex multi-object selection, all without a new gesture or menu system. The authors demonstrate Reality Proxy’s versatility in a variety of scenarios, including office information retrieval, large-scale spatial navigation, and multi-drone control. An expert evaluation indicates that the system is useful and usable, suggesting that proxy-based abstractions provide a powerful and generalizable interaction paradigm for future MR systems.
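To make the idea of a proxy concrete, the following is a minimal sketch of how an abstract representation with semantic attributes and a nested hierarchy could support attribute-based filtering and multi-object selection. It is an illustration only, not the authors' implementation: the `Proxy` class, its `attributes` and `children` fields, the `walk` and `select` methods, and the bookshelf data are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Proxy:
    """Hypothetical abstract stand-in for a real-world object or a group of objects."""
    name: str
    # Semantic properties, e.g. {"type": "book"} (assumed to be produced upstream).
    attributes: dict = field(default_factory=dict)
    # Nested proxies that model the hierarchical spatial relationship.
    children: list = field(default_factory=list)

    def walk(self):
        """Yield this proxy and every proxy nested beneath it, depth first."""
        yield self
        for child in self.children:
            yield from child.walk()

    def select(self, **criteria):
        """Return every proxy in this subtree whose attributes match all criteria."""
        return [
            p for p in self.walk()
            if all(p.attributes.get(key) == value for key, value in criteria.items())
        ]


# Hypothetical scene: a shelf containing two rows of books.
shelf = Proxy("shelf", {"type": "shelf"}, [
    Proxy("row-1", {"type": "row"}, [
        Proxy("HCI handbook", {"type": "book", "topic": "hci"}),
        Proxy("Graphics primer", {"type": "book", "topic": "graphics"}),
    ]),
    Proxy("row-2", {"type": "row"}, [
        Proxy("MR survey", {"type": "book", "topic": "hci"}),
    ]),
])

# Attribute-based filtering that selects several objects at once,
# regardless of where they sit physically.
hci_books = [p.name for p in shelf.select(type="book", topic="hci")]
print(hci_books)
```

Because selection operates on the proxy tree rather than on the physical objects, the same query works whether the books are far away, tightly packed, or partially hidden, which is the decoupling the summary describes.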

Takeaways, Limitations

Takeaways:
Presents a novel proxy-based interaction paradigm that addresses the difficulty of interacting with dense, distant, or partially occluded real-world objects.
Proxy-based abstraction enables new interactions such as complex multi-object selection, attribute-based filtering, and nested group navigation.
These interactions require no new gestures or menu systems.
Shows potential applications in areas such as office information retrieval, large-scale spatial navigation, and multi-drone control.
Limitations:
The paper does not state its limitations explicitly. Further research is needed to verify the system's performance and scalability in real environments and its applicability to different types of objects.
The performance and error handling of the AI algorithms used to create and manage proxies are not described in detail.
The user's cognitive load is not assessed.