Daily Arxiv

This page organizes papers related to artificial intelligence published around the world.
This page is summarized using Google Gemini and is operated on a non-profit basis.
The copyright of the paper belongs to the author and the relevant institution. When sharing, simply cite the source.

The Transparent Earth: A Multimodal Foundation Model for the Earth's Subsurface

Created by
  • Haebom

Author

Arnab Mazumder, Javier E. Santos, Noah Hobbs, Mohamed Mehana, Daniel O'Malley

Outline

This paper presents "Transparent Earth," a Transformer-based architecture for reconstructing subsurface properties from heterogeneous datasets with varying sparsity, resolution, and modality (e.g., stress angle, mantle temperature, and plate type). Each modality represents a different type of observation, and the model integrates modality encodings derived from descriptions of each modality via a text embedding model along with the positional encoding of the observations. This design allows the model to be extended to any number of modalities, simplifying the addition of new modalities not initially considered. Currently, eight modalities are included, including orientation, categorical classes, and continuous features such as temperature and thickness. This feature supports in-context learning, allowing predictions to be generated without input or using an arbitrary number of additional observations from a random subset of modalities. On validation data, this reduced stress angle prediction errors by more than threefold. The proposed architecture is scalable, demonstrating that performance improves with increasing parameters. These developments make Transparent Earth an early baseline model of Earth's subsurface, ultimately aiming to predict subsurface properties anywhere on Earth.

Takeaways, Limitations

Takeaways:
Integrating heterogeneous geophysical data from various modalities to improve the accuracy of subsurface property prediction.
Transformer-based architecture ensures extensibility and ease of adding new modalities.
Supports context-based learning, enabling flexible predictions for various input conditions.
A new basic model for predicting Earth's subsurface properties is presented.
Limitations:
Currently, only eight modalities are included, and integration and performance evaluation of more modalities are needed.
Further validation of the model's generalization performance and applicability to real-world subsurface data is needed.
Analysis of the model's computational cost and training time is needed.
Further research is needed on the model's robustness to data imbalances and errors.
👍