This paper addresses the dichotomy between data confidentiality and data integration goals in deriving operational intelligence from organizational data repositories, as well as the challenges faced by Natural Language Processing (NLP) tools for the knowledge structures of specific domains such as operations and maintenance (O&M). We decompose the knowledge graph construction and knowledge extraction processes into functional components, including named entity recognition, coreference resolution, named entity association, and relationship extraction. We evaluate 16 NLP tools and conduct a comparative analysis with rapidly evolving large-scale language models (LLMs). We focus on O&M intelligence use cases for trusted applications in the aviation industry, developing a baseline dataset based on the public-domain datasets of the U.S. Federal Aviation Administration (FAA). We evaluate the zero-shot performance of NLP and LLM tools operating within a controlled, confidential environment that does not transmit data to third parties. Based on our observations of significant performance limitations, we discuss the challenges associated with trusted NLP and LLM tools and their readiness for widespread use in mission-critical industries such as aviation. We provide recommendations for improving reliability and provide open, managed datasets to support additional baseline testing and evaluation.