This paper presents a critical analysis of existing studies that applied existing psychological scales (e.g., the Behavioral Factor Index (BFI) and the Prescriptive Questionnaire (PVQ)) to measure personality traits and values in large-scale language models (LLMs). We address the lack of ecological validity of existing scales and compare and analyze the differences in results between existing and ecologically valid questionnaires. Our analysis reveals that existing scales (1) generate LLM profiles that are inconsistent with the psychological traits expressed in the context of user queries, (2) lack sufficient items for reliable measurement, (3) create the misconception that LLMs are a stable construct, and (4) generate inflated profiles in LLMs using persona prompts. Therefore, we urge caution in applying existing psychological scales to LLMs.