MediNotes is a cutting-edge generative AI framework that automatically generates SOAP notes based on medical conversations. It integrates Large-Scale Language Models (LLMs), Retrieval Augmentation Generation (RAG), and Automatic Speech Recognition (ASR) to process text and speech inputs in real time or from recorded audio to generate structured and contextually accurate medical notes. It incorporates advanced techniques such as Quantized Low-Rank Adaptation (QLoRA) and Parameter-Efficient Fine-Tuning (PEFT) for efficient model fine-tuning in resource-constrained environments. It also provides a query-based retrieval system, enabling healthcare providers and patients to quickly and accurately access relevant medical information. Evaluation results using the ACI-BENCH dataset demonstrate that MediNotes significantly improves the accuracy, efficiency, and usability of automated medical documentation, reducing the administrative burden on healthcare professionals and enhancing the quality of clinical workflows.