This paper addresses the complex manipulation problem of autonomous harvesting. Specifically, we present a simulation-based reinforcement learning (RL) framework to address occlusion and structural uncertainty (since every plant is unique). The goal is to rearrange stems and leaves to expose target fruits. We separate high-level motion planning from low-level flexible control to simplify sim2real transfer, ensuring that the learned policy generalizes to plants of various heights and shapes.