This paper emphasizes the importance of cinematic continuity and editing patterns in multi-shot generation and presents Cut2Next, a novel framework that overcomes the limitations of existing methods. Cut2Next generates the next shot using a hierarchical multi-prompting strategy based on the Diffusion Transformer (DiT). Hierarchical multi-prompting utilizes relational and individual prompts to specify the overall context, editing style between shots, and the content and cinematic properties of each shot. Structural innovations such as Context-Aware Condition Injection (CACI) and Hierarchical Attention Mask (HAM) integrate various cues without adding parameters. We build a large-scale RawCuts dataset and a refined CuratedCuts dataset, and present CutBench for evaluation. Experimental results demonstrate that Cut2Next performs well in visual consistency and text fidelity. Specifically, user studies have confirmed a strong preference for adherence to intended editing patterns and cinematic continuity, validating its ability to generate high-quality, narratively consistent next shots.