PianoVAM is a comprehensive piano performance dataset covering multiple modes (video, audio, MIDI, hand landmarks, fretboard notations, and rich metadata). It was recorded using a Disklavier piano during daily practice sessions by amateur pianists, capturing audio and MIDI data alongside synchronized top-view videos in a variety of real-world performance environments. Hand landmarks and fretboard notations were extracted using a pre-trained hand pose estimation model and a semi-automatic fretboard notation algorithm. We discuss challenges encountered during data collection and alignment across various modes, and a fretboard notation method based on video-extracted hand landmarks. We present benchmark results for audio-only and audiovisual piano transcription using the PianoVAM dataset, and discuss additional potential applications.