This paper presents a novel framework for identifying duplicates in ancient documents. Specifically, we design an advanced duplicate detection framework that combines low-level keypoint matching and high-level text-centric content-based matching for one of the ancient documents, Oracle Bones (OB). Compared with existing content-based image retrieval and image matching methods, our model achieves similar recall performance and higher simplified average inverse rank score, while achieving much faster computational efficiency. Through practical applications, we have discovered more than 60 new pairs of OB duplicates that traditional experts have failed to detect for decades. We have released the code, model, and actual results on GitHub.