Advances in high-speed cameras and computing hardware make vision-based displacement extraction a practical means of acquiring vibration signals from the body of an ultra-high-voltage (UHV) transformer. This study proposes a contactless vibration monitoring technique for UHV transformers to meet the demand for non-destructive testing of their mechanical condition. The system consists of a high-speed camera fitted with a zoom lens and a laptop computer. At the algorithmic level, template matching is first combined with a diamond (rhombus) search strategy, which reduces the computation required to match each frame by a factor of roughly ten to several tens compared with a traditional global search. A surface fitting method is then applied to resolve displacement with sub-pixel accuracy. MATLAB simulations verify that the proposed technique measures sub-pixel displacements accurately, with the advantages of high detection accuracy, high speed and strong robustness. Field tests show that the method successfully captures the axial vibration signals of UHV transformer bushings, and that the Fourier spectrum of these signals clearly resolves characteristic harmonic components such as 100 Hz, 200 Hz and 300 Hz. Compared with traditional detection methods, the technique enables long-distance, full-field, real-time online monitoring of UHV transformers and provides a new approach to condition assessment of intelligent substation equipment.
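The two algorithmic steps described above can be illustrated with a minimal sketch. It is written in Python rather than MATLAB, uses only the standard library, and is not the authors' implementation. Several choices are assumptions made for illustration: the sum of squared differences (SSD) as the matching criterion, a simplified large-then-small diamond pattern, a least-squares quadratic surface fitted to the 3×3 SSD neighbourhood for the sub-pixel step, and a synthetic Gaussian blob standing in for a tracked target. All function names (`make_frame`, `ssd`, `diamond_search`, `subpixel_offset`, `measure_displacement`) are hypothetical.

```python
import math

# Search patterns as (dx, dy) offsets; the centre comes first so ties keep the centre.
LARGE = [(0, 0), (2, 0), (-2, 0), (0, 2), (0, -2), (1, 1), (1, -1), (-1, 1), (-1, -1)]
SMALL = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]

def make_frame(cx, cy, size=48, sigma=3.0):
    """Synthetic frame (assumption): a Gaussian blob centred at (cx, cy)."""
    return [[math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))
             for x in range(size)] for y in range(size)]

def ssd(frame, tmpl, x, y):
    """Sum of squared differences with the template's top-left corner at (x, y)."""
    total = 0.0
    for j, row in enumerate(tmpl):
        frow = frame[y + j]
        for i, t in enumerate(row):
            diff = frow[x + i] - t
            total += diff * diff
    return total

def diamond_search(frame, tmpl, x, y):
    """Descend on the SSD with the large diamond, then refine with the small one."""
    max_x = len(frame[0]) - len(tmpl[0])
    max_y = len(frame) - len(tmpl)
    cache = {}

    def cost(px, py):
        if not (0 <= px <= max_x and 0 <= py <= max_y):
            return math.inf  # candidate would fall outside the frame
        if (px, py) not in cache:
            cache[(px, py)] = ssd(frame, tmpl, px, py)
        return cache[(px, py)]

    for pattern in (LARGE, SMALL):
        while True:
            best = min(((x + dx, y + dy) for dx, dy in pattern), key=lambda p: cost(*p))
            if best == (x, y):
                break  # centre is the best point of this pattern
            x, y = best
    return x, y, cost, len(cache)

def subpixel_offset(cost, x, y):
    """Fit z = a + b*u + c*v + d*u^2 + e*u*v + f*v^2 to the 3x3 SSD values
    around (x, y) by least squares and return the location of its minimum."""
    pts = [(u, v, cost(x + u, y + v)) for v in (-1, 0, 1) for u in (-1, 0, 1)]
    if any(math.isinf(z) for _, _, z in pts):
        return 0.0, 0.0  # neighbourhood touches the frame border
    s0 = sum(z for _, _, z in pts)
    b = sum(u * z for u, _, z in pts) / 6
    c = sum(v * z for _, v, z in pts) / 6
    e = sum(u * v * z for u, v, z in pts) / 4
    d = sum(u * u * z for u, _, z in pts) / 2 - s0 / 3
    f = sum(v * v * z for _, v, z in pts) / 2 - s0 / 3
    det = 4 * d * f - e * e
    if det <= 0 or d <= 0:
        return 0.0, 0.0  # fitted surface has no well-defined minimum
    return (e * c - 2 * f * b) / det, (e * b - 2 * d * c) / det

def measure_displacement(ref, cur, x0, y0, n):
    """Track the n x n template cut from ref at (x0, y0) into cur."""
    tmpl = [row[x0:x0 + n] for row in ref[y0:y0 + n]]
    bx, by, cost, evals = diamond_search(cur, tmpl, x0, y0)
    sx, sy = subpixel_offset(cost, bx, by)
    return bx - x0 + sx, by - y0 + sy, evals

# Demo on synthetic data: the blob moves by (3.3, -1.4) pixels between frames.
ref = make_frame(20.0, 20.0)
cur = make_frame(23.3, 18.6)
dx, dy, evals = measure_displacement(ref, cur, 12, 12, 17)
exhaustive = (len(cur[0]) - 17 + 1) * (len(cur) - 17 + 1)
print(f"displacement = ({dx:.2f}, {dy:.2f}) px")
print(f"SSD evaluations: {evals} (exhaustive search would need {exhaustive})")
```

Because the diamond search only evaluates the matching cost along a descent path, the number of SSD evaluations stays far below that of an exhaustive scan over every candidate position, which is the kind of saving the abstract attributes to this step. The descent assumes the cost surface has a single basin around the true displacement, so inter-frame motion should remain small relative to the template size.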