Turnaev A., Apanovich Z.

This paper describes a pipeline for extracting the author’s terms and definitions from mathematical texts. We use two models: one detects mathematical formulas so that they can be removed from the text as noise, and the other converts formula images into LaTeX so that the removed formulas can be restored. Experimental data show that noise clearing is...
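
The remove-then-restore idea can be illustrated with a minimal Python sketch. It is not the paper's implementation: the function names `mask_formulas` and `restore_formulas`, the `[FORMULA_i]` placeholder format, and the representation of detected formulas as character-offset spans are all assumptions made for illustration. The spans stand in for the output of the formula detection model, and the list of LaTeX strings stands in for the output of the image-to-LaTeX model.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # hypothetical (start, end) character offsets, end exclusive


def mask_formulas(text: str, spans: List[Span]) -> Tuple[str, List[str]]:
    """Replace each detected formula span with a numbered placeholder.

    Assumes the spans do not overlap. Returns the cleaned text and the
    removed fragments in left-to-right order.
    """
    pieces: List[str] = []
    removed: List[str] = []
    cursor = 0
    for index, (start, end) in enumerate(sorted(spans)):
        pieces.append(text[cursor:start])    # text before the formula
        pieces.append(f"[FORMULA_{index}]")  # placeholder for the formula
        removed.append(text[start:end])      # keep the removed fragment
        cursor = end
    pieces.append(text[cursor:])             # text after the last formula
    return "".join(pieces), removed


def restore_formulas(text: str, latex: List[str]) -> str:
    """Put LaTeX formulas back in place of the numbered placeholders."""
    for index, formula in enumerate(latex):
        text = text.replace(f"[FORMULA_{index}]", f"${formula}$")
    return text
```

In this sketch, term and definition extraction would run on the masked text, and `restore_formulas` would be applied to the extracted fragments afterwards. The closing bracket in the placeholder keeps `[FORMULA_1]` from matching inside `[FORMULA_10]`.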