Result extraction from searchable PDF

Manish Yadav; Harshit Virkar; Ishan Tipnis; Rohan Gaikwad; Namita Pulgam; Kamlesh Nenwani

doi:XX.XXX/IJARIIT-V4I2-1971

This paper is published in Volume-4, Issue-2, 2018

Paper Details
Abstract & PDF

Area

Searchable PDF

Author

Manish Yadav, Harshit Virkar, Ishan Tipnis, Rohan Gaikwad, Namita Pulgam, Kamlesh Nenwani

Org/Univ

Ramrao Adik Institute of Technology, Navi Mumbai, Maharashtra, India

Pub. Date

17 April, 2018

Paper ID

V4I2-1971

Publisher

IJARIIT

Edition

Volume-4, Issue-2, 2018

Keywords

XML, Searchable PDF

Citations

IEEE
Manish Yadav, Harshit Virkar, Ishan Tipnis, Rohan Gaikwad, Namita Pulgam, Kamlesh Nenwani. Result extraction from searchable PDF, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.

APA
Manish Yadav, Harshit Virkar, Ishan Tipnis, Rohan Gaikwad, Namita Pulgam, Kamlesh Nenwani (2018). Result extraction from searchable PDF. International Journal of Advance Research, Ideas and Innovations in Technology, 4(2) www.IJARIIT.com.

MLA
Manish Yadav, Harshit Virkar, Ishan Tipnis, Rohan Gaikwad, Namita Pulgam, Kamlesh Nenwani. "Result extraction from searchable PDF." International Journal of Advance Research, Ideas and Innovations in Technology 4.2 (2018). www.IJARIIT.com.

Give proper credits, use Citation.

Abstract

Digital documents present knowledge in most areas of study, exchanging and communicating information in a portable way. These digital repositories help in providing efficient retrieval of the stored data thus making it an important tool. Systematic extraction of data from digital document helps in mitigating the tedious work of manual data entry and facilitates effective analysis of generated structured data. The proposed system aims to extract a student's final result from the digital copy of the result sheet (PDF file) and then storing it in a centralized database which reduces the tiresome manual work needed to store records of each and every student. It provides a novel method of detecting text which is organized in tabular format and retaining the structural information after text recognition with good accuracy

All content is copyright protected.