Article information

2009, Volume 14, No. 2, pp. 58-73

Bychkov I.V., Rugnikov G.M., Hmelnov A.E., Shigarov A.O.

A heuristic method of table detection in documents of various formats

The paper discusses a heuristic method for detecting statistical tables. The method takes metafiles as input, which allows it to be applied to documents of various formats. Table detection is organized as a bottom-up segmentation of the document page. The experimental evaluation provides evidence of the method's efficiency for a wide range of statistical tables.

Keywords: document analysis and recognition, information extraction, table extraction and processing

Author(s):
Bychkov Igor Vyacheslavovich
Dr., Academician of RAS, Professor
Position: Director
Office: Institute for System Dynamics and Control Theory of Siberian Branch of Russian Academy of Sciences
Address: 664033, Russia, Irkutsk, Lermontova st., 134
Phone Office: (3952) 45-30-61
E-mail: idstu@icc.ru
SPIN-code: 5816-7451

Rugnikov Gennady Mikhailovich
Dr., Senior Scientist
Position: Head of Department
Office: Institute for System Dynamics and Control Theory Siberian Branch of RAS, Irkutsk Scientific Center of Siberian Branch of Russian Academy of Sciences
Address: 664033, Russia, Irkutsk, Lermontova st., 134
Phone Office: (3952) 45-30-06
E-mail: rugnikov@icc.ru
SPIN-code: 2947-8443

Hmelnov Alexey Evgenievich
PhD, Associate Professor
Position: Head of Laboratory
Office: Matrosov Institute for System Dynamics and Control Theory of Siberian Branch of Russian Academy of Sciences
Address: 664033, Russia, Irkutsk, 134 Lermontov str.
Phone Office: (3952) 45-30-71
E-mail: hmelnov@icc.ru
SPIN-code: 8041-3667

Shigarov Alexei Olegovich
PhD
Position: Senior Research Scientist
Office: Institute for System Dynamics and Control Theory, Siberian Branch of RAS
Address: 664033, Russia, Irkutsk, 134 Lermontov str.
Phone Office: (3952) 45-31-02
E-mail: shigarov@icc.ru


Bibliography link:
Bychkov I.V., Rugnikov G.M., Hmelnov A.E., Shigarov A.O. A heuristic method of table detection in documents of various formats // Computational technologies. 2009. V. 14. No. 2. P. 58-73
ISSN 1560-7534
© 2024 FRC ICT