Article information
2009 , Volume 14, ¹ 2, p.58-73
Bychkov I.V., Rugnikov G.M., Hmelnov A.E., Shigarov A.O.
A heuristic method of table detection in documents of various formats
The paper discusses a heuristic method of statistical table detection, which uses meta-files as input, what allows one to apply it to the documents of various formats. In the method, the process of table detection is constructed as the bottom-up segmentation of the document's page. The experimental evaluation of the method has given evidence of its efficiency for a wide range of statistical tables.
[full text] Keywords: document analysis and recognition, information extraction, table extraction and processing
Author(s): Bychkov Igor Vyacheslavovich Dr. , Academician RAS, Professor Position: Director Office: Institute for System Dynamics and Control Theory of Siberian Branch of Russian Academy of Sciences Address: 664033, Russia, Irkutsk, Lermontova st., 134
Phone Office: (3952) 45-30-61 E-mail: idstu@icc.ru SPIN-code: 5816-7451Rugnikov Gennady Mikhailovich Dr. , Senior Scientist Position: Head of Departament Office: Institute for System Dynamics and Control Theory Siberian Branch of RAS, Irkutsk Scientific Center of Siberian Branch of Russian Academy of Sciences Address: 664033, Russia, Irkutsk, Lermontova st., 134
Phone Office: (3952) 45-30-06 E-mail: rugnikov@icc.ru SPIN-code: 2947-8443Hmelnov Alexey Evgenievich PhD. , Associate Professor Position: Head of Laboratory Office: Matrosov Institute for System Dynamics and Control Theory of Siberian Branch of Russian Academy of Sciences Address: 664033, Russia, Irkutsk, 134 Lermontov str.
Phone Office: (3952) 45-30-71 E-mail: hmelnov@icc.ru SPIN-code: 8041-3667Shigarov Alexei Olegovich PhD. Position: Senior Research Scientist Office: Institute for System Dynamics and Control Theory, Siberian Branch of RAS Address: 664033, Russia, Irkutsk, 134 Lermontov str.
Phone Office: (3952) 45-31-02 E-mail: shigarov@icc.ru
Bibliography link: Bychkov I.V., Rugnikov G.M., Hmelnov A.E., Shigarov A.O. A heuristic method of table detection in documents of various formats // Computational technologies. 2009. V. 14. ¹ 2. P. 58-73
|