Страница публикации
Heuristic Algorithm for Recovering a Physical Structure of Spreadsheet Header
Авторы: Paramonov V., Shigarov A., Vetrova V., Mikhailov A.
Журнал: Advances in Intelligent Systems and Computing: Proc. of 40th Anniversary Intern. Conf. on Information Systems Architecture and Technology (ISAT 2019; Wrocław, Poland; 15-17 September 2019)
Том: 1050
Номер:
Год: 2020
Отчётный год: 2020
Издательство:
Местоположение издательства:
URL:
Проекты:
DOI: 10.1007/978-3-030-30440-9_14
Аннотация: Tables in electronic documents (spreadsheets) contain large volumes of useful information about different domains. Efficient extraction of data from document tables plays a crucial role in its further usage including analysis and integration. The visual or logical structure of table elements might differ from its physical structure. Such differences cause difficulties for automated table processing and understanding. Automated correction from physical form to visual allows to simplify tables processing operations. In this paper, we propose a heuristic approach for transformation of tables’ header cells. The main goal of the proposed approach is to provide an algorithm and software tool for recovering a physical structure of a spreadsheet header. The proposed approach is illustrated by application to the Statistical Abstract of the United States (SAUS) dataset.
Индексируется WOS: Q5
Индексируется Scopus: Нет
Индексируется УБС: Нет
Индексируется РИНЦ: Да
Индексируется ВАК: Нет
Индексируется CORE: Нет
Публикация в печати: 0