Article information
2017, Volume 22, No. 4, p. 105-120
Yurchenko A.V.
On the concept of information-analytical system for supporting data intensive science
We consider the problem of developing infrastructure and specialized tools for handling large volumes of scientific data. The problem is becoming more pressing because of the exponential growth of data volumes and the emerging trends toward open science and open data. The purpose of this work is to formulate and describe the ideology, some conceptual aspects, and the technological issues of developing an information-analytical system for researchers who work with scientific data. The place of this system in the research process and its basic functional requirements are specified. The key principles, including the concept of data “enlightenment”, are described. The basic hardware and software blocks of the system are listed, and the current state of the IT infrastructure at the Institute of Computational Technologies SB RAS with regard to data intensive science is reported.
Keywords: science infrastructure, data intensive science, information system, scientific data, storing and processing data
Author(s): Yurchenko Andrey Vasilyevich PhD. Position: director Office: Federal Research Center for Information and Computational Technologies Address: 630090, Russia, Novosibirsk, ac. Lavrentyev Ave. 6
Phone Office: (383) 334-91-16 E-mail: yurchenko@ict.sbras.ru
Bibliography link: Yurchenko A.V. On the concept of information-analytical system for supporting data intensive science // Computational Technologies. 2017. V. 22. No. 4. P. 105-120