Dataset on Scientometrics of Russian Scientists: eLibrary Case Study

Keywords: data discovery, data search, data reuse, research practices, open data policy, research communities

Abstract

A dataset of aggregated eLibrary data is presented. Parsing is performed using the Python language, the aim of the algorithm is to retrieve references with data on authors. The database is generated in Excel format, for federal universities and universities participating in the programmes "5-100", "Priority-2030" and in more detail for universities and organisations of the RAS structure of the Sverdlovsk region. The database is represented by a matrix: the rows contain information on the i-th author, the columns contain scientometric indicators. Despite the limitations inherent in the elibrary database, its data are relevant in the study of Russian HEIs; due to the stratification of regional HEI systems, it is quite relevant for Russia to cooperate with leading domestic HEIs, in this case, to assess the parameters of scientific activity on a national scale, we cannot ignore internal databases (e.g., eLibrary).

Downloads

Download data is not yet available.

References

Balatsky E.V., Ekimova N.A (2015) The Problem of Manipulation in the RSCI System. Vestnik UrFU. Series: Economics and Management, vol. 14, no 2, pp. 166–178 (In Russian). http://dx.doi.org/10.15826/vestnik.2015.14.2.021

Balatsky E.V., Yurevich M.A. (2016) The Misalignment of Russian Economists’ Scientometric Indicators in RSCI. Journal of the New Economic Association, no 2(30), pp.176–180 (In Russian). https://doi.org/10.31737/2221-2264-2016-30-2-8

Baranov A.N. (2012) Semantic Network as a Bibliometry Tool in the Humanities. Measuring Philosophy: On the Grounds and Criteria for Evaluating the Effectiveness of Philosophical and Socio-Humanitarian Research (ed. A.V. Rubtsov), Moscow: RAS Institute of Philosophy, pp. 108–117 (In Russian).

Beall J. (2012) Predatory Publishers Are Corrupting Open Access. Nature, vol. 489, iss. 7415, p. 179. https://doi.org/10.1038/489179a

Bednyi B.I., Sorokin Yu.M. (2012) On Indicators of Science Citation and Its Application. Vysshee obrazovanie v Rossii / Higher Education in Russia, no 3, pp. 17–28 (In Russian).

Borgman C.L. (2015) If Data Sharing Is the Answer, What Is the Question? ERCIM News, no 100, pp. 15–16. Available at: https://ercim-news.ercim.eu/images/stories/EN100/EN100-web.pdf (accessed 12.11.2024).

Borgman C.L. (2012) The Conundrum of Sharing Research Data. Journal of the American Society for Information Science and Technology, vol. 63, no 6, pp. 1059–1078. https://doi.org/10.1002/asi.22634

Fienberg S.E., Martin M.E., Straf M.L. (eds) (1985) Sharing Research Data. Washington, DC: National Academies Press. https://doi.org/10.17226/2033

Filippova I.N. (2022) Russian Science Citation Index: Problems and Prospects of Publication Activity. Surgut State Pedagogical University Bulletin, no 2 (77), pp. 113–121 (In Russian). https://doi.org/10.26105/SSPU.2022.77.2.010

Gene Ontology Consortium (2004) The Gene Ontology (GO) Database and Informatics Resource. Nucleic Acids Research, vol. 32, iss. suppl_1, pp. D258–D261. https://doi.org/10.1093/nar/gkh036

Gregory K., Groth P., Scharnhorst A., Wyatt S. (2020) Lost or Found? Discovering Data Needed for Research. Harvard Data Science Review, iss. 2(2). https://doi.org/10.1162/99608f92.e38165eb

Gregory K., Ninkov A., Ripp C., Roblin E., Peters I., Haustein S. (2023) Tracing Data: A Survey Investigating Disciplinary Differences in Data Citation. Quantitative Science Studies, vol. 4, no 3, pp. 622–649. https://doi.org/10.1162/qss_a_00264

Grinev A.V. (2019) Using Scientometrics to Estimate Publication Activity in Modern Russia. Vestnik Rossijskoj akademii nauk, vol. 89, no 10, pp. 993-1002 (In Russian). https://doi.org/10.31857/S0869-58738910993-1002

Hey T., Tansley S., Tolle K., Gray J. (eds) (2009) The Fourth Paradigm: Data-Intensive Scientific Discovery. Redmond, WA: Microsoft Research. Available at: https://www.microsoft.com/en-us/research/publication/fourth-paradigm-data-intensive-scientific-discovery/#!abstract (accessed 19 November 2024).

Kim Y., Yoon A. (2017) Scientists' Data Reuse Behaviors: A Multilevel Analysis. Journal of the Association for Information Science and Technology, vol. 68, no 12, pp. 2709–2719. https://doi.org/10.1002/asi.23892

Li K., Jiao C. (2021) The Data Paper as a Sociolinguistic Epistemic Object: A Content Analysis on the Rhetorical Moves Used in Data Paper Abstracts. Journal of the Association for Information Science and Technology, vol. 73, no 6, pp. 834–846. https://doi.org/10.1002/asi.24585

Markova Y.V., Shmatko N.A., Katchanov Y.L. (2016) Synchronous International Scientific Mobility in the Space of Affiliations: Evidence from Russia. SpringerPlus, vol. 5, April, Article no 480. https://doi.org/10.1186/s40064-016-2127-3

Mayernik M.S. (2011) Metadata Realities for Cyberinfrastructure: Data Authors as Metadata Creators (PhD Thesis), Los Angeles: University of California. https://doi.org/10.2139/ssrn.2042653

Melnik A.D., Sudakova A.E. (2023) Quality of Supervisor's Publication Profile as a Criterion for Effective Doctoral Training. Science Governance and Scientometrics, vol. 18, no 4, pp. 759–790 (In Russian). https://doi.org/10.33873/2686-6706.2023.18-4.759-790

Moscow Center for Continuing Mathematical Education (2011) The Game of Numbers, or How the Work of a Scientist Is Now Evaluated. Collection of Articles on Bibliometrics. Moscow: MCCME (In Russian).

Mouromtsev D.I., Lehmann J., Semerkhanov I.A., Navrotskiy M.A., Ermilov I.S. (2015) Study of Current Approaches for Web Publishing of Open Scientific Data. Scientific and Technical Journal of Information Technologies, Mechanics and Optics, vol. 15, no 6, pp. 1081–1087 (In Russian). https://doi.org/10.17586/2226-1494-2015-15-6-1081-1087

Nelson B. (2009) Data Sharing: Empty Archives. Nature, vol. 461, no 7261, pp. 160–163. https://doi.org/10.1038/461160a

Park H., Wolfram D. (2017) An Examination of Research Data Sharing and Reuse: Implications for Data Citation Practice. Scientometrics, vol. 111, no 1, pp. 443–461. https://doi.org/10.1007/s11192-017-2240-2

Patwardhan B., Nagarkar S., Gadre S.R., Lakhotia S.C., Katoch V.M., Moher D. (2018) A Critical Analysis of the ‘UGC-Approved List of Journals’. Current Science, vol. 114, no 6, pp. 1299–1303. https://doi.org/10.18520/cs/v114/i06/1299-1303

Quayle M., Greer M. (2014) Mapping the State of the Field of Social Psychology in Africa and Patterns of Collaboration between African and International Social Psychologists. International Journal of Psychology, vol. 49, no 6, pp. 498–502. https://doi.org/10.1002/ijop.12059

Sindin X. (2017) Secondary Data. The SAGE Encyclopedia of Communication Research Methods (ed. M. Allen), Thousand Oaks, CA: Sage, vol. 4, pp. 1578–1579. https://doi.org/10.4135/9781483381411

Smith L.C. (1981) Citation Analysis. Library Trends, vol. 30, no 1, pp. 83–106.

Sudakova A.E., Tarasyev A.A., Koksharov V.A. (2021) Trends in the Migration of Russian Scholars: The Regional Dimension. Terra Economicus, vol. 19, no 2, pp. 91–104. (In Russian). https://doi.org/10.18522/2073-6606-2021-19-2-91-104

Tatarova G.G. (2006) Methodological Trauma of a Sociologist. On the Issue of Knowledge Integration. Sotsiologicheskie Issledovaniia / Sociological Studies, no 9, pp. 3–12 (In Russian).

The Russian Committee of the UNESCO Information for All Program (2013) Sustainable Economics for a Digital Planet: Ensuring Long-Term Access to Digital Information: Final Report of the Blue Ribbon Task Force on Sustainable Digital Preservation and Access. Moscow: Interregional Library Cooperation Center (In Russian). Available at: https://ifapcom.ru/files/News/Images/2014/sust_econ.pdf (accessed 12.11.2024).

Thorne F.C. (1977) The Citation Index: Another Case of Spurious Validity. Journal of Clinical Psychology, vol. 33, no 4, pp. 1157–1161. https://doi.org/10.1002/1097-4679(197710)33:4<1157::aid-jclp2270330453>3.0.co;2-b

Toronto International Data Release Workshop Authors (2009) Prepublication Data Sharing. Nature, vol. 461, pp. 168–170. https://doi.org/10.1038/461168a

Tretyakova O.V. (2014) On the Issue of the Impact Factor of a Scientific Journal and Methods of Its Formation. Voprosy territorialʼnogo razvitiya, no 5 (15), pp. 1–9 (In Russian).

Wallis J.C., Rolando E., Borgman C.L. (2013) If We Share Data, Will Anyone Use Them? Data Sharing and Reuse in the Long Tail of Science and Technology. PLOS One, vol. 8, no 7, Article no e67332. https://doi.org/10.1371/journal.pone.0067332

Yurevich M.A., Erkina D.S., Tsapenko I.P. (2020) Measuring International Mobility of Russian Scientists: A Bibliometric Approach. World Economy and International Relations, vol. 64, no 9, pp. 53–62 (In Russian). https://doi.org/10.20542/0131-2227-2020-64-9-53-62

Zhang L., Sivertsen G. (2020) The New Research Assessment Reform in China and Its Implementation. Scholarly Assessment Reports, vol. 2, no 1, Article no 3. https://doi.org/10.29024/sar.15

Published
2025-04-01
How to Cite
SudakovaAnastasia E., and Agarkov Gavriil A. 2025. “Dataset on Scientometrics of Russian Scientists: ELibrary Case Study”. Voprosy Obrazovaniya / Educational Studies Moscow, no. 1 (April). https://doi.org/10.17323/vo-2025-21514.
Section
Datasets in Education