Модель рекомендательной системы на основе технических событий

Kirill I.  Pashigorev; Andrei O.  Reznikov

Kirill I. Pashigorev Head of the Department, SberData, PJSC Sberbank, Moscow, Russia https://orcid.org/0009-0008-3478-4874
Andrei O. Reznikov Chief Development Engineer, SberData, PJSC Sberbank, Moscow, Russia https://orcid.org/0009-0001-2406-7619

Keywords: recommendation system, matrix factorization, prompt engineering, LLM, AI agents

Abstract

Recommendation systems are widely used in the commercial field. The algorithms and architectures of recommendation systems are similar in various fields of application and have proven their effectiveness. Recommendations are based on the user’s profile, the manner of his behavior on various IT (Information Technology) resources, as well as on similar users. At the same time, the use of recommendation systems in specialized areas is not widespread. Technology divisions are a promising new area of application for recommendation systems, and IT experts themselves will be the users. The purpose of this article is to consider a combination of a recommendation system, machine learning (ML) and LLM (Large Language Model) and to design these tools in a single system. Data volumes are currently measured in petabytes (10¹⁵ bytes) and exabytes (10¹⁸ bytes). In order to process even technical information (metadata/technodata) from the surrounding IT landscape, from the IT systems used by experts, AI (Artificial Intelligence) agents are needed. This article provides a literature review regarding the use of recommendation systems in combination with LLM applications, and suggests an application architecture model that generates human-readable news from technical event logs. The system is designed for a group of users who work with big data (ML engineers, data analysts, and data researchers). It is a combination of recommendation system technologies, LLM, and machine learning models. The article also provides the first results of the research that was carried out.

Downloads

Download data is not yet available.

References

Microsoft (2024) What are AI agents? Available at: https://learn.microsoft.com/ru-ru/azure/cloud-adoption-framework/innovate/best-practices/conversational-ai (accessed 20 July 2024).

Sber (2024) A platform for launching autonomous AI agents is presented. Available at: https://ai.sber.ru/en/post/predstavlena-platforma-dlya-zapuska-avtonomnyh-ai-agentov (accessed 20 July 2024).

Shah A., Pasha D., Zadeh E., Konur S. (2022) Automated log analysis and anomaly detection using machine learning. Frontiers in Artificial Intelligence and Applications, vol. 358: Fuzzy Systems and Data Mining, pp. 137–147. https://doi.org/10.3233/FAIA220378

Chen Z., Liu J., Gu W., et al. (2021) Experience report: Deep learning-based system log analysis for anomaly detection. arXiv:2107.05908. https://doi.org/10.48550/arXiv.2107.05908

Mokshanov M.V. (2024) The use of artificial intelligence in data analysis: an overview of the current state and future directions. Universum: technical sciences: electronic scientific journal, no. 5(122) (in Russian). https://doi.org/10.32743/UniTech.2024.122.5.17513

Eremin O.Y. (2023) Methods of implementation of hybrid recommendation systems. E-Scio, no. 3(78) (in Russian).

Kurennykh A.E., Sudakov V.A. (2022) Approach to the development of hybrid recommendation systems. Bulletin of Science and Practice, vol. 8, no. 11 (in Russian).

Völske M., Potthast M., Syed S., Stein B. (2017) TL;DR: Mining Reddit to Learn Automatic Summarization. Proceedings of the Workshop on New Frontiers in Summarization, Copenhagen, Denmark, 2017, pp. 59–63. Association for Computational Linguistics. https://doi.org/10.18653/v1/W17-4508

Pu H., Yang X., Li J., Guo R. (2024) AutoRepo: A general framework for multimodal LLM-based automated construction reporting. Expert Systems with Applications, vol. 255, part B, article 124601. https://doi.org/10.1016/j.eswa.2024.124601

Sivakumar M., Belle A.B., Shan J., Shahandashti K.K. (2024) Prompting GPT–4 to support automatic safety case generation. Expert Systems with Applications, vol. 255, part C, article 124653. https://doi.org/10.1016/j.eswa.2024.124653

Kim S., Kang H., Choi S., et al. (2024) Large Language Models meet Collaborative Filtering: An efficient all-round LLM-based recommender system. arXiv:2404.11343. https://doi.org/10.48550/arXiv.2404.11343

Shu Y., Zhang H., Gu H., et al. (2023) RAH! RecSys-Assistant-Human: A human-centered recommendation framework with LLM agents. arXiv:2308.09904. https://doi.org/10.48550/arXiv.2308.09904

Vaswani A., Shazeer N., Parmar N., et al. (2017) Attention is all you need. arXiv:1706.03762. https://doi.org/10.48550/arXiv.1706.03762

Morozevich E.S., Korotkov V.S., Kuznetsova E.A. (2022) Development of a model for the formation of individual educational trajectories using machine learning methods. Business Informatics, vol. 16, no. 2, pp. 21–35. https://doi.org/10.17323/2587-814Х.2022.2.21.35

Palchunov D.E., Yakobson A.A. (2024) Development of an intelligent assistant for the selection of goods in the process of dialogue with the user. Business Informatics, vol. 18, no. 1, pp. 7–21. https://doi.org/10.17323/2587-814X.2024.1.7.21

Amenitsky A.V., Rukhovich I.V., Amenitskaya L.A., et al. (2024) Side effects of hallucinations of artificial intelligence. Science, innovation, education: current issues and modern aspects, pp. 224–235. Penza, 2024 (in Russian).

Strömqvist Z. (2018) Matrix factorization in recommender systems: How sensitive are matrix factorization models to sparsity? Uppsala University Publications. Available at: https://uu.diva-portal.org/smash/get/diva2:1214390/FULLTEXT01.pdf (accessed 22 July 2024).

Moisyuk-Dranko P.A., Revotyuk M.P. (2020) Methods of matrix factorization for recommendation systems. Proceedings of the international scientific conference Information technologies and systems 2020 (ITS 2020), pp. 193–194. Minsk: BGUIR (in Russian). Available at: https://libeldoc.bsuir.by/bitstream/123456789/41339/1/Moysyuk_Dranko_Metody.pdf (accessed 22 July 2024).

Kuznetsov I.A. (2019) Methods and algorithms of machine learning for preprocessing and classification of weakly structured text data in scientific recommendation systems. Moscow: MEPhI (in Russian). Available at: https://ds.mephi.ru/documents/90/Кузнецов_И_А_Текст_диссертации.pdf (accessed 22 July 2024).

Yuan M., Lin H.-T., Boyd-Graber J. (2020) Cold-start active learning through self-supervised language modeling. arXiv:2010.09535. https://doi.org/10.48550/arXiv.2010.09535

Liemohn M.W., Shane A.D., Azari A.R., et al. (2021) RMSE is not enough: Guidelines to robust data-model comparisons for magnetospheric physics. Journal of Atmospheric and Solar-Terrestrial Physics, vol. 218, article 105624. https://doi.org/10.1016/j.jastp.2021.105624

Sber (2024) GigaChat API (in Russian). Available at: https://developers.sber.ru/portal/products/gigachat-api (accessed 22 July 2024).

Sber (2024) Prompt engineering (in Russian). Available at: https://developers.sber.ru/docs/ru/gigachat/prompt-engineering (accessed 26 December 2024).

Amenitsky A.V., Rukhovich I.V., Amenitskaya L.A., Amenitsky D.A. (2024) Causes, ethical problems and prevention of hallucination LLM. Intelligence. Collection of articles of the International Competition of Young Scientists. Penza, pp. 12–15(in Russian).

Sber (2024) TagMe Data Markup Platform (in Russian). Available at: https://developers.sber.ru/portal/products/tagme (accessed 09 December 2024).

Recommendation system model based on technical events

Abstract

Downloads

References