Аватар-модель покупателя на сетях Колмогорова-Арнольда

Fedor V.  Krasnov; Fedor I.  Kurushin

Fedor V. Krasnov Research Center of LLC WILDBERRIES SK, Moscow, Russia https://orcid.org/0000-0002-9881-7371
Fedor I. Kurushin Research Center of LLC WILDBERRIES SK, Moscow, Russia https://orcid.org/0009-0007-5126-4507

Keywords: large language models, product search, search query recommendations, search query transformation, user intent determination, text analysis, machine learning, e-commerce

Abstract

The increasing pace of development of e-commerce continues to present new challenges in terms of personalizing product search and recommendations. Monolithic search and recommendation systems have become cumbersome and are unable to effectively address the need for a deeper understanding of users on electronic trading platforms (ETPs) despite having access to comprehensive information about their interests and purchase histories. Collaborative filtering mechanisms which are widely used suffer from a lack of diversity in offerings and a reduced capacity to surprise users. Additionally, the low frequency of recommendation updates and the replacement of “personalized” with “similar to others” concepts contribute to these issues. We have approached the resolution of these issues by developing a shopping assistant named “Ellochka” that is individual for each user of ETP. The digital avatar model of the user continually searches for relevant products based on their history of interaction with ETP. We were guided by the principle of independence – avatar models do not share information with each other. When a new user joins, they are assigned a unique avatar model that evolves independently. Each avatar has its own language to generate search queries. The level of complexity of each avatar can vary depending on the intensity of its interaction with ETP. Continued interaction with the avatar allows for tracking of optimal purchase conditions, reminding users of expiration dates and the need for re-purchasing frequently purchased items. Isolating the avatar allows it to be retrained after each event, without significantly impacting the overall search and recommendation system. The use of neural network architecture-based and Kolmogorov–Arnold networks in the avatar-model has led to improvements in the main indicators of search and recommendation effectiveness, namely, novelty and diversity.

Downloads

Download data is not yet available.

References

Butorin A.V., Murtazin D.G., Krasnov F.V. (2020) Method and system for predicting effective thicknesses in the interwell space when constructing a geological model based on the method of clustering spectral curves. Patent for invention RU 2718135 C1, 03/30/2020. Application No. 2019128334 dated 09/09/2019.

Krasnov F. (2023) Query understanding via Language Models based on transformers for e-commerce. International Journal of Open Information Technologies, vol. 11, no. 9, pp. 33–40 (in Russian).

Muennighoff N., Tazi N., Magne L., Reimers N. (2022) MTEB: Massive text embedding benchmark. arXiv:2210.07316. https://doi.org/10.48550/arXiv.2210.07316

Li P., Tuzhilin A. (2023) When variety seeking meets unexpectedness: Incorporating variety-seeking behaviors into design of unexpected recommender systems. Information Systems Research, vol. 35, no. 3. https://doi.org/10.1287/isre.2021.0053

Wang Y., Banerjee C., Chucri S., et al. (2024) Beyond item dissimilarities: Diversifying by intent in recommender systems. arXiv:2405.12327. https://doi.org/10.48550/arXiv.2405.12327

Castells P., Hurley N., Vargas S. (2022) Novelty and diversity in recommender systems. Recommender Systems Handbook (eds. F. Ricci, L. Rokach, B. Shapira). Springer, New York, NY, pp. 603–646. https://doi.org/10.1007/978-1-0716-2197-4_16

Ding Q., Liu Y., Miao C., et al. (2021) A hybrid bandit framework for diversified recommendation. Proceedings of the AAAI conference on artificial intelligence, vol. 35, no. 5, pp. 4036–4044. https://doi.org/10.1609/aaai.v35i5.16524

Li J., Wang M., Li J., et al. (2023) Text is all you need: Learning language representations for sequential recommendation. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 1258–1267. https:// doi.org/10.1145/3580305.3599519

Sun F., Liu J., Wu J., et al. (2019) BERT4Rec: Sequential recommendation with bidirectional encoder representations from transformer. Proceedings of the 28th ACM international conference on information and knowledge management, pp. 1441–1450. https://doi.org/10.1145/3357384.3357895

Klenitskiy A., Vasilev A. (2023) Turning dross into gold loss: is BERT4Rec really better than SASRec? Proceedings of the 17th ACM Conference on Recommender Systems, pp. 1120–1125.

Ji Z., Yu T., Xu Y., et al. (2023) Towards mitigating LLM hallucination via self reflection. Findings of the Association for Computational Linguistics: EMNLP 2023, pp. 1827–1843.

Egorov E.A., Rogachev A.I. (2023) Adaptive spectral normalization for generative models. Doklady Mathematics, vol. 108(suppl. 2), pp. S205–S214. https://doi.org/10.1134/S1064562423701089

Chang W.C., Jiang D., Yu H.F., et al. (2021) Extreme multi-label learning for semantic matching in product search. Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining, pp. 2643–2651.

Sennrich R., Haddow B., Birch A. (2016) Neural machine translation of rare words with subword units. Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pp. 1715–1725. https://doi.org/10.18653/v1/P16-1162

Kudo T., Richardson J. (2018) SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pp. 66–71.

Liu Z., Wang Y., Vaidya S., et al. (2024) KAN: Kolmogorov-Arnold networks. arXiv:2404.19756. https://doi.org/10.48550/arXiv.2404.19756

Vaca-Rubio C. J., Blanco L., Pereira R., Caus M. (2024) Kolmogorov-Arnold networks (KANs) for time series analysis. arXiv:2405.08790. https://doi.org/10.48550/arXiv.2405.08790

Ma X., Gong Y., He P., et al. (2023) Query rewriting for retrieval-augmented Large Language Models. arXiv:2305.14283. https://doi.org/10.48550/arXiv.2305.14283

Chen Z., Fan X., Ling Y. (2020) Pre-training for query rewriting in a spoken language understanding system. ICASSP 2020 – 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, pp. 7969–7973. https://doi.org/10.1109/ICASSP40776.2020.9053531

Bhandari M., Wang M., Poliannikov O., Shimizu K. (2023) RecQR: Using Recommendation Systems for Query Reformulation to correct unseen errors in spoken dialog systems. 17th ACM Conference on Recommender Systems (RecSys’23), Singapore, pp. 1019–1022.

Kim T.H., Neubauer M., Sfiligoi I., et al. (2004) The CDF central analysis farm. IEEE Transactions on Nuclear Science, vol. 51, no. 3, pp. 892–896. https://doi.org/10.1109/TNS.2004.829574

Kuleshov A.P. (2008) Cognitive technologies in adaptive models of complex objects. Informatsionnye Tekhnologii i Vychslitel'nye Sistemy, vol. 1, pp. 18–29.

Luo C., Lakshman V., Shrivastavaet A., et al. (2022) ROSE: Robust caches for Amazon product search. WWW '22: Companion Proceedings of the Web Conference 2022, pp. 89–93. https://doi.org/10.1145/3487553.3524213

A customer avatar model based on Kolmogorov–Arnold networks

Abstract

Downloads

References