Comparative analysis of methodologies and approaches in recommender systems utilizing large language models (pdf)

Article PDF cannot be displayed. You can download it here:

https://link.springer.com/content/pdf/10.1007/s10462-025-11189-8.pdf

Comparative analysis of methodologies and approaches in recommender systems utilizing large language models

Artificial Intelligence Review (2025) 58:207 https://doi.org/10.1007/s10462-025-11189-8 Comparative analysis of methodologies and approaches in recommender systems utilizing large language models Salma S. Elmoghazy1 · Marwa A. Shouman2 · Hamdy K. Elminir1 · Gamal Eldin I. Selim2 Accepted: 6 March 2025 © The Author(s) 2025 Abstract Recommendation systems are indispensable technologies nowadays, as they enable analysis of the huge amount of information available on the internet, helping consumers to make decisions effectively. Ongoing efforts are essential to further develop and align them with the evolving demands of the modern era. In the last few years, large language models (LLMs) have made a huge leap in natural language processing. This advancement has directed researchers’ efforts towards employing these models in various fields, including recommender systems, to leverage the vast amount of data they were trained on. This paper presents a comparative study of a set of recent methodologies that adapt LLMs to recommendations. Throughout the discussed research work, we come up with the insight that LLMs offer significant benefits due to the amount of knowledge they possess and their powerful ability to represent textual data effectively, making them useful in common recommendation issues like cold-start. Also, the variety of fine-tuning and in-context learning techniques enables adaptation of LLMs to a wide range of recommendation tasks. We discussed issues addressed in the reviewed research work and the solutions proposed to enhance recommendation systems. To provide a clearer understanding, we propose taxonomies to categorize the reviewed work based on underlying techniques, involving the role of LLMs in recommendations, learning paradigms, and system structures. We explore datasets, recommendation- and language-related metrics commonly used in this domain. Finally, we analyzed findings in related work, highlighting possible strengths and limitations of using LLMs in recommender systems. Keywords Recommender systems · Large language models (LLMs) · Natural language processing (NLP) · User modeling · Collaborative filtering (CF) · Parameter-efficient fine-tuning (PEFT) Marwa A. Shouman, Hamdy K. Elminir, and Gamal Eldin I. Selim have contributed equally to this work. Extended author information available on the last page of the article 13 207 Page 2 of 36 S. S. Elmoghazy et al. 1 Introduction In today’s digital world, where information grows exponentially, recommender systems (RSs) are essential to assist users in finding content, services, and products that align with their preferences. From movie suggestions on Netflix to personalized news feeds on social media, recommender systems have reshaped users’ online experiences. Collaborative filtering is one of the earliest and most popular techniques of recommendation. It assumes that similar users tend to have similar interests (Su and Khoshgoftaar 2009). As an enhancement of collaborative filtering techniques, matrix factorization methods gained a lot of attention during the Netflix Prize competition, allowing efficient scalability for recommender systems (Koren et al. 2009). Another popular type of recommender systems is content-based filtering methods that depend on the similarity between the users past consumed items and the candidate items (Mooney and Roy 2000). Hybrid recommender systems were introduced to leverage the capabilities of both collaborative filtering and content-based filtering, through various implementation methods, to overcome the shortcomings of both approaches (B.Thorat et al. 2015). In conjunction with the remarkable developments in the field of deep learning, many sophisticated neural network-based recommendation techniques have evolved (Zhang et al. 2019; He et al. 2017; Fan et al. 2022), resulting in a huge advancement in recommendation systems. Following the advancements in deep learning-based recommender systems, the integration of natural language processing (NLP) techniques, such as recurrent neural networks-based (Hidasi et al. 2016) and attention-based (Kang and McAuley 2018) techniques, has emerged, providing better modelling for user’s historical sequences of behaviors. However, like any technology, recommender systems face various challenges such as data sparsity, cold-start, scalability, and lack of explainability problems (B.Thorat et al. 2015). Very recent research has started to explore the potential of the integration of large language models (LLMs) like OpenAI’s GPT (Radford et al. 2018) series, T5 (Raffel et al. 2020), BERT (Devlin et al. 2019), BART (Lewis et al. 2020) etc., to further mitigate these limitations. LLMs, trained on massive textual datasets, possess remarkable zero-shot and few-shot capabilities, which enable them to generate responses to tasks they were not explicitly trained on (Brown et al. 2020). This gives the potential to employ LLMs in recommendation systems to solve data sparsity and cold-start problems (Sanner et al. 2023). LLMs have proven their impressive capabilities in adapting to downstream tasks. The same principle applies to recommendation tasks by tuning LLMs using recommendation data (Bao et al. 2023). This flexibility opens the way towards developing unified or multi-tasking recommendation frameworks like P5 (Geng et al. 2022) and M6-Rec (Cui et al. 2022). The growing focus on efficient fine-tuning techniques has the potential to enable LLMs to scale up to large datasets of users and items (Han et al. 2024; Katlariwala and Gupta 2024). Unlike traditional recommender systems, which rely on implicit signals like clicks and interactions, which can be misleading, LLMs excel at capturing semantic information from textual data. This enables the use of descriptions, reviews, and other textual information associated with items and users, helping to obtain better representations for both users and items necessary for improving recommendation performance (Qiu et al. 2021). Additionally, LLM-based recommender systems can be game-changers when it comes to personalized recommendations (Katlariwala and Gupta 2024; Geng et al. 2022). The unique reasoning abilities of LLMs enable them to capture the implied user intent based on the context, which 13 Comparative analysis of methodologies and approaches in… Page 3 of 36 207 traditional CF models often overlook. This results in recommendations that are not only personalized but also contextually relevant, enhancing user satisfaction. Another advantage of LLMs is that they can generate textual explanations and summaries, which can be leveraged to improve the explainability of recommendations, making the recommender system more user-trusted (Li et al. 2023). In the later sections of this survey, we will further explore various research projects that leverage LLMs in recommendation tasks. The goal of this survey is to provide a comparative analysis of selected approaches that use LLMs in recommend (...truncated)