Review Details
Reviewer has chosen to be Anonymous
Overall Impression: Weak
Suggested Decision: Undecided
Technical Quality of the paper: Bad
Presentation: Good
Reviewer`s confidence: Medium
Significance: Moderate significance
Background: Reasonable
Novelty: Limited novelty
Data availability: All used and produced data (if any) are FAIR and openly available in established data repositories
Length of the manuscript: The authors need to elaborate more on certain aspects and the manuscript should therefore be extended (if the general length limit is already reached, I urge the editor to allow for an exception)
Summary of paper in a few sentences:
The paper introduces the problem of Reinforcement Learning, provides a categorization of settings in which it can be (or has been) applied, and then provides a bibliographic overview of the related literature. The latter seems to be the main contribution of the paper. The bibliographic overview involves statistics such as how many algorithms are mentioned in the different papers, whether certain keywords (e.g., 'safety' or 'privacy') are mentioned in them, and various other aspects of the studied settings (e.g., whether the papers used data from user studies).
My main comment is that, while I found the content of the survey well-written, as a reader I would expect to obtain more in formation about the methods involved in the surveyed paper -- and in the current manuscript there is very little technical information.
My overall recommendation would be for the authors to expand the technical content of the paper.
One way to do this would be to :
(1) provide a more detailed description of the algorithms/methods mentioned in the paper (e.g., the ones mentioned in Table 4 or the methods that are cited but not discussed in Section 3), and:
(2) explain how they fall within the framework of approaches outlined in Section 2.
Reasons to accept:
* The paper provides a good introduction to RL.
* The paper gives a detailed account of the Systematic Literature Review process (Sections 4,5), to explain how papers the surveyed papers were collected.
* In terms of language and structure, the paper is written well.
Reasons to reject:
* The paper does not contain substantial technical content on the methods presented in the surveyed papers. It does provide a bibliographic exploration of the methods, but readers would be interested in more detailson methods from a survey article.
Nanopublication comments:
Further comments:
C1. Notation: in the reviewed manuscript, indices appear as upper-font and could be confused with exponents (e.g., r^{t+1}).
C2. The three categories of approaches described in Section 2 would be easier to understand via a running example. Specific examples of settings would also make it easier to understand the categorization of personalization settings in Section 3. Currently, the discussion is too abstract.
C3. In Section 2, there is mention of "users" and their "experiences", but such terms would fit better in a particular example. For the abstract description of the approaches, it would be better to make references to the terms that are already introduce to describe the setting -- e.g., 'agent', 'reward', 'environment'.
C4. The title of Section 4 should start with a capital letter.
C5. The quality of the figures should be improved. For example, in Fig. 4, the legend appears at an awkward location, place it at the top, middle, or bottom of the right side of the figure. Also, in Fig.5 the title of the plot overlaps with the plot.
2 Comments
Meta-Review by Editor
Submitted by Tobias Kuhn on
The paper provides an overview of reinforcement learning (RL) application for personalization across different application domains. After a brief summary of reinforcement learning in general and its usage in personalization tasks in particular, the literature review is constructed based two main components: a quantitative one and a qualitative one. The quantitative component mainly reports on statistics about the topic in the literature (methods, keywords, data mentioned in the related literature). The qualitative component is however less complete than the first one.
While the actual counts are a measure of how much attention the topic is receiving in the literature, such an overview would benefit from a more focused analysis of methods: a comparison of methods, summary of their advantages and disadvantag es and guidelines of their suitability for specific domains, a more technical machine learning approach of reviewing methods.
While this paper represents a nice effort into providing a well-written and comprehensive review in a field that is still new, I encourage the authors to expand the review by adding a more technical analysis of machine learning methods for RL in personalization. Therefore, the authors need to address this concern together with the remarks from the reviewers.
Izabela Moise (https://orcid.org/0000-0003-0370-6749)
Dear editors and reviewers,
Submitted by Floris den Hengst on
Dear editors and reviewers,
Thank you for your useful comments and consideration to accept this manuscript. We have addressed your comments and uploaded a revised manuscript. We have included a response letter detailing the changes.
Kindest regards,
Floris den Hengst