Reﬂexion: an autonomous agent with dynamic memory and self-reﬂection

Recent advancements in decision-making large language model (LLM) agents have demonstrated impressive performance across various benchmarks. However, these state-of-the-art approaches typically necessitate internal model ﬁne-tuning, external model ﬁne-tuning, or policy optimization over a deﬁned state space. Implementing these methods can prove challenging due to the scarcity of high-quality training data or the lack of well-deﬁned state space. Moreover, these agents do not possess certain qualities inherent to human decision-making processes, speciﬁcally the ability to learn from mistakes. Self-reﬂection allows humans to efﬁciently solve novel problems through a process of trial and error. Building on recent research, we propose Reﬂexion, an approach that endows an agent with dynamic memory and self-reﬂection capabilities to enhance its existing reasoning trace and task-speciﬁc action choice abilities. To assess our approach, we evaluate the agent’s ability to complete decision-making tasks in AlfWorld environments and knowledge-intensive, search-based question-and-answer tasks in HotPotQA environments. We observe success rates of 97% and 51%, respectively, and provide a discussion on the emergent property of self-reﬂection. Mastering decision-making and knowledge-intensive search tasks in novel environments is a crucial skill set for large-scale natural language agents. LLMs such as OpenAI’s GPT-3 (Brown et al., 2020), Google’s PaLM (Chowdhery et al., 2022), and others have achieved impressive results on various benchmarks (Kaplan et al., 2020; Rae et al., 2021; Nakano et al., 2021; Kojima et al., 2022; Ouyang et al., 2022; Chung et al., 2022).

Reﬂexion: an autonomous agent with dynamic memory and self-reﬂection

Previoujs Article

New in IntelliJ Rust for 2023.1 (Part 1)

Next Article

Wall Streets Transition from Excel to Python

Tags