Breaking the Chain: Simple Word Swaps Expose LLMs’ Reasoning Limits
Key Findings: Large Language Models (LLMs) exhibit significant limitations in handling sequentially dependent operations. Our simple word-swap experiment reveals that most models struggle to perform correctly beyond two consecutive word swap operations, highlighting a critical weakness in their sequential reasoning.
Read more
Charlie Mnemonic – Update 5: Introducing Chain-of-Thought and Integrated Recall System
We’re excited to announce the fifth major update to Charlie Mnemonic, your open-source AI assistant with Long-Term Memory. This release brings groundbreaking features, including Chain-of-Thought reasoning and an integrated Recall system that allows you to effortlessly search and reference past.
Read more
Discover the LTM Benchmark at NeurIPS 2024
We are glad to announce that our paper “Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models” has been accepted to NeurIPS 2024, where we will have the opportunity to share our work and knowledge in relation to Long-Term Memory.
Read more
AI People: Announcing the next evolution of gaming AI NPCs
Today, GoodAI proudly introduces our new game: AI People Discover more on our official website: www.AIPeopleGame.com The Vision Our vision for AI People was ambitious but clear: to innovate within the gaming industry by making intelligent AI NPCs central to.
Read more
GoodAI LTM Benchmark v3 Released
A Standardization Release: The main purpose of the GoodAI LTM Benchmark has always been to serve as an objective measure for our progress in the development of agents capable of continual and life-long learning.
Read more
Marek Rosa: Solo creators enhanced by a legion of AI agents
Within the next five years, every individual will have the ability to employ AI agents from the cloud. These agents will effectively serve as our AI employees and assistants, aiding in tasks where we might typically enlist the services of.
Read more
Introducing Charlie Mnemonic: The First Personal Assistant with Long-Term Memory
As part of our research efforts in continual learning, we are open-sourcing Charlie Mnemonic, the first personal assistant (LLM agent) equipped with Long-Term Memory (LTM).
Read more
Introducing GoodAI LTM Benchmark
As part of our research efforts in the area of continual learning, we are open-sourcing a benchmark for testing agents’ ability to perform tasks involving the advanced use of the memory over very long conversations.
Read more
Embodied collectives: our progress to date
At GoodAI, we are building collaborative AI agents that enhance human capabilities and drive positive change at scale. Collective intelligence is the guiding principle behind our work.
Read more
LLM Agent taught to control drones
In this article, we demonstrate the learning process of one of our GoodAI LLM Agents being taught how to use an API to control a drone quadcopter.
Read more