Tag: spatial memory

  • MIT Develops Long-Term Memory System That Lets Robots Answer Where You Left Your Keys

    MIT Develops Long-Term Memory System That Lets Robots Answer Where You Left Your Keys

    Imagine asking a robot, “Where did I leave my keys?” and getting an accurate, real-time answer. MIT researchers have created a new spatial memory framework called DAAAM (Describe Anything, Anywhere, Anytime, at Any Moment) that gives robots the ability to form and recall detailed mental models of large-scale environments. This breakthrough could transform how robots assist humans in factories, homes, and beyond.

    DAAAM combines advanced map representations with rich, language-based descriptions of objects a robot encounters as it explores. The system runs fast enough for mobile robots to use in real-time, answering complex queries in plain English with 21% to 53% higher accuracy than existing methods.

    “If we want robots to work side-by-side with humans and interact better with humans, they must speak the same language,” says Luca Carlone, associate professor in MIT’s Department of Aeronautics and Astronautics and lead researcher on the project. “The robot must be able to reason about time and space the same way humans do.”

    The framework bridges computer vision and robotic mapping. As a robot moves through an environment, DAAAM attaches detailed descriptions to objects—like noting that a red bicycle with a flat tire is in the bike rack outside the Stata Center. It stores this information in a 3D map-based representation arranged spatially, grouping objects into regions for efficient retrieval.

    To overcome the speed limitations of existing annotation techniques, DAAAM aggregates nearby objects and uses an optimization method to select key frames—images with the clearest view of multiple objects—allowing the system to describe several items in parallel. This speeds up computation tenfold, making real-time performance possible.

    “We annotate every object only once, so our framework can run in very large-scale environments in real time,” explains lead author Nicolas Gorlo, an MIT graduate student. “And by clustering objects into regions, it can answer a wide range of queries about objects and locations.”

    The researchers used a large language model (LLM) that calls on various tools to retrieve specific information quickly, reducing hallucinations. For example, if asked about a sculpture near an MIT campus building, DAAAM can use a semantic search tool to retrieve information based on the word “sculpture” or a location-based tool to find the building.

    Future work aims to expand DAAAM to capture significant events and incorporate confidence levels into responses. “Ultimately, we want to have robots that can help with any sort of tasks,” Gorlo says. “With this framework, we are trying to create the foundations to enable a generalist agent that can do anything you ask.”

    The research was presented at the Conference on Computer Vision and Pattern Recognition (CVPR) and funded by the U.S. Army Research Laboratory and the Office of Naval Research.

  • Exploring MIT’s Latest Machine Learning Breakthroughs in Robotics and AI

    Exploring MIT’s Latest Machine Learning Breakthroughs in Robotics and AI

    MIT continues to push the boundaries of machine learning with a series of groundbreaking research developments that span robotics, AI efficiency, material science, and more. Recent projects highlight the institute’s commitment to advancing both theory and practical applications.

    In robotics, researchers have developed a system that leverages large language models to help robots interpret vague instructions and focus on crucial details, improving task performance in homes and factories. Another innovation, known as Murakkab, optimizes multistep AI workflows, enhancing speed and energy efficiency. A new low-power chip enables tiny robots to generate 3D maps for navigation with minimal memory and power consumption.

    Beyond robotics, MIT scientists are modeling metal alloys at atomic scales to predict material properties more accurately, while game theory research shows that generalist algorithms can outperform specialists in certain scenarios. A novel spatial memory system allows robots to efficiently remember object locations, and a major update to random utility models—dubbed ‘the power of three’—improves preference prediction accuracy.

    Commercial applications include a startup using MIT technology for real-time product tracking in retail, manufacturing, and logistics. The NSF has renewed support for the MIT-led Institute for Artificial Intelligence and Fundamental Interactions (IAIFI), expanding its second phase with increased funding and broader ambitions. Researchers are also teaching AI agents to ask better questions using the game Battleship, and a new dataset called ChartNet enhances vision-language models’ ability to interpret charts.

    Additional milestones include MIT economist Whitney Newey receiving the Erwin Plein Nemmers Prize, new AI chemistry models designed by Connor Coley, and the appointment of Justin Solomon as associate dean of engineering education. MIT Open Learning launched a universal AI education program, making AI fluency accessible worldwide.

    These achievements underscore MIT’s role as a leader in machine learning, driving innovations that shape the future of technology and science.