Microsoft’s new AI agent can control software and robots
The researchers' explanation of how "Set-of-Mark" and "Trace-of-Mark" work.
Credit: Microsoft Research
The Magma model introduces two technical components. Set-of-Mark identifies objects that can be manipulated in an environment by assigning numeric labels to interactive elements, such as clickable buttons in a UI or graspable objects in a robotic workspace. Trace-of-Mark learns movement patterns from video data. Microsoft says those features allow the model to complete tas...
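To make the Set-of-Mark idea concrete, here is a minimal Python sketch of numeric labeling. It is not Microsoft's code and does not use any Magma API: the `Element` class, the `assign_marks` and `center_of` helpers, and the sample screen are all hypothetical, and real systems would detect elements from pixels rather than receive them as a list.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Element:
    """Hypothetical stand-in for a detected UI element or object."""
    kind: str
    box: Tuple[int, int, int, int]  # (x, y, width, height)
    interactive: bool


def assign_marks(elements: List[Element]) -> Dict[int, Element]:
    """Give each interactive element a numeric mark, starting at 1."""
    interactive = [e for e in elements if e.interactive]
    return {mark: e for mark, e in enumerate(interactive, start=1)}


def center_of(mark: int, marks: Dict[int, Element]) -> Tuple[int, int]:
    """Turn a chosen mark back into a point to click or grasp."""
    x, y, w, h = marks[mark].box
    return (x + w // 2, y + h // 2)


# Assumed example screen: two interactive elements and one static label.
screen = [
    Element("button", (10, 10, 80, 20), True),
    Element("label", (10, 40, 80, 20), False),
    Element("text_field", (10, 70, 200, 30), True),
]

marks = assign_marks(screen)
for mark, element in marks.items():
    print(mark, element.kind, center_of(mark, marks))
```

The point of the labeling is that a model only has to name a mark, such as "2", instead of predicting raw coordinates; the lookup from mark to position happens outside the model.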
Read more at arstechnica.com