GUI grounding, which maps natural-language instructions to actionable UI elements, is a core capability of GUI agents. Prior work largely treats instructions as a static proxy for user intent, ...
You can also try the browser version at https://puppetstudio.app. Shift + Enter opens the command bar, from which you can open viewports. Navigate the bar with the ...