UI Agent Collection a collection of algorithmic agents for user interfaces/interactions and program synthesis • 157 items • Updated 1 day ago • 17
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study Paper • 2403.03186 • Published Mar 5 • 5
MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents Paper • 2410.03450 • Published 9 days ago • 28
WaveUI Collection WaveUI is a collection of datasets and tools to improve UI object detection • 6 items • Updated Jul 31 • 9