Zhihao Luo
Also published as: ZhiHao Luo
2026
Generative Floor Plan Design with LLMs via Reinforcement Learning with Verifiable Rewards
Luis Lara | Aristides Milios | ZhiHao Luo | Aditya Sharma | Ge Ya Luo | Christopher Beckham | Florian Golemo | Christopher Pal
Findings of the Association for Computational Linguistics: ACL 2026
Luis Lara | Aristides Milios | ZhiHao Luo | Aditya Sharma | Ge Ya Luo | Christopher Beckham | Florian Golemo | Christopher Pal
Findings of the Association for Computational Linguistics: ACL 2026
An AI system for professional floor plan design must precisely control room dimensions and areas while respecting the desired connectivity between rooms and maintaining functional and aesthetic quality.Existing generative approaches focus primarily on respecting the requested connectivity between rooms, but do not support generating floor plans that respect numerical constraints. We introduce a text-based floor plan generation approach that fine-tunes a large language model (LLM) on real plans and then applies reinforcement learning with verifiable rewards (RLVR) to improve adherence to topological and numerical constraints while discouraging invalid or overlapping outputs.Furthermore, we design a set of constraint adherence metrics to systematically measure how generated floor plans align with user-defined constraints.Our model generates floor plans that satisfy user-defined connectivity and numerical constraints and outperforms existing methods on Realism, Compatibility, and Diversity metrics. Across all tasks, our approach achieves at least a 94% relative reduction in Compatibility compared with existing methods.Our results demonstrate that LLMs can effectively handle constraints in this setting, suggesting broader applications for text-based generative modeling.
NaviMaster: Learning a Unified Policy for GUI and Embodied Navigation Tasks
Zhihao Luo | Wentao Yan | Jingyu Gong | Min Wang | Zhizhong Zhang | Xuhong Wang | Yuan Xie | Xin Tan
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Zhihao Luo | Wentao Yan | Jingyu Gong | Min Wang | Zhizhong Zhang | Xuhong Wang | Yuan Xie | Xin Tan
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Recent advances in Graphical User Interface (GUI) and embodied navigation have driven progress, yet these domains have largely evolved in isolation, with disparate datasets and training paradigms. In this paper, we observe that both tasks can be formulated as Markov Decision Processes (MDP), suggesting a foundational principle for their unification. Hence, we present NaviMaster, the first unified agent capable of unifying GUI navigation and embodied navigation within a single framework. Specifically, NaviMaster (i) proposes a visual-target trajectory collection pipeline that generates trajectories for both GUI and embodied tasks using a single formulation. (ii) employs a unified reinforcement learning framework on the mix data to improve generalization. (iii) designs a novel distance-aware reward to ensure efficient learning from the trajectories. Through extensive experiments on out-of-domain benchmarks, NaviMaster is shown to outperform state-of-the-art agents in GUI navigation, spatial affordance prediction, and embodied navigation. Ablation studies further demonstrate the efficacy of our unified training strategy, data mixing strategy, and reward design. Resources will be released to the community.