Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
Intelligent cockpit systems represent a unique challenge for API Agents due to their interconnected, state-dependent subsystems that exceed the complexity of typical task environments. Traditional function calling approaches operate statelessly, requiring multiple exploratory calls to build environmental awareness before execution, leading to inefficiency and limited error recovery. We introduce VehicleWorld, the first comprehensive environment for the automotive domain that provides real-time state information during agent execution and enables precise evaluation of vehicle agent behaviors. Based on VehicleWorld, we construct a series of challenging scenarios and problems to benchmark agent performance. Through our analysis, we observe that directly predicting environment states is more effective than predicting function calls to change the environment. Building on this insight, we propose State-based Function Calling (SFC), which explicitly maintains system state awareness and implements direct state transitions to achieve target conditions. Our experimental results demonstrate that SFC serves as a strong baseline that significantly outperforms traditional function calling approaches across multiple metrics, including execution accuracy and latency reduction. By providing comprehensive state information, SFC enables models to better interpret user intentions while substantially reducing erroneous and redundant calls, advancing more reliable agent-vehicle interactions in intelligent cockpit environment.