Zhen Zhang

2026

Advanced agentic intelligence is a prerequisite for deploying Large Language Models in practical, real-world applications. Diverse real-world APIs demand precise, robust function-calling intelligence, which requires agents to develop these capabilities through interaction in varied environments. The breadth of function-calling competence is closely tied to the diversity of environments in which agents are trained. In this work, we scale up environments as a step towards advancing general agentic intelligence. This gives rise to two central challenges: (i) how to scale environments in a principled manner, and (ii) how to effectively train agentic capabilities from experiences derived through interactions with these environments. To address these, we design a scalable framework that automatically constructs heterogeneous, fully simulated environments, broadening the space of function-calling scenarios. We further adapt a two-phase agent fine-tuning strategy: first endowing agents with fundamental agentic capabilities, then specializing them for domain-specific contexts. Extensive experiments on the agentic benchmarks τ-bench, τ²-Bench, and ACEBench demonstrate that our trained model, AgentScaler, significantly enhances the function-calling capability of models.

Social bots threaten online platforms by mimicking human behavior and forming deceptive connections, enabling the dissemination of misinformation while evading detection. Existing graph-based detection models leverage graph neural networks (GNNs) to capture relational structures and multimodal user features. However, such models are vulnerable to deceptive message propagation, where bots deliberately interact with legitimate users. These interactions create heterophilous edges, that is, connections between nodes with different labels (i.e., human and bot), which undermine the homophily assumption that connected users typically share similar characteristics. In this work, we propose a novel framework to mitigate deceptive message propagation through node-level uncertainty estimation and graph structure purification. The framework comprises three key components: (1) node uncertainty estimation, which employs evidential deep learning with an error-sensitive uncertainty loss to obtain calibrated node-wise uncertainty; (2) uncertainty-guided pseudo-label generation, which assigns pseudo-labels to low-uncertainty nodes using a dynamic threshold; and (3) graph structure purification, which selectively disconnects heterophilous edges identified between differently labeled nodes. Extensive experiments on three benchmark datasets and six GNN backbones demonstrate that our framework consistently enhances detection performance and serves as an effective general-purpose enhancement module for social bot detection.
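The purification step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `purify_edges`, the fixed `threshold` (standing in for the dynamic threshold the abstract mentions), and the toy graph are all assumptions made for illustration. An edge is dropped only when both endpoints are low-uncertainty and carry different labels.

```python
def purify_edges(edges, labels, uncertainty, threshold):
    """Drop edges whose endpoints are both confidently labeled and disagree.

    edges: list of (u, v) node pairs
    labels: dict node -> label (ground-truth or pseudo-label; hypothetical input)
    uncertainty: dict node -> uncertainty score in [0, 1]
    threshold: fixed cutoff, an assumption replacing the paper's dynamic threshold
    """
    kept = []
    for u, v in edges:
        both_confident = uncertainty[u] <= threshold and uncertainty[v] <= threshold
        if both_confident and labels[u] != labels[v]:
            continue  # heterophilous edge between confident nodes: disconnect
        kept.append((u, v))
    return kept


# Toy graph: node 3 is too uncertain for its edges to be judged.
edges = [(0, 1), (1, 2), (2, 3), (0, 3)]
labels = {0: "human", 1: "human", 2: "bot", 3: "bot"}
uncertainty = {0: 0.05, 1: 0.10, 2: 0.08, 3: 0.60}
purified = purify_edges(edges, labels, uncertainty, threshold=0.2)
```

In this toy graph only the edge (1, 2) is removed; the edge (0, 3) also joins a human to a bot but is kept because node 3 exceeds the uncertainty cutoff.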

2023

Recently, aspect sentiment quad prediction has received widespread attention in the field of aspect-based sentiment analysis. Existing studies extract quadruplets via pre-trained generative language models to paraphrase the original sentence into a templated target sequence. However, previous work focuses only on what to generate and ignores what not to generate. We argue that considering negative samples can also be beneficial. In this work, we propose a template-agnostic method to control token-level generation, which boosts original learning and reduces mistakes simultaneously. Specifically, we introduce Monte Carlo dropout to understand the built-in uncertainty of pre-trained language models, exposing their noise and errors. We further propose marginalized unlikelihood learning to suppress the uncertainty-aware mistake tokens. Finally, we introduce entropy minimization to balance the effects of marginalized unlikelihood learning. Extensive experiments on four public datasets demonstrate the effectiveness of our approach on various generation templates.
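As background for the idea of penalizing "what not to generate", the sketch below computes the plain token-level unlikelihood term, -Σ log(1 - p(c)) over a set of negative tokens. This is the standard formulation, not the marginalized variant the abstract proposes, whose exact form is not given here. The function name, the toy distribution, and the choice of negative token are assumptions for illustration.

```python
import math


def unlikelihood_loss(probs, negative_tokens, eps=1e-12):
    """Plain unlikelihood term: penalize probability mass on unwanted tokens.

    probs: dict token -> predicted probability at one decoding step
    negative_tokens: tokens the model should NOT generate (hypothetical input;
        in the abstract these would come from uncertainty-aware mistakes)
    eps: floor that keeps log() finite when a probability reaches 1.0
    """
    return -sum(math.log(max(1.0 - probs[t], eps)) for t in negative_tokens)


# Toy next-token distribution with one token marked as a mistake.
probs = {"great": 0.7, "bad": 0.2, "food": 0.1}
loss = unlikelihood_loss(probs, ["bad"])
```

The loss grows as the model puts more probability on a negative token, so minimizing it pushes that probability down while leaving the usual likelihood objective to handle the target tokens.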

Most named entity recognition (NER) systems focus on improving model performance, ignoring the need to quantify model uncertainty, which is critical to the reliability of NER systems in open environments. Evidential deep learning (EDL) has recently been proposed as a promising solution to explicitly model predictive uncertainty for classification tasks. However, directly applying EDL to NER applications faces two challenges, i.e., the problems of sparse entities and OOV/OOD entities in NER tasks. To address these challenges, we propose a trustworthy NER framework named E-NER by introducing two uncertainty-guided loss terms to the conventional EDL, along with a series of uncertainty-guided training strategies. Experiments show that E-NER can be applied to multiple NER paradigms to obtain accurate uncertainty estimation. Furthermore, compared to state-of-the-art baselines, the proposed method achieves a better OOV/OOD detection performance and better generalization ability on OOV entities.
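To make the notion of evidential uncertainty concrete, here is a minimal sketch of the standard EDL computation for a single prediction: non-negative evidence per class defines Dirichlet parameters α = e + 1, the expected class probabilities are α / S with S = Σα, and the uncertainty mass is K / S for K classes. It does not include the two uncertainty-guided loss terms that E-NER adds; the function name and the example evidence values are assumptions.

```python
def dirichlet_uncertainty(evidence):
    """Return (expected class probabilities, uncertainty mass) from evidence.

    evidence: list of non-negative evidence values, one per class
    """
    num_classes = len(evidence)
    alpha = [e + 1.0 for e in evidence]       # Dirichlet parameters
    strength = sum(alpha)                     # Dirichlet strength S
    probs = [a / strength for a in alpha]     # expected probabilities
    uncertainty = num_classes / strength      # u = K / S, in (0, 1]
    return probs, uncertainty


# No evidence at all: maximal uncertainty, uniform probabilities.
probs_none, u_none = dirichlet_uncertainty([0.0, 0.0, 0.0])

# Strong evidence for the first class: low uncertainty.
probs_strong, u_strong = dirichlet_uncertainty([9.0, 0.0, 0.0])
```

With zero evidence the uncertainty is 1, and it shrinks toward 0 as total evidence grows, which is what lets a token with little supporting evidence (for example an unseen entity) be flagged as uncertain.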