Publications

You are welcome to explore my work on my Google Scholar.

Conference Papers

AgentBnB: A Browser-Based Cybersecurity Tabletop Exercise with Large Language Model Support and Retrieval-Aligned Scaffolding

Arman Anwar, Zefang Liu

arXiv preprint arXiv:2511.00265, 2025

AgentBnB uses AI agents and retrieval-based hints to turn cybersecurity tabletop training into a lightweight, scalable browser game.

[Paper]

AutoBnB-RAG: Enhancing Multi-Agent Incident Response with Retrieval-Augmented Generation

Zefang Liu, Arman Anwar

ICDM 2025 Workshop on the use of Large Language Models for Cybersecurity (LLM4Sec), 2025

AutoBnB-RAG adds retrieval-augmented generation to multi-agent incident response, improving coordination and decision-making in cyber incident simulations.

[Paper] [Slides] [Poster] [Code] [Conference]

A Generalized Multinodal Model for Plasma Particle and Energy Transport

Zefang Liu, Weston M. Stacey

arXiv preprint arXiv:2507.13627, 2025

A generalized multinodal model is introduced for toroidal plasma, enabling flexible simulation of particle and energy transport and efficient integration with reactor-scale and data-driven plasma studies.

[Paper] [Code]

Sensitivity Analysis of Transport and Radiation in NeuralPlasmaODE for ITER Burning Plasmas

Zefang Liu, Weston M. Stacey

67th Annual Meeting of the APS Division of Plasma Physics, 2025

NeuralPlasmaODE enables sensitivity analysis of ITER plasmas, revealing the dominant factors influencing energy confinement and the self-regulating nature of temperature-dependent transport.

[Paper] [Poster] [Code]

Optimizing External Sources for Controlled Burning Plasma in Tokamaks with Neural Ordinary Differential Equations

Zefang Liu, Weston M. Stacey

arXiv preprint arXiv:2507.09431, 2025

NeuralPlasmaODE is used to optimize external particle and energy sources in tokamaks by solving inverse problems that guide plasma toward desired core conditions through differentiable control.

[Paper] [Code]

EconWebArena: Benchmarking Autonomous Agents on Economic Tasks in Realistic Web Environments

Zefang Liu, Yinzhu Quan

NeurIPS 2025 Workshop on Bridging Language, Agent, and World Models for Reasoning and Planning (LAW), 2025

EconWebArena is a benchmark of 360 human-curated tasks from real websites that evaluates autonomous agents’ ability to perform complex, multimodal economic reasoning and web navigation using authoritative sources.

[Paper] [Poster] [Data] [Website]

AutoBnB: Multi-Agent Incident Response with Large Language Models

Zefang Liu

2025 13th International Symposium on Digital Forensics and Security (ISDFS), 2025

AutoBnB uses LLM-based agents with the Backdoors & Breaches framework to simulate and evaluate multi-agent collaboration strategies for cybersecurity incident response.

[Paper] [Slides] [Code] [Conference]

Multi-Agent Collaboration in Incident Response with Large Language Models

Zefang Liu

AAAI 2025 Workshop on Multi-Agent AI in the Real World (MARW), 2024

This paper explores LLM-based multi-agent collaboration in incident response, analyzing team structures using the Backdoors & Breaches card game.

[Paper] [Poster] [Code] [Conference]

Retrieval of Temporal Event Sequences from Textual Descriptions

Zefang Liu, Yinzhu Quan

NAACL 2025 Workshop on Knowledge-Augmented Methods for NLP (KnowledgeNLP), 2024

We present TESRBench, a benchmark for retrieving temporal event sequences from descriptions, and TPP-Embedding, a model aligning sequences and descriptions in a shared embedding space, achieving state-of-the-art performance on TESRBench.

[Paper] [Poster] [Code] [Data] [Conference]

TPP-LLM: Modeling Temporal Point Processes by Efficiently Fine-Tuning Large Language Models

Zefang Liu, Yinzhu Quan

ICLR 2025 Workshop on Foundation Models in the Wild (FM-Wild), 2024

This paper introduces TPP-LLM, a framework that integrates large language models with temporal embeddings to improve event sequence modeling and prediction, demonstrating strong performance across multiple real-world datasets.

[Paper] [Poster] [Code] [Data] [Conference]

Application of Neural Ordinary Differential Equations for ITER Burning Plasma Dynamics

Zefang Liu, Weston M. Stacey

AAAI 2025 Workshop on AI to Accelerate Science and Engineering (AI2ASE), 2024

A neural ODE-based burning plasma dynamics model, NeuralPlasmaODE, simulates energy transfer in ITER plasmas for both inductive and non-inductive scenarios.

[Paper] [Poster] [Code] [Conference]

InvAgent: A Large Language Model based Multi-Agent System for Inventory Management in Supply Chains

Yinzhu Quan, Zefang Liu

AAAI 2025 Workshop on Advancing LLM-Based Multi-Agent Collaboration (WMAC), 2024

InvAgent leverages large language models for adaptive and explainable multi-agent inventory management in supply chains, significantly improving efficiency and resilience.

[Paper] [Poster] [Code] [Conference]

EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning

Yinzhu Quan, Zefang Liu

2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) Findings, 2024

We introduce EconLogicQA, a benchmark designed to evaluate large language models’ ability to understand and sequence complex economic events, demonstrating its effectiveness through evaluations of various models.

[Paper] [Poster] [Data] [Conference]

AdaMoLE: Fine-Tuning Large Language Models with Adaptive Mixture of Low-Rank Adaptation Experts

Zefang Liu, Jiahua Luo

2024 Conference on Language Modeling (COLM), 2024

AdaMoLE introduces a dynamic approach to fine-tuning LLMs by using an adaptive mixture of LoRA experts, outperforming traditional top-k routing methods in various tasks.

[Paper] [Poster] [Code] [Conference]

Application of Neural Ordinary Differential Equations for Tokamak Plasma Dynamics Analysis

Zefang Liu, Weston M. Stacey

ICLR 2024 Workshop on AI4DifferentialEquations in Science (AI4DiffEqtnsInSci), 2024

A novel Neural ODE-based model is introduced for simulating tokamak plasma dynamics, offering precise energy transfer analysis crucial for advancing controlled thermonuclear fusion.

[Paper] [Poster] [Code] [Conference]

A Review of Advancements and Applications of Pre-trained Language Models in Cybersecurity

Zefang Liu

2024 12th International Symposium on Digital Forensics and Security (ISDFS), 2024

This paper examines how pre-trained language models enhance cybersecurity across various tasks and calls for continued innovation in this field.

[Paper] [Conference]

CyberBench: A Multi-Task Benchmark for Evaluating Large Language Models in Cybersecurity

Zefang Liu, Jialei Shi, John F. Buford

AAAI 2024 Workshop on Artificial Intelligence for Cyber Security (AICS), 2024

CyberBench introduces a benchmark for evaluating large language models in cybersecurity, alongside CyberInstruct, a fine-tuned LLM that performs competitively in this domain.

[Paper] [Slides] [Code] [Data] [Conference]

SecQA: A Concise Question-Answering Dataset for Evaluating Large Language Models in Computer Security

Zefang Liu

arXiv preprint arXiv:2312.15838, 2023

SecQA introduces a specialized dataset for evaluating the performance of large language models in computer security through multiple-choice questions.

[Paper] [Code] [Data]

Anomaly Detection of Command Shell Sessions based on DistilBERT: Unsupervised and Supervised Approaches

Zefang Liu, John F. Buford

2023 Conference on Applied Machine Learning in Information Security (CAMLIS), 2023

This paper leverages DistilBERT for anomaly detection in Unix shell sessions using both unsupervised and supervised methods, demonstrating effective detection of anomalous behavior with minimal data labeling.

[Paper] [Slides] [Conference]

A Multi-Region Multi-Timescale Burning Plasma Dynamics Model for Tokamaks

Zefang Liu, Weston M. Stacey

64th Annual Meeting of the APS Division of Plasma Physics, 2022

This paper develops a multi-region, multi-timescale transport model to simulate burning plasma dynamics in tokamaks, addressing energy transport and radiation effects to prevent thermal runaway instability in ITER scenarios.

[Slides] [Code] [Conference]

A Multi-Region Multi-Timescale Burning Plasma Dynamics Model for Tokamaks

Zefang Liu, Weston M. Stacey

63rd Annual Meeting of the APS Division of Plasma Physics, 2021

[Code] [Conference]

Deep Reinforcement Learning based Group Recommender System

Zefang Liu, Shuran Wen, Yinzhu Quan

arXiv preprint arXiv:2106.06900, 2021

This paper introduces a Deep Reinforcement learning-based Group Recommender System (DRGR) using actor-critic networks and the deep deterministic policy gradient algorithm.

[Paper] [Slides] [Code]

Yelp Review Rating Prediction: Machine Learning and Deep Learning Models

Zefang Liu

arXiv preprint arXiv:2012.06690, 2020

This paper predicts Yelp restaurant ratings using both traditional machine learning models and transformer-based models.

[Paper] [Code]

A One-Dimensional Multi-Region Multi-Timescale Burning Plasma Dynamics Model for Tokamaks

Zefang Liu, Weston M. Stacey

62nd Annual Meeting of the APS Division of Plasma Physics, 2020

[Code] [Conference]