research

Research map organized around the questions I am asking, not the papers I have written.

Conventional vs. proposed pre-emptive vulnerability patch cycles
My research focus: close the pre-disclosure window and act before the attacker. Diagram generated with Gemini.

Under coordinated vulnerability disclosure, a vulnerability is typically fixed silently in the public repository weeks before its CVE is published, and attackers can infer the vulnerability from those silent commits long before defenders hear about it. In the CVE-2018-11776 Apache Struts remote-code-execution case, a silent fix sat in the public repo for about two months before public disclosure; this is the same class of exposure window that contributed to the 2017 Equifax breach (~147.9M records). Starting from our ASE'21 VulFixMiner paper, our research line has pioneered proactive vulnerability sensing: we model silent fix commits as the first public, inevitable signal of a hidden vulnerability, and this approach covers 65% of silent fixes 1–2 weeks ahead of CVE disclosure.

OSS Vulnerability Management

Q1. How can we detect a vulnerability before it is publicly disclosed?

Q2. How do we manage that vulnerability with one hand tied behind our back — no public CVEs or advisories to draw on?

Work on Vulnerability Management
Empirical Understanding
Proactive Sensing
Assessment
Fix & Validation

A software-engineering agent is only as good as the procedural knowledge it can bring to bear: how this repository is structured, how its tests fail, how past developers navigated change. Our Lingxi agent framework mines that knowledge from historical development data and from its own trajectories, guiding the agent harness and feeding back into the underlying model. Lingxi ranks #1 on SWE-bench Verified (81.2%) and is deployed across Huawei's internal product lines.

AI Agents for Software Engineering

Q1. How do we build a software-engineering agent that handles real, repository-scale tasks?

Q2. How does such an agent keep getting better — by mining knowledge from development history and its own trajectories?

Work on Lingxi
Dev Implicit Knowledge Mining
Code Agent Harness
Model & Loop Evolution (results coming soon)
  • [Traj-Evolver] trajectories → harness + model updates