DS1 spectrogram: The Cases LJP Never Sees: Prosecution Decision Prediction for More Complete Criminal Liability Assessment

The Cases LJP Never Sees: Prosecution Decision Prediction for More Complete Criminal Liability Assessment

2605.28464

Authors

Qi Wei,Jie Zhang,Qianru Wang,Shuyuan Zheng,Junyu Lu

Abstract

Legal Judgment Prediction (LJP) has become a core benchmark for evaluating AI in the criminal legal domain, but it only sees criminal cases that have already passed prosecutorial review and been formally indicted. As a result, LJP leaves a substantial blind spot in assessing criminal liability, overlooking cases involving insufficient evidence, no criminal liability, or guilt exempted from punishment.

To fill this gap, we propose Prosecution Decision Prediction (PDP), the first Legal AI task built around prosecutorial review, which classifies each case into prosecution or one of three non-prosecution decisions and reflects legal AI's capabilities in evidence evaluation, legal subsumption, and value-based discretion. We further construct PDP-Bench, a benchmark of 4{,}630 real Chinese prosecutorial decisions spanning 190 charges. Extensive experiments show that state-of-the-art LLMs perform substantially worse on PDP than on LJP and that mainstream enhancement routes fail to close the gap.

Moreover, controlled RLVR interventions show that simple outcome rewards fail to produce generalizable PDP discrimination.

Resources

Stay in the loop

Every AI paper that matters, free in your inbox daily.

Details

  • takara.ai
  • Custom AI and machine learning from the Frontier Research Team.
  • © 2026 takara.ai Ltd
  • Content is sourced from third-party publications.