Unsupervised Legal Evidence Retrieval via Contrastive Learning with Approximate Aggregated Positive

Authors

  • Feng Yao Tsinghua University
  • Jingyuan Zhang Alibaba Group
  • Yating Zhang Alibaba Group
  • Xiaozhong Liu Worcester Polytechnic Institute
  • Changlong Sun Alibaba Group
  • Yun Liu Tsinghua University
  • Weixing Shen Tsinghua University

DOI:

https://doi.org/10.1609/aaai.v37i4.25603

Keywords:

DMKM: Applications, DMKM: Web Search & Information Retrieval, APP: Other Applications, SNLP: Applications

Abstract

Verifying the facts alleged by the prosecutors before the trial requires the judges to retrieve evidence within the massive materials accompanied. Existing Legal AI applications often assume the facts are already determined and fail to notice the difficulty of reconstructing them. To build a practical Legal AI application and free the judges from the manually searching work, we introduce the task of Legal Evidence Retrieval, which aims at automatically retrieving the precise fact-related verbal evidence within a single case. We formulate the task in a dense retrieval paradigm, and jointly learn the constrastive representations and alignments between facts and evidence. To get rid of the tedious annotations, we construct an approximated positive vector for a given fact by aggregating a set of evidence from the same case. An entropy-based denoise technique is further applied to mitigate the impact of false positive samples. We train our models on tens of thousands of unlabeled cases and evaluate them on a labeled dataset containing 919 cases and 4,336 queries. Experimental results indicate that our approach is effective and outperforms other state-of-the-art representation and retrieval models. The dataset and code are available at https://github.com/yaof20/LER.

Downloads

Published

2023-06-26

How to Cite

Yao, F., Zhang, J., Zhang, Y., Liu, X., Sun, C., Liu, Y., & Shen, W. (2023). Unsupervised Legal Evidence Retrieval via Contrastive Learning with Approximate Aggregated Positive. Proceedings of the AAAI Conference on Artificial Intelligence, 37(4), 4783-4791. https://doi.org/10.1609/aaai.v37i4.25603

Issue

Section

AAAI Technical Track on Data Mining and Knowledge Management