BBScore: A Brownian Bridge Based Metric for Assessing Text Coherence
DOI:
https://doi.org/10.1609/aaai.v38i13.29414Keywords:
ML: Evaluation and Analysis, NLP: (Large) Language Models, NLP: Interpretability, Analysis, and Evaluation of NLP ModelsAbstract
Measuring the coherence of text is a vital aspect of evaluating the quality of written content. Recent advancements in neural coherence modeling have demonstrated their efficacy in capturing entity coreference and discourse relations, thereby enhancing coherence evaluation. However, many existing methods heavily depend on static embeddings or focus narrowly on nearby context, constraining their capacity to measure the overarching coherence of long texts. In this paper, we posit that coherent texts inherently manifest a sequential and cohesive interplay among sentences, effectively conveying the central theme, purpose, or standpoint. To explore this abstract relationship, we introduce the "BB Score," a novel reference-free metric grounded in Brownian bridge theory for assessing text coherence. Our findings showcase that when synergized with a simple additional classification component, this metric attains a performance level comparable to state-of-the-art techniques on standard artificial discrimination tasks. We also establish in downstream tasks that this metric effectively differentiates between human-written documents and text generated by large language models within specific domains. Furthermore, we illustrate the efficacy of this approach in detecting written styles attributed to various large language models, underscoring its potential for generalizability. In summary, we present a novel Brownian bridge coherence metric capable of measuring both local and global text coherence, while circumventing the need for end-to-end model training. This flexibility allows for its application in various downstream tasks.Downloads
Published
2024-03-24
How to Cite
Sheng, Z., Zhang, T., Jiang, C., & Kang, D. (2024). BBScore: A Brownian Bridge Based Metric for Assessing Text Coherence. Proceedings of the AAAI Conference on Artificial Intelligence, 38(13), 14937-14945. https://doi.org/10.1609/aaai.v38i13.29414
Issue
Section
AAAI Technical Track on Machine Learning IV