Luo, F. (2024). Vision-Language Models for Robot Success Detection. Proceedings of the AAAI Conference on Artificial Intelligence, 38(21), 23750-23752. https://doi.org/10.1609/aaai.v38i21.30552