Publications
Conference Proceedings
01
Do automatic factuality metrics measure factuality? a critical evaluation
Advances in Neural Information Processing Systems (NeurIPS), 2025
02
Analyzing LLM behavior in dialogue summarization: Unveiling circumstantial hallucination trends
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024
03
Evaluating the factuality of zero-shot summarizers across varied domains
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2024
04
USB: A unified summarization benchmark across tasks and domains
Findings of the Association for Computational Linguistics: EMNLP, 2023
05
Generating more faithful and consistent soap notes using attribute-specific parameters
Machine Learning for Healthcare Conference (MLHC), 2023
06
Automatically summarizing evidence from clinical trials: A prototype highlighting current challenges
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations (EACL), 2023
Preprints & Under Review
01
More benign errors are better than a few severe ones: Evaluating hallucination severity
Under review, 2025
02
GenAudit: Fixing factual errors in language model outputs with evidence
arXiv preprint arXiv:2402.12566, 2024