An expert-annotated clause retrieval dataset
The Atticus Clause Retrieval Dataset (ACORD) is a corpus of commercial contract clauses with over 126,000 query-clause pairs in response to 114 queries. Each pair is rated from 1 to 5-stars by experts.
ACORD is curated and maintained by The Atticus Project, Inc. to support NLP research and development in legal contract review.
• 126,000+ query-clause pairs
• 114 queries
• Fully annotated by experts
Dataset
Publication
ACORD: An Expert-Annotated Retrieval Dataset for Legal Contract Drafting
https://arxiv.org/abs/2501.06582​
​​
Contributors
Attorney Annotators
Wei Chen
Yuji Sun
Tao Zhang
Benjamin Hendrick
Stacey Phillip
Alexander Kwonji Rosenberg
Michelle Sonu
Chris Herbst
Hannah Kang
Andy Song
Tim Evans
Ji-Hyun Park
Dataset Leads
Yuyang Sun
Sarah Harrell
Student Annotators
Adam Shankman
Lyla Sax
Jerry Jiang
Tarunya Dharmarajan
Liam Percer
Penelope Chung
Kevin Chen
AI Researchers & Technical Support:
Steven Wang
Andreas Plesner
Maksim Zubkov
Kexin Fan
Max Emanuel
Evan Wang
Anya Chen
The use of ACORD, Atticus Labels and other information provided by The Atticus Project is subject to our privacy policy and disclaimer.