top of page

Atticus Clause Retrieval Dataset (ACORD)

An expert-annotated clause retrieval dataset

 

The Atticus Clause Retrieval Dataset (ACORD) is a corpus of commercial contract clauses with over 126,000 query-clause pairs in response to 114 queries. Each pair is rated from 1 to 5-stars by experts.

 

ACORD is curated and maintained by The Atticus Project, Inc. to support NLP research and development in legal contract review.

• 126,000+ query-clause pairs

• 114 queries

• Fully annotated by experts

Dataset

Dataset

Dataset

ACORD

README

Download here.

License

CC BY 4.0

Publication

Publication

Code

ACORD: An Expert-Annotated Retrieval Dataset for Legal Contract Drafting

https://arxiv.org/abs/2501.06582​

​​

People

Contributors

Attorney Annotators

Wei Chen

Yuji Sun

Tao Zhang

Benjamin Hendrick

Stacey Phillip

Alexander Kwonji Rosenberg

Michelle Sonu

Chris Herbst

Hannah Kang

Andy Song

Tim Evans

Ji-Hyun Park

Dataset Leads

Yuyang Sun

Sarah Harrell

 

Student Annotators

Adam Shankman

Lyla Sax

Jerry Jiang

Tarunya Dharmarajan

Liam Percer

Penelope Chung

Kevin Chen

AI Researchers & Technical Support:

Steven Wang

Andreas Plesner

Maksim Zubkov

Kexin Fan

Max Emanuel

Evan Wang

Anya Chen

The use of ACORD, Atticus Labels and other information provided by The Atticus Project is subject to our privacy policy and disclaimer.

  • LinkedIn
bottom of page