Common Datasets
ARCC
¶
Bases: LocalStep
Prepare dataset for RAG augmentation.
Source code in ragfit/processing/local_steps/common_datasets.py
ASQA
¶
Bases: LocalStep
Normalizes ASQA dataset.
It has long answer, to be measured with ROUGE-L and multiple short answers, to be
measured with string-EM. Long answer is saved in the answers
field, while the
short answers (list of lists) are saved in the answer-short
field.
Source code in ragfit/processing/local_steps/common_datasets.py
HotPot
¶
Bases: LocalStep
Normalizes NotPotQA dataset to look like NQ, TQA