This repository contains the data used to train the models of our article "Quantifying French Document Complexity".
The data present in this repository has been created by us.
Our dataset is hosted in this repository in a zip
format. You can manually download it by clicking here or you can use wget
as follow:
wget https://github.com/GRAAL-Research/FCCLC/raw/main/FCCLC.zip
If you use the data provided in this repository, please cite us using the following:
@article{Primpied2022Quantifying,
author = {Primpied, Vincent and Beauchemin, David and Khoury, Richard},
journal = {Proceedings of the Canadian Conference on Artificial Intelligence},
year = {2022},
month = {may 27},
note = {https://caiac.pubpub.org/pub/iaeeogod},
publisher = {Canadian Artificial Intelligence Association (CAIAC)},
title = {Quantifying {French} {Document} {Complexity} },
}
This dataset is under MIT License.
The following table is necessary for this dataset to be indexed by search engines such as Google Dataset Search.
property | value | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
name | French Canadian complexity level corpus |
||||||||||||||
alternateName | FCCLC |
||||||||||||||
url | https://github.com/GRAAL-Research/FCCLC |
||||||||||||||
description | FCCLC is an annotated dataset of different French documents with their associated complexity level on a grading scale.
|
||||||||||||||
creator |
|
||||||||||||||
provider |
|
||||||||||||||
license |
|
||||||||||||||
citation | https://caiac.pubpub.org/pub/iaeeogod/release/1 |