automatic-evaluation

Star

Here are 6 public repositories matching this topic...

terryyz / ice-score

Star

[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code

evaluation code-generation code-quality automatic-evaluation gpt-4 large-language-models llm

Updated Jun 16, 2024
Python

hprodrig / MONSERRATE_Corpus

Star

MONSERRATE is a dataset specifically created to evaluate Question Generation systems. It has, on average, 26 questions associated to each source sentence, attempting to be an “exhaustive” reference.

evaluation corpus dataset squad question-generation msmarco automatic-evaluation

Updated Oct 28, 2022
Python

johnny-brav0 / AutomaticEvaluation

Star

Automatic Evaluation of Textual Answers on the famous Kaggle Automated Essay Scoring (AES) dataset.

tensorflow word2vec essayscoring essay-grading automatic-evaluation

Updated Mar 17, 2022
Jupyter Notebook

laihuiyuan / eval-formality-transfer

Star

Multidimensional Evaluation for Text Style Transfer Using ChatGPT. Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer (HumEval 2022)

text-style-transfer formality-style-transfer human-evaluation automatic-evaluation