human-evaluation

Star

Here are 4 public repositories matching this topic...

Contextualist / lone-arena

Star

Self-hosted LLM chatbot arena, with yourself as the only judge

human-evaluation llm

Updated Feb 6, 2024
Python

TianboJi / Dialogue-Eval

Star

Code and data for paper "Achieving Reliable Human Assessment of Open-Domain Dialogue Systems"

natural-language-processing open-domain-dialog human-evaluation

Updated Nov 18, 2022
Python

laihuiyuan / eval-formality-transfer

Star

Multidimensional Evaluation for Text Style Transfer Using ChatGPT. Human Judgement as a Compass to Navigate Automatic Metrics for Formality Transfer (HumEval 2022)

text-style-transfer formality-style-transfer human-evaluation automatic-evaluation

Updated Apr 27, 2023
Python

davidheineman / salsa

Star

Success and Failure Linguistic Simplification Annotation 💃

nlp text-simplification human-evaluation automatic-evaluation thresh

Updated May 11, 2024
Python

Improve this page

Add a description, image, and links to the human-evaluation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the human-evaluation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

human-evaluation

Here are 4 public repositories matching this topic...

Contextualist / lone-arena

TianboJi / Dialogue-Eval

laihuiyuan / eval-formality-transfer

davidheineman / salsa

Improve this page

Add this topic to your repo