Pinned Loading
-
mmlu
mmlu PublicForked from hendrycks/test
Measuring Massive Multitask Language Understanding | ICLR 2021
-
SWE-agent
SWE-agent PublicForked from princeton-nlp/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.
Python
-
autogen
autogen PublicForked from microsoft/autogen
A programming framework for agentic AI
Jupyter Notebook
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.