
can-ai-code: Self-evaluating interview for AI coders #491

Open
irthomasthomas opened this issue Jan 31, 2024 · 0 comments
Labels

  • github: gh tools like cli, Actions, Issues, Pages
  • llm-applications: Topics related to practical applications of Large Language Models in various fields
  • llm-evaluation: Evaluating Large Language Models performance and behavior through human-written evaluation sets
  • New-Label: Choose this option if the existing labels are insufficient to describe the content accurately
  • openai: OpenAI APIs, LLMs, Recipes and Evals
  • source-code: Code snippets

Comments

@irthomasthomas (Owner) commented:
Title: the-crypt-keeper/can-ai-code: Self-evaluating interview for AI coders

A self-evaluating interview for AI coding models, written by humans and taken by AI.

Key Ideas

  • Interview questions written by humans, test taken by AI
  • Inference scripts for all common API providers and CUDA-enabled quantization runtimes
  • Sandbox environment (Docker-based) for untrusted Python and NodeJS code validation (see the sketch after this list)
  • Evaluate effects of prompting techniques and sampling parameters on LLM coding performance
  • Evaluate LLM coding performance degradation due to quantization
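
The sandbox bullet above is easy to picture in code. Below is a minimal sketch of a Docker-based runner for untrusted snippets, assuming docker is installed locally; the image names, resource limits, and timeout are illustrative guesses, not the repo's actual configuration.

import os
import subprocess
import tempfile

def run_untrusted(code: str, language: str = "python", timeout: int = 10):
    """Execute an untrusted snippet in a network-less, resource-capped container."""
    suffix = {"python": ".py", "nodejs": ".js"}[language]
    runner = {"python": "python3", "nodejs": "node"}[language]
    image = {"python": "python:3.11-slim", "nodejs": "node:20-slim"}[language]  # assumed images

    # Write the snippet to a temp file so it can be mounted read-only.
    with tempfile.NamedTemporaryFile("w", suffix=suffix, delete=False) as f:
        f.write(code)
        path = f.name
    try:
        return subprocess.run(
            ["docker", "run", "--rm",
             "--network", "none",   # no network access for untrusted code
             "--memory", "256m",    # cap memory
             "--cpus", "1",         # cap CPU
             "-v", f"{path}:/sandbox/snippet{suffix}:ro",  # read-only mount
             image, runner, f"/sandbox/snippet{suffix}"],
            capture_output=True, text=True, timeout=timeout,
        )
    finally:
        os.unlink(path)

# Example: run a trivial snippet and inspect its output.
result = run_untrusted("print(2 + 2)")
print(result.stdout)  # "4\n" if the snippet ran cleanly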

News

  • 2024-01-23: Evaluate mlabonne/Beyonder-4x7B-v2 (AWQ only, FP16 was mega slow).
  • 2

Suggested labels

{ "label-name": "interview-evaluation", "description": "Self-evaluating interview for AI coding models", "repo": "the-crypt-keeper/can-ai-code", "confidence": 96.49 }

irthomasthomas added the github, llm-applications, llm-evaluation, New-Label, openai, and source-code labels on Jan 31, 2024
irthomasthomas changed the title from "the-crypt-keeper/can-ai-code: Self-evaluating interview for AI coders" to "can-ai-code: Self-evaluating interview for AI coders" on Jan 31, 2024