John Hughes jplhughes

Independent Alignment Researcher | Machine Learning Engineer & Lead

23 followers · 0 following

Highlights

Pro

Pinned

bon-jailbreaking Public

Code release for "Attacking Audio Language Models with BoN Jailbreaking"

Python

ucl-dark/llm_debate Public

Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"

Python · 84 stars · 11 forks

evals_template Public

Template for any evals project using LLM apis

Python · 5 stars · 1 fork

speechmatics/hqa Public

Code to accompany the paper "Hierarchical Quantized Autoencoders"

Jupyter Notebook · 37 stars · 4 forks

model_written_evals Public

Create lm evals to look for inverse scaling behaviour

Python

dotfiles Public

Easily deploy my zsh and tmux configuration on new machines. Includes local and remote aliases to improve workflow.

Shell · 1 star · 2 forks