Tim Ferris AI

As a way to examine what's possible with OpenAI's latest embeddings model called text-embedding-ada-002, I spent the weekend building a Tim Ferriss AI to answer questions addressed to him or any of his past guests.

We can use it to get human-like answers based on what was said in any episode.

TLDR;

The site uses a semantic search to find the chunks of text across all episodes that talk about what the question asks. Then it uses a GPT-3 model to generate a coherent answer.

Examples

See a few examples below on how it works:

Run loop

When you pose a question, the following things happen:

question text gets embedded
that embedding gets matched to N closest embeddings across all transcript chunks
the matched chunks get combined into a context string
the context string and the question get combined into a prompt
prompt is sent to another AI model to formulate into a coherent answer
include a sorted-by-similarity list of episode links from all chunks (since all those episodes talk about what the question asked)

Code

The loop above translates to the following code:

// question text gets embedded 
const embedding = await getEmbedding(question);

// embedding gets matched to N closest embeddings across all transcript chunks
const trascriptChunks = await matchTranscriptChunks(question, embedding);

// matched chunks get combined into a context string
const context = combineChunksIntoContext(trascriptChunks);

// context string and the question get combined into a prompt
const prompt = buildPrompt(context, question);

// prompt is sent to another AI model to formulate into a coherent answer
const answer = await getAnswer(prompt);

// include a sorted-by-similarity list of episode links from all chunks
const sortedEpisodes = await getMatchedEpisodesSortedByRelevance(trascriptChunks);

Setup

I crawled (most) of the episode transcripts, chunked them up into smaller segments of text roughly paragraph-size, and then used the embeddings model to embed each chunk into a 1536-dimensional vector.

The frontend is a Next.js app, the data is stored in Supabase, and the embeddings search is using pg-vector.

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
.vscode		.vscode
db		db
public		public
src		src
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
README.md		README.md
next.config.js		next.config.js
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Tim Ferris AI

TLDR;

Examples

Run loop

Code

Setup

About

Releases

Packages

Languages

nem035/tim.nem.ai

Folders and files

Latest commit

History

Repository files navigation

Tim Ferris AI

TLDR;

Examples

Run loop

Code

Setup

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages