Harrison/from methods #1912

hwchase17 · 2023-03-23T03:30:36Z

No description provided.

# Why

- Text vectors can be computed outside of the langchain Faiss wrapper, so this PR adds a way to pass text and vector pairs directly, both to initialize a Faiss index and to add embeddings to an existing one.

# What

- Add a `from_embedding` method to Faiss that initializes the Faiss index from embeddings, paired with their original text, that were computed outside of langchain.
- Add an `add_embedding` method that appends embeddings, paired with their original text, that were computed outside of langchain.
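The idea of initializing and extending an index from precomputed (text, vector) pairs can be sketched without any vector library. The snippet below is a minimal illustration, not the code from this PR: `TinyVectorIndex` and `search_by_vector` are hypothetical names, the method names simply follow the description above, and the signatures merged into langchain may differ.

```python
import math
from typing import Iterable, List, Tuple

TextEmbedding = Tuple[str, List[float]]


class TinyVectorIndex:
    """Hypothetical in-memory stand-in for a vector store (not the langchain Faiss class)."""

    def __init__(self) -> None:
        self.texts: List[str] = []
        self.vectors: List[List[float]] = []

    @classmethod
    def from_embedding(cls, text_embeddings: Iterable[TextEmbedding]) -> "TinyVectorIndex":
        # Build the index from vectors that were computed elsewhere.
        index = cls()
        index.add_embedding(text_embeddings)
        return index

    def add_embedding(self, text_embeddings: Iterable[TextEmbedding]) -> None:
        # Append more precomputed (text, vector) pairs; no embedding model is called.
        for text, vector in text_embeddings:
            self.texts.append(text)
            self.vectors.append(list(vector))

    def search_by_vector(self, query: List[float], k: int = 1) -> List[str]:
        # Brute-force nearest neighbours by Euclidean distance.
        order = sorted(
            range(len(self.texts)),
            key=lambda i: math.dist(query, self.vectors[i]),
        )
        return [self.texts[i] for i in order[:k]]


index = TinyVectorIndex.from_embedding([("apple", [1.0, 0.0]), ("banana", [0.0, 1.0])])
index.add_embedding([("cherry", [0.9, 0.1])])
nearest = index.search_by_vector([1.0, 0.05], k=2)
```

Because the vectors arrive already computed, the caller is free to batch, cache, or parallelize the embedding step before handing the pairs to the index.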

- **Description:** Provide a way to use different text for embedding.
  - For example, if you are ingesting stack-overflow Q&As for RAG, you would want to embed the questions and return the answer(s) for the hits. With this change, the consumer of langchain can implement that easily.
  - I noticed that a similar function was added to faiss.py in #1912 for performance reasons, but the same function can also achieve what I had in mind. So instead of changing the Document class to have embedding_content, I mimicked the implementation in faiss.py.
  - The test should provide some guidance on how to use it. It would be more intuitive to pass texts and embedding_texts as separate arguments, but I chose a `zip`-ed object for consistency with the faiss.py implementation.
  - I plan to make a similar pull request for OpenSearch.
- **Issue:** N/A
- **Dependencies:** None other than the existing ones.

Co-authored-by: Bagatur <baskaryan@gmail.com>
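The Q&A use case in the description can be sketched as follows: the vector comes from the question, while the stored text is the answer, and the two are paired with `zip`. This is only an illustration under stated assumptions: `toy_embed` is a hypothetical stand-in for a real embedding model, `search` is a hypothetical brute-force lookup, and neither is part of langchain.

```python
from typing import List, Tuple


def toy_embed(text: str) -> List[float]:
    """Hypothetical embedding: vowel counts. A real embedding model would go here."""
    lowered = text.lower()
    return [float(lowered.count(ch)) for ch in "aeiou"]


qa_pairs = [
    ("How do I reverse a list?", "Use reversed(xs) or xs[::-1]."),
    ("How do I read a file?", "Use open(path) inside a with block."),
]
questions = [question for question, _ in qa_pairs]
answers = [answer for _, answer in qa_pairs]

# The stored text is the answer, but the vector is computed from the question.
text_embeddings: List[Tuple[str, List[float]]] = list(
    zip(answers, (toy_embed(question) for question in questions))
)


def search(query: str, pairs: List[Tuple[str, List[float]]]) -> str:
    # Return the stored text whose vector is closest to the embedded query.
    query_vector = toy_embed(query)

    def squared_distance(vector: List[float]) -> float:
        return sum((a - b) ** 2 for a, b in zip(query_vector, vector))

    return min(pairs, key=lambda pair: squared_distance(pair[1]))[0]


hit = search("How do I reverse a list?", text_embeddings)
```

Querying with a stored question returns its answer, since that question's vector is at distance zero from the query vector.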

shibuiwilliam and others added 2 commits March 22, 2023 20:28

cr

8adabcd

tomarharsh mentioned this pull request Mar 23, 2023

Add utility method from_embedding_vecotrs in FAISS wrapper #1627

Closed

hwchase17 merged commit eb80d6e into master Mar 23, 2023

hwchase17 deleted the harrison/from-methods branch March 23, 2023 04:10

kennethchoe mentioned this pull request Sep 25, 2023

support add_embeddings for elasticsearch #11002

Merged
