Name		Name	Last commit message	Last commit date
parent directory ..
ElasticSearch		ElasticSearch
Screenshots		Screenshots
Solr		Solr
Watson Discovery		Watson Discovery
ProcessElastic.py		ProcessElastic.py
readme.md		readme.md
resultRetriever		resultRetriever

readme.md

Information Retriever for Retrieval Augmented Generation

This repository contains Python scripts demonstrating the use of a Neural Retriever in a Retrieval Augmented Generation (RAG) pipeline. The scripts demonstrate three different implementations of a Neural Retriever using Apache Solr, Elasticsearch, and Watson Discovery as document stores.

Directory Contents

Elasticsearch: Demonstrates the use of Elasticsearch as a document store.
- es_retriever.ipynb
Solr: Demonstrates the use of Apache Solr as a document store.
- solr_retriever.ipynb
- solr_retriever
Watson Discovery: Demonstrates the use of Watson Discovery as a document store.
- WD_PDF_Retriever
- WD_retriever.py
ProcessElastic.py: Re-usable Script to retrieve documents from Elasticsearch instance.

Getting Started

Clone this repository.
Install the required dependencies (see the Dependencies section below).
Modify the config.yaml to update the retriever pointing to your service
Run the ProcessElastic.py to see the neural retriever in action.

Usage

Run the ProcessElastic.py after updating config.yaml to see the neural retriever in action.

Example scripts and notebook

Each script defines a function for the information retriever (SolrRetriever, ESRetriever, or WDRetriever) takes a query and returns the top matching documents from the respective document store.

Here's a basic example of how you might use the SolrRetriever:

retriever = SolrRetriever(solr_url='http://localhost:8983/solr', collection_name='my_collection')
results = retriever.retrieve('What is DataOps?')
print(results)

Dependencies

These scripts require Python 3.6 or later. They also require the following Python libraries:

pysolr (for solr_retriever.py)
elasticsearch (for es_retriever.ipynb)
requests (for wd_retriever.py)

You can install these libraries using pip:

pip install pysolr elasticsearch requests

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2. Neural Retriever

2. Neural Retriever

readme.md

Information Retriever for Retrieval Augmented Generation

Directory Contents

Getting Started

Usage

Example scripts and notebook

Dependencies

Files

2. Neural Retriever

Directory actions

More options

Directory actions

More options

Latest commit

History

2. Neural Retriever

Folders and files

parent directory

readme.md

Information Retriever for Retrieval Augmented Generation

Directory Contents

Getting Started

Usage

Example scripts and notebook

Dependencies