Financial Insights Project

Overview

This project provides a system for handling financial insights using Large Language Models (LLMs). The LLMs are used in this project to process financial queries and generate insightful responses to user's text queries.

The project integrates OpenSearch, an open-source search and analytics engine, for storing and retrieving financial data. The OpenSearch index acts as a database that efficiently handles and searches through the financial data for different companies. It stores financial documents and allows for quick retrieval of relevant data based on user queries.

The project implements a web front interface enabling users to interact with the system by posing financial questions. The LLM processes these questions, retrieves the necessary data from the created index, and generates detailed and accurate responses.

Solution's principle

Database of Knowledge
Execution Flow for Request Answering
Solution's Architecture

Setup Instructions

Project Setup

To set up the virtual environment and install dependencies, follow these steps:

Clone the repository
```
git clone <project URL>
```
Create a virtual environment
```
python -m venv env
```
Activate the virtual environment
```
source env/bin/activate
```
Install the project and all its dependencies in editable mode (using ./setup.py)
```
pip install -e .
```
Set your personal keys: Set the OpenAI API key and OpenSearch admin password in the files:
```
 ./config/keys/openai_api_key.txt
 ./config/keys/opensearch_admin_password.txt
```

Configuration

Edit the config/config.json file with your specific paths, database credentials, and other configuration details specific to your environment.

Directory Structure

config/: Contains the configuration file config.json to be edited with your specific paths, database credentials, and other configuration details.
data/: Directory for storing financial data, metrics, and templates.
- company_data/: Financial data files (1 for each company) used to to fetch data for the RAG answer.
- metrics/: Contains the financial indicators to be used in evaluating a company.
- templates/: Contains the templates of the answers to be returned by each LLM.
src/: Source code for the project.
- db_scripts/: Scripts for creating and updating the OpenSearch index.
- models/: Contains the LLM model handler.
- web_app/: Contains the web application files.
- utils/: Utility functions used across the project.
test/: Contains unit and non-regression tests and sample test questions.
README.md: This readme file.
.dockerignore: Files to ignore in Docker builds.

Running the Project

Database Creation and Population

Run the OpenSearch Docker Container

python src/db_scripts/run_opensearch_container.py

Verify OpenSearch is running

password=$(cat config/keys/opensearch_admin_password.txt)
curl -X GET -k -u admin:"$password" https://localhost:9200/

Create the OpenSearch indexes

python src/db_scripts/create_index_script.py

Populate the Indexes with Learning Data

python src/db_scripts/update_index_script.py

Web Front Docker Setup

Build the Docker image
```
docker build -t financial_insights .
```

Run the Docker container

docker run -p 8000:8000 financial_insights

Interact with the Web Front

Access the API using a web browser or API client:
- Open your web browser and navigate to:
```
http://localhost:8000/docs
```
  This will open the interactive API documentation provided by FastAPI, where you can test the endpoints.
Sending a query:
- In the interactive API documentation, find the POST /query endpoint.
- Click on the endpoint to expand it.
- Click on the "Try it out" button.
- Enter the following JSON payload in the request body:
```
{
    "query": "What was the total revenue for the company in FY 2023?",
    "company_id": 123
}
```
- Click the "Execute" button to send the request.
View the response:
- The response will be displayed below the request area, showing the financial insights or answers to your query.

Alternatively, you can use an API client like Postman to interact with the web front by sending a POST request to:

http://localhost:8000/query

with the JSON payload mentioned above.

Using these instructions, you can interact with the web front via a web browser or an API client to send queries and receive financial insights.

Documentation Generation

To generate documentation from the code comments, follow these steps:

Create or empty the ./doc directory
```
mkdir -p doc && rm -rf doc/*
```
Generate documentation**
```
pydoc -w src
```

This will generate HTML documentation files from the Python docstrings in the src directory and place them in the ./doc directory.

Running Tests

To run the tests, use pytest:

Run the tests
```
pytest
```

#TODO specify the tests implemented

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Financial Insights Project

Overview

Solution's principle

Setup Instructions

Project Setup

Configuration

Directory Structure

Running the Project

Database Creation and Population

Web Front Docker Setup

Interact with the Web Front

Documentation Generation

Running Tests

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.github/workflows		.github/workflows
config		config
data		data
src		src
test		test
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
TODO.md		TODO.md
setup.py		setup.py

simbadSid/assignment_rag-20240628T124352Z-001

Folders and files

Latest commit

History

Repository files navigation

Financial Insights Project

Overview

Solution's principle

Setup Instructions

Project Setup

Configuration

Directory Structure

Running the Project

Database Creation and Population

Web Front Docker Setup

Interact with the Web Front

Documentation Generation

Running Tests

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages