Skip to content

This repository allows you to quickly integrate Large Language Models (LLMs) with your Enterprise Data using Amazon Kendra and Amazon Bedrock/Sagemaker Jumpstart

Notifications You must be signed in to change notification settings

sachink2010/EnterpriseData-with-LLM

Repository files navigation

EnterpriseData-with-LLM

Generative AI is creating lots of exciting use-cases across industries. To create true business value from generative AI, requires integration of Large Language Models (LLM) with enterprise knowledge base.  LLMs are not trained on proprietary enterprise specific knowledge (but are trained on publicly available internet data), they might hallucinate and provide incorrect response to enterprise specific questions.

In this repository, I will present a way to quickly (within 1–2 hours) and securely integrate your enterprise data (Confluence pages, Salesforce Data, CRM data, Relational databases, manuals etc.) with Large Language Models (LLM). This is a full end-to-end solution - no model training, fine-tuning, or extensive deployment needed. You also do not need any specific AI/ML experience or extensive developer knowledge to deploy this solution. The answers provided will be grounded in your organisations specific knowledge, avoiding factuality issues such as hallucinations and out-of-context responses.

This solution will enable enterprises to create a lot of business use-cases like:

  • Improving customer experience: intelligent chat-bots providing answers based on enterprise data- say order status, account balance etc.
  • Increasing internal employee productivity: by generating enterprise specific proposals/ marketing material/manuals/job descriptions etc.
  • Internal Search engine: searching code repositories, internal documents, etc.

AnimatedImage

If you want to read more about architecture choices made here, please read Medium Blog https://medium.com/@Sachin.Kulkarni.NL/generative-ai-with-enterprise-data-3c81a8bffaf2.

Architecture of solution:

image

Here are steps to implement this solution:

1. Set up your AWS Sagemaker Studio environment and Git Clone

  • Login to your AWS Account, select any region (for e.g. Ireland (eu-west-1)) as the region and navigate to Amazon SageMaker Management Console. Click on Studio link in the left and then click on the Open Studio link. image

  • It will launch Amazon SageMaker Studio in a new browser window or tab. In the studio, click on File in the top menu. Next Open Terminal In Terminal tab. You can type in: git clone https://github.com/sachink2010/EnterpriseData-with-LLM

2. Set up your Kendra Index Using AWS Console:

  • Upload files in Bank Financial Statements folder to your S3 folder
  • Create a Kendra Index
  • Add data source as S3 bucket, set up sync as periodic, based on your needs
  • Follow steps as shown in src/CreateKendraIndex Folder

3. Run your Streamlit app in SageMaker Studio terminal and start using the app

  • Set the kendra_index_id variable in the Kendra-RAG-StreamlitApp.py file to match the index you created in step 2.
  • In SageMaker Terminal window type streamlit run streamlit run ./EnterpriseData-with-LLM/src/streamlit/Kendra-RAG-StreamlitApp.py --server.port 6006
  • You will see that Streamlit App is running message in your Terminal session
image

About

This repository allows you to quickly integrate Large Language Models (LLMs) with your Enterprise Data using Amazon Kendra and Amazon Bedrock/Sagemaker Jumpstart

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages