Skip to main content

Log 7 ๐Ÿ›ซ

ยท 2 min read

Objectivesโ€‹

  • Deploy watsonx.ai on self-managed AWS infrastructure.

Accomplishmentsโ€‹

AWS

  • Shift from CP Deployer to OpenShift UPI deployment.
  • Artifactory proxy details procured.
  • Discussion of on-site logistics
  • RHEL 8 AMI changed for BootNode.

RAG

  • Creation of cronjob to capture logs from Python app.
  • Enabled metadata insertion into chunks in vector store -> (hopefully) increases retrieval accuracy
  • Return context to user (shows sources used to generate responses)
  • Added mixtral model support
  • Enable functionality for user to give custom rag parameters
  • Migrated vector DB from FAISS to chromaDB to enable the metadata functionality
  • Script written to easily test rag implementation and save results in csv
  • Implemented cache logic to make sure it considers combination of parameters as well before chosing to send a cached response

In Progressโ€‹

  • End-to-end deployment of OCP, CP4D, and watsonx.ai (with GPU node)
  • Tagging cp-deployer.sh generated resources.
  • Updating solution docs with better asset linking.

Next Stepsโ€‹

  • Setup bootnode with necessary downloads and resources.
  • Creation of IAM Role request creation Cloudformation templates.
  • Kick off on-site over the shoulder working sessions.
  • Collating information and resources to be created via OpenShift UPI deployment.
  • Setup Artifactory proxy.
  • Kick off Cloud Pak for Deployment entitlement key.
  • Deploy latest RAG version on AWS
  • Build out actions & flow in Watsonx Assistant after properly defining personas & objectives.

Tracking (Issues)โ€‹

  • Require sign-off on final CloudFormation template.
  • Red Hat CoreOS AMI still pending approval.