
Log 3 🛫


Objectives

  • Deploy watsonx.ai on self-managed AWS infrastructure.

Accomplishments

AWS

  • Identified the AWS DevOps role to use and augment with the required permissions.
  • Adjusted the check-permissions.sh script to account for a profile being passed in.
  • Created CloudFormation templates for roles with the permissions needed for the install.
    • Added --profile and $PROFILE_NAME
  • Adjusted the CloudFormation templates to use roles instead of a user.
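
As a rough illustration of the profile handling noted above, the sketch below builds an AWS CLI command that forwards a named profile. The actual check-permissions.sh is not shown in this log, so the helper name `build_aws_command` and the fallback to a `PROFILE_NAME` environment variable are assumptions made for illustration only.

```python
import os
import shlex


def build_aws_command(args, profile=None):
    """Return an AWS CLI argv list, appending --profile when a profile is set."""
    # Hypothetical fallback: use the PROFILE_NAME environment variable
    # when no profile is passed explicitly.
    profile = profile or os.environ.get("PROFILE_NAME")
    cmd = ["aws", *args]
    if profile:
        cmd += ["--profile", profile]
    return cmd


if __name__ == "__main__":
    # "devops" is a placeholder profile name, not one taken from this log.
    print(shlex.join(build_aws_command(["sts", "get-caller-identity"], "devops")))
```

Keeping the profile optional means the same command still works with the default credential chain when no profile is supplied.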

RAG

  • App deployed on Fyre VM
  • Added support for the Granite v2 and Llama 2 70B chat models.
  • Configured watsonx Assistant to interact with the app via API for easier testing.

In Progress

  • End-to-end deployment of OCP, CP4D, and watsonx.ai (with GPU node)
  • Tagging resources generated by cp-deployer.sh.
  • Testing the new RAG chunking method.
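
This log does not describe the new chunking method itself. For context only, the sketch below shows a common baseline, fixed-size character chunking with overlap, which is the kind of approach a new method would typically be compared against. The function name `chunk_text` and the `size` and `overlap` parameters are hypothetical and are not taken from the RAG app.

```python
def chunk_text(text, size=200, overlap=50):
    """Split text into chunks of at most `size` characters.

    Consecutive chunks share `overlap` characters so that content cut at a
    boundary still appears whole in at least one chunk.
    """
    if size <= 0 or not 0 <= overlap < size:
        raise ValueError("require size > 0 and 0 <= overlap < size")
    step = size - overlap
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        # Stop once the current chunk reaches the end of the text, so no
        # trailing chunk is fully contained in the previous one.
        if start + size >= len(text):
            break
        start += step
    return chunks


if __name__ == "__main__":
    for chunk in chunk_text("abcdefghij", size=4, overlap=2):
        print(chunk)
```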

Next Steps

  • Continue over-the-shoulder working sessions.
  • Compile the list of required endpoints.
  • Fill out the network values required for the OCP deployment.
  • Add Mixtral model to RAG.
  • Deploy latest RAG version on AWS
  • Build out actions and flows in watsonx Assistant after properly defining personas and objectives.

Tracking (Issues)

  • Require sign-off on final CloudFormation template.