watsonx.gov
Overview
warning
Prerequisite: watsonx.gov Software Requirement
This portion of the Create tab will walk you through how to create and integrate the watsonx.gov platform with the Orchestrate solutions. That way, you can monitor, track, and update your models in real-time and before deployment. To check out how watsonx.gov adds to the Business Value of this particular use case, visit the Business Use Case tab.
Steps to Track Models in the watsonx.gov Platform
- Save your Prompt Lab as a Prompt Template.
- Within the project underneath Assets, click on the vertical 3 dots --> "Go to AI Factsheet" button.
- Once your in your AI Factsheet, make sure to click "Track in AI use case" which will allow you to track the model's lifecycle from development to deployment.
- If you have not created an AI Use Case yet, please do so in order to track this model in your AI Use Case.
- Now, back in your project underneath Assets, please click on the vertical 3 dots again --> "Promote to space" button.
- If you do not have a deployment space created yet, please create one on the main watsonx.ai homepage in order to hold all the deployed models that you create.
- OPTIONAL: It might be beneficial to create a Pre-Production and a Post-Production deployment for the same model. This is beneficial because at different stages of deployments, there are different metrics unlocked in order to showcase the attributes of the model.
- If you do not have a deployment space created yet, please create one on the main watsonx.ai homepage in order to hold all the deployed models that you create.
- Now in the Deployment Space underneath the Assets tab, click again on the vertical 3 dots --> "Deploy" button.
- Once you have deployed, the page will bring you to all of your Deployments, click into the model that you would like to Evaluate. Next, click on the header "Evaluations" then click the "Evaluate" button. At this point, it is now time to evaluate your models under the Evaluate tab, but a pre-requisite is to have some sort of feedback dataset with pre-generated inputs and outputs of what you want your model to spit out in an ideal situation. That way, the model can then evaluate its results against this dataset.
- If you do not have this dataset, an easy way to create it is to utilize LLMs to create synthetic data.
- Once you have your feedback dataset, once you have clicked the "Evaluate" button, under the header "Select text data," upload your feedback dataset.
- The model will likely take a few minutes to evaluate, but once it has, in the Actions tab, feel free to adjust parameters depending on what you feel your model thresholds should be -- each use case requires a different set of thresholds.
- Also, note that if you set your model to Production, then you will be able to perform a Drift evaluation. This requires another dataset called the payload dataset, which is typically shorter than the feedback one but similarly has synthetic data in it. Drift will track whether or not your model degrades over time.
- Once deploying and evaluation the models on the watsonx.ai platform, you will be able to view your models in OpenScale where additional metrics regarding quality and model health will be shared. Within OpenScale, you can also perform various evaluations on your models and readjust thresholds.
Assistant Integration
Create Custom Extension
- Within the watsonx Orchestrate instance, navigate to the sidebar and go to the header "AI Assistant Builder"
- Once you've entered the AI Assistant Builder, navigate to the sidebar and go to the header "Integrations"
- Under the "Extensions" tab, click on "Build custom extension" so that you can create a custom extension for watsonx.gov in your assistant.
- Name the extension something descriptive. When it asks for the OpenAPI Spec, you can either create your own OpenAPI Spec, or utilize the one that we provide, located here.
- Once you've created the extension, you need to add it to the assistant by clicking the "Add +" button on the Integrations homepage.
- Click on Next, then click OAuth authentication. To create an API Key, please go to IBM Cloud at this link then create a new API key and paste it into where it asks for an API key.
Assistant Action Configuration
- To then utilize the extension that you've just created, go to the Actions tab on the sidebar of the Assistant. Create a new action for the Assistant where you want to utilize the extension.
- Under the step that you want to utilize the extension, in the "And then" section, click on "Use an extension" and choose the extension that you just created. Click on the operation "Get the predictions" and be sure to click on "Apply" for the wml_deployment_id, version, and query text variables.
- To get the wml_deployment_id, go to your cloud environment, underneath Deployments, click on your Deployment Space, then on the right-hand side where it says "About this deployment," under "Deployment Details," copy the Deployment ID.
- For the query_text variable, this should be filled with the input that you want to send to the model.
- Congratulations, you've successfully integrated a custom extension!
Retrieve Relevant watsonx.gov Metadata
- Within the watsonx Orchestrate instance, navigate to the sidebar and go to the header "AI Assistant Builder"
- Once you've entered the AI Assistant Builder, navigate to the sidebar and go to the header "Assistant"
- Go to the action where you utilize the custom extension (the action that you built in the last section). Navigate to the conversation step where you added "Use an extension" then click on "New Step +" to add a step after this extension.
- In that new step, set a new variable to a new expression. Call the new variable anything you want, but something descriptive. In the expression free text space, type in something like this:
${step_001_result_1}.body.results[0].generated_text
note- Instead of step_001 though, please replace with the specific step number of the previous step where you called the extension. To find this step number, go to the step before where you called the extension, go to the URL and all the way at the right, it should say the step number. Copy that step number and replace the step_001 with it.
- Also, make sure to put the step_#_result_1 in curly brackets.
- Great, now you can use this variable in any of your "Assistant says" sections to depict the model's response in the Assistant response.