🛡️ Safeguards

Safeguards allow you to block AI outputs based on specific keywords in responses. They are perfect for preventing harmful, unethical, and dangerous responses from Await Cortex. They also help protect against malicious prompt injections and jailbreaking.

Feature: Safeguards

Instructions

  1. Go to the agent's tab and click on an agent you have created

    image-20240424-185813.png
  2. Once inside your agent panel you will have 4 tabs to choose from - Configuration, Chat Interface, Analytics, Cached Answers, and Safeguards. Click Safeguards.

image-20240425-021349.png
  1. Now you’ve arrived in the safeguards screen. This is where you can configure them.

     

     

     

  2. To create a new safeguard click the “Add New” button in the top right.

     

  3. Now you are in the create safeguard screen - fill in the fields to create your first safeguard:

Fallback and Warning Example

A Fallback makes the AI respond with a canned response based on your keyword flags

A Warning allows the AI to generate a response put provides a disclaimer for it’s answer

Safeguard Example: Medical Disclaimer

What it looks like in the agent:

Fallback

Disclaimer

Replace