Amazon Bedrock Guardrails provides safeguards that you can configure for your generative AI applications based on your use cases and responsible AI policies. You can create multiple guardrails tailored to different use cases and apply them across multiple foundation models (FMs), providing a consistent user experience and standardizing safety and privacy controls across generative AI applications. You can use guardrails for both model prompts and responses with natural language.
You can use Amazon Bedrock Guardrails in multiple ways to help safeguard your generative AI applications. For example:
- A chatbot application can use guardrails to help filter harmful user inputs and toxic model responses. 
- A banking application can use guardrails to help block user queries or model responses associated with seeking or providing investment advice. 
- A call center application to summarize conversation transcripts between users and agents can use guardrails to redact users’ personally identifiable information (PII) to protect user privacy. 
Amazon Bedrock Guardrails provides the following safeguards (also known as policies) to detect and filter harmful content:
- Content filters – Detect and filter harmful text or image content in input prompts or model responses. Filtering is done based on detection of certain predefined harmful content categories: Hate, Insults, Sexual, Violence, Misconduct and Prompt Attack. You also can adjust the filter strength for each of these categories. 
- Denied topics – Define a set of topics that are undesirable in the context of your application. The filter will help block them if detected in user queries or model responses. 
- Word filters – Configure filters to help block undesirable words, phrases, and profanity (exact match). Such words can include offensive terms, competitor names, etc. 
- Sensitive information filters – Configure filters to help block or mask sensitive information, such as personally identifiable information (PII), or custom regex in user inputs and model responses. Blocking or masking is done based on probabilistic detection of sensitive information in standard formats in entities such as SSN number, Date of Birth, address, etc. This also allows configuring regular expression based detection of patterns for identifiers. 
- Contextual grounding checks – Help detect and filter hallucinations in model responses based on grounding in a source and relevance to the user query. 
- Automated Reasoning checks – Can help you validate the accuracy of foundation model responses against a set of logical rules. You can use Automated Reasoning checks to detect hallucinations, suggest corrections, and highlight unstated assumptions in model responses. 
x
 
 
 
 
No comments:
Post a Comment