Files
sigma-rules/hunting/llm/docs/aws_bedrock_sensitive_content_refusal_detection.md
T
Terrance DeJesus 70411664cf [Bug] Normalize Hunting Index Link Generation (#3872)
* normalizing hunting link generation

* replacing header

* adjusting quotes in f-strings

* added source file to metadata

* removed os dependency

* address bug in source file links

* reverting TOML loading

* change all List type hinting to list

* change all List type hinting to list

* fixed accented characters in queries

* reverted accent character removal; moved macos query and MD to macos folder
2024-07-10 11:01:59 -04:00

1.5 KiB

AWS Bedrock LLM Sensitive Content Refusals


Metadata

  • Author: Elastic
  • Description: This analytic flags multiple instances of LLM refusals to respond to sensitive prompts, helping to maintain ethical guidelines and compliance standards.
  • UUID: 11e33a8f-805b-4394-bee0-08ae8d78b025
  • Integration: aws_bedrock.invocation
  • Language: [ES|QL]
  • Source File: AWS Bedrock LLM Sensitive Content Refusals

Query

from logs-aws_bedrock.invocation-*
 | WHERE @timestamp > NOW() - 1 DAY
   AND (
     gen_ai.completion LIKE "*I cannot provide any information about*"
     AND gen_ai.completion LIKE "*end_turn*"
   )
 | STATS user_request_count = count() BY gen_ai.user.id
 | WHERE user_request_count >= 3

Notes

  • Examine flagged interactions for patterns or anomalies in user requests that may indicate malicious intent or probing of model boundaries.
  • Regularly review and update the phrases that trigger refusals to adapt to new ethical guidelines and compliance requirements.
  • Ensure that data logs contain enough detail to provide context around the refusal, which will aid in subsequent investigations by security teams.

MITRE ATT&CK Techniques

References

License

  • Elastic License v2