Files

T

Terrance DeJesus 70411664cf [Bug] Normalize Hunting Index Link Generation (#3872 )

* normalizing hunting link generation

* replacing header

* adjusting quotes in f-strings

* added source file to metadata

* removed os dependency

* address bug in source file links

* reverting TOML loading

* change all List type hinting to list

* change all List type hinting to list

* fixed accented characters in queries

* reverted accent character removal; moved macos query and MD to macos folder

2024-07-10 11:01:59 -04:00

1.5 KiB

Raw Blame History

AWS Bedrock LLM Sensitive Content Refusals

Metadata

Author: Elastic
Description: This analytic flags multiple instances of LLM refusals to respond to sensitive prompts, helping to maintain ethical guidelines and compliance standards.
UUID: 11e33a8f-805b-4394-bee0-08ae8d78b025
Integration: aws_bedrock.invocation
Language: [ES|QL]
Source File: AWS Bedrock LLM Sensitive Content Refusals

Query

from logs-aws_bedrock.invocation-*
 | WHERE @timestamp > NOW() - 1 DAY
   AND (
     gen_ai.completion LIKE "*I cannot provide any information about*"
     AND gen_ai.completion LIKE "*end_turn*"
   )
 | STATS user_request_count = count() BY gen_ai.user.id
 | WHERE user_request_count >= 3

Notes

Examine flagged interactions for patterns or anomalies in user requests that may indicate malicious intent or probing of model boundaries.
Regularly review and update the phrases that trigger refusals to adapt to new ethical guidelines and compliance requirements.
Ensure that data logs contain enough detail to provide context around the refusal, which will aid in subsequent investigations by security teams.

1.5 KiB

Raw Blame History

AWS Bedrock LLM Sensitive Content Refusals

Metadata

Query

Notes

MITRE ATT&CK Techniques

References

License

1.5 KiB Raw Blame History

AWS Bedrock LLM Sensitive Content Refusals

Metadata

Query

Notes

MITRE ATT&CK Techniques

References

License

1.5 KiB

Raw Blame History