此内容尚未提供您的语言版本,正在以英文显示。

Safety Scan

技能已验证活跃

Scan inputs for prompt injection, unsafe content, and adversarial attacks using AIDefence

目的

Protect your AI workflows from prompt injection, jailbreaks, and other adversarial attacks by scanning all untrusted input before processing.

功能

Detect prompt injection and jailbreaks
Scan for unsafe content and policy violations
Classify threats with confidence scores
Train defenses to improve detection rates
Provide multi-layer scanning for comprehensive safety

使用场景

Scan user submissions before processing
Validate API payloads for adversarial content
Protect against instruction override attacks
Ensure compliance with safety policies

非目标

Performing actions based on detected threats
Replacing the need for LLM-level safety
Scanning code for vulnerabilities

Compliance

info:GDPRThe skill analyzes input text, which may contain personal data. While it doesn't submit data to a third party, personal data might be submitted to the LLM for analysis, with no explicit mention of sanitization beyond detection.

Practical Utility

info:Usage examplesWhile the SKILL.md outlines steps, it does not provide explicit, ready-to-use end-to-end examples of invocation and observable outcome.
info:Edge casesThe SKILL.md lists threat categories but does not explicitly document failure modes, symptoms, or recovery steps for edge cases.

安装

请先添加 Marketplace

/plugin marketplace add ruvnet/ruflo

/plugin install ruflo-aidefence@ruflo

质量评分

已验证

95 /100

1 day ago 分析

信任信号

最近提交1 day ago

GitHub 所有者 ruvnet

星标50.2k

下载量 68.3k

许可证MIT

网站cognitum.one

状态

查看源代码

类似扩展

Prompt Guard

100

Meta's 86M prompt injection and jailbreak detector. Filters malicious prompts and third-party data for LLM apps. 99%+ TPR, <1% FPR. Fast (<2ms GPU). Multilingual (8 languages). Deploy with HuggingFace or batch processing for RAG security.

技能

Orchestra-Research

Secrets Management

100

Implement secure secrets management for CI/CD pipelines using Vault, AWS Secrets Manager, or native platform solutions. Use when handling sensitive credentials, rotating secrets, or securing CI/CD environments.

技能

wshobson

Semgrep Rule Creator

100

Creates custom Semgrep rules for detecting security vulnerabilities, bug patterns, and code patterns. Use when writing Semgrep rules or building custom static analysis detections.

技能

trailofbits

Safe Mode

100

Prevent destructive operations using Claude Code hooks. Three modes — cautious (warn on dangerous commands), lockdown (restrict edits to one directory), and clear (remove restrictions). Uses PreToolUse matchers for Bash, Edit, and Write.

技能

rohitg00

Soul Guardian

100

Drift detection + baseline integrity guard for agent workspace files with automatic alerting support

技能

prompt-security

Audit Dependency Versions

100

Audit project dependencies for version staleness, security vulnerabilities, and compatibility issues. Covers lock file analysis, upgrade path planning, and breaking change assessment. Use before a release to ensure dependencies are current and secure, during periodic maintenance reviews, after receiving a security advisory, when upgrading to a new language version, before submitting to CRAN or npm, or when inheriting a project to assess its dependency health.

技能

pjt222