Add NSFW-only filtering mode for content moderation

- Add nsfw_only_filtering field to GuildSettings model
- Create database migration for new field (20260124_add_nsfw_only_filtering)
- Update AI moderation logic to respect NSFW-only mode
- Add Discord command !ai nsfwonly <true/false> for toggling mode
- Implement filtering logic in image analysis for both attachments and embeds
- Add comprehensive test suite for new functionality
- Update documentation with usage examples and feature description
- Create dedicated CI workflow for testing NSFW-only filtering feature

When enabled, only sexual/nude content is filtered while allowing:
- Violence and gore
- Harassment and bullying
- Hate speech
- Self-harm content
- Other content categories

This mode is useful for gaming communities and mature discussion
servers that have specific content policies allowing violence
but prohibiting sexual material.

2026-01-24 23:51:10 +01:00

GuardDen

GuardDen is a comprehensive Discord moderation bot designed to protect your community while maintaining a warm, welcoming environment. Built with privacy and self-hosting in mind, GuardDen combines AI-powered content filtering with traditional moderation tools to create a safe space for your members.

Features

Core Moderation

Warn, Kick, Ban, Timeout - Standard moderation commands with logging
Strike System - Configurable point-based system with automatic escalation
Moderation History - Track all actions taken against users
Bulk Message Deletion - Purge up to 100 messages at once

Automod

Banned Words Filter - Block words/phrases with regex support
Scam Detection - Automatic detection of phishing/scam links
Anti-Spam - Rate limiting, duplicate detection, mass mention protection
Link Filtering - Block Discord invites and suspicious URLs

AI Moderation

Text Analysis - AI-powered content moderation using Claude or GPT
NSFW Image Detection - Automatic flagging of inappropriate images
NSFW-Only Filtering - Option to filter only sexual content while allowing other categories such as violence and harassment
Phishing Analysis - AI-enhanced detection of scam URLs
Configurable Sensitivity - Adjust strictness per server (0-100)

Verification System

Multiple Challenge Types - Button, captcha, math problems, emoji selection
Automatic New Member Verification - Challenge users on join
Configurable Verified Role - Auto-assign role on successful verification
Rate Limited - Prevents verification spam

Logging

Member joins/leaves
Message edits and deletions
Voice channel activity
Ban/unban events
All moderation actions

Web Dashboard

Servers overview with plan status and quick config links
Users view with cross-guild search and strike totals
Chats view for moderated message logs with filters
Moderation logs, analytics, and configuration updates
Config export for backups

Quick Start

Prerequisites

Python 3.11+
PostgreSQL 15+
Discord Bot Token (see setup below)
(Optional) Anthropic or OpenAI API key for AI features

Discord Bot Setup

Go to the Discord Developer Portal
Click New Application and give it a name (e.g., "GuardDen")
Go to the Bot tab and click Add Bot
Configure Bot Settings:
- Disable Public Bot if only you should be able to add the bot to servers
- Copy the Token (click "Reset Token") - this is your GUARDDEN_DISCORD_TOKEN
Enable Privileged Gateway Intents (all three required):
- Presence Intent - for user status tracking
- Server Members Intent - for member join/leave events, verification
- Message Content Intent - for reading messages (automod, AI moderation)
Generate Invite URL - Go to OAuth2 > URL Generator:

Scopes:
- bot
- applications.commands
Bot Permissions:
- Manage Roles
- Kick Members
- Ban Members
- Moderate Members (timeout)
- Manage Channels
- View Channels
- Send Messages
- Manage Messages
- Embed Links
- Attach Files
- Read Message History
- Add Reactions
Or use permission integer: 1239943348294
Use the generated URL to invite the bot to your server
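
The permission integer is a bitwise OR of one flag per permission. The following is a minimal sketch, assuming the bit positions from Discord's published permission flags, that recomputes the integer for exactly the permissions listed above. The names `PERMISSION_BITS`, `REQUIRED` and `permission_integer` are illustrative and not part of GuardDen. If the result differs from the integer quoted in this README, prefer the value the URL Generator produces for the boxes you actually tick.

```python
# Bit positions for the permissions listed in this README.
# Assumption: these follow Discord's documented permission flags.
PERMISSION_BITS = {
    "kick_members": 1,
    "ban_members": 2,
    "manage_channels": 4,
    "add_reactions": 6,
    "view_channel": 10,
    "send_messages": 11,
    "manage_messages": 13,
    "embed_links": 14,
    "attach_files": 15,
    "read_message_history": 16,
    "manage_roles": 28,
    "moderate_members": 40,
}

# The permissions this README asks you to tick in the URL Generator.
REQUIRED = list(PERMISSION_BITS)


def permission_integer(names):
    """Combine the named permissions into a single integer bitfield."""
    value = 0
    for name in names:
        value |= 1 << PERMISSION_BITS[name]
    return value


if __name__ == "__main__":
    print(permission_integer(REQUIRED))
```

Running the script prints the integer for the listed permissions, which can be compared against the `permissions=` parameter in the generated invite URL.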

Docker Deployment (Recommended)

Clone the repository:

git clone https://git.hiddenden.cafe/Hiddenden/GuardDen.git
cd GuardDen

Create your environment file:

cp .env.example .env
# Edit .env and add your Discord token

Start with Docker Compose:
```
docker compose up -d
```
Open the dashboard (if configured): http://localhost:8080

Local Development

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -e ".[dev,ai]"
```

Set up environment variables:

cp .env.example .env
# Edit .env with your configuration

Start PostgreSQL (or use Docker):
```
docker compose up db -d
```
Run the bot:
```
python -m guardden
```

Configuration

Environment Variables

Variable	Description	Default
`GUARDDEN_DISCORD_TOKEN`	Your Discord bot token	Required
`GUARDDEN_DISCORD_PREFIX`	Default command prefix	`!`
`GUARDDEN_ALLOWED_GUILDS`	Comma-separated guild allowlist	(empty = all)
`GUARDDEN_OWNER_IDS`	Comma-separated owner user IDs	(empty = admins)
`GUARDDEN_DATABASE_URL`	PostgreSQL connection URL	`postgresql://guardden:guardden@localhost:5432/guardden`
`GUARDDEN_LOG_LEVEL`	Logging level	`INFO`
`GUARDDEN_AI_PROVIDER`	AI provider (anthropic/openai/none)	`none`
`GUARDDEN_ANTHROPIC_API_KEY`	Anthropic API key (if using Claude)	-
`GUARDDEN_OPENAI_API_KEY`	OpenAI API key (if using GPT)	-
`GUARDDEN_DASHBOARD_BASE_URL`	Dashboard base URL for OAuth callbacks	`http://localhost:8080`
`GUARDDEN_DASHBOARD_SECRET_KEY`	Session secret for dashboard	Required
`GUARDDEN_DASHBOARD_ENTRA_TENANT_ID`	Entra tenant ID	Required
`GUARDDEN_DASHBOARD_ENTRA_CLIENT_ID`	Entra client ID	Required
`GUARDDEN_DASHBOARD_ENTRA_CLIENT_SECRET`	Entra client secret	Required
`GUARDDEN_DASHBOARD_DISCORD_CLIENT_ID`	Discord OAuth client ID	Required
`GUARDDEN_DASHBOARD_DISCORD_CLIENT_SECRET`	Discord OAuth client secret	Required
`GUARDDEN_DASHBOARD_OWNER_DISCORD_ID`	Discord user ID of the owner allowed to use the dashboard	Required
`GUARDDEN_DASHBOARD_OWNER_ENTRA_OBJECT_ID`	Entra object ID of the owner allowed to use the dashboard	Required
`GUARDDEN_DASHBOARD_CORS_ORIGINS`	Dashboard CORS origins	(empty = none)
`GUARDDEN_WORDLIST_ENABLED`	Enable managed wordlist sync	`true`
`GUARDDEN_WORDLIST_UPDATE_HOURS`	Managed wordlist sync interval	`168`
`GUARDDEN_WORDLIST_SOURCES`	JSON array of wordlist sources	(empty = defaults)
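
Several of these variables carry structured values (comma-separated IDs, a JSON array, booleans as text). The sketch below shows one way such values could be parsed, using the defaults from the table. It is an illustration only: `load_settings` and `parse_id_list` are hypothetical helpers, and the actual loading logic lives in `config.py` and may work differently.

```python
import json
import os


def parse_id_list(raw):
    """Turn '123, 456' into [123, 456]; an empty string yields an empty list."""
    return [int(part) for part in raw.split(",") if part.strip()]


def load_settings(env=None):
    """Read a subset of GUARDDEN_* variables from a mapping (defaults to os.environ)."""
    env = os.environ if env is None else env
    return {
        "prefix": env.get("GUARDDEN_DISCORD_PREFIX", "!"),
        # Empty allowlist means "all guilds" according to the table above.
        "allowed_guilds": parse_id_list(env.get("GUARDDEN_ALLOWED_GUILDS", "")),
        "log_level": env.get("GUARDDEN_LOG_LEVEL", "INFO"),
        "ai_provider": env.get("GUARDDEN_AI_PROVIDER", "none"),
        "wordlist_enabled": env.get("GUARDDEN_WORDLIST_ENABLED", "true").lower() == "true",
        "wordlist_update_hours": int(env.get("GUARDDEN_WORDLIST_UPDATE_HOURS", "168")),
        # Empty or unset means "use the built-in default sources".
        "wordlist_sources": json.loads(env.get("GUARDDEN_WORDLIST_SOURCES") or "[]"),
    }
```

Passing a plain dictionary instead of the process environment, for example `load_settings({"GUARDDEN_ALLOWED_GUILDS": "1, 2"})`, makes the parsing easy to check in isolation.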

Per-Guild Settings

Each server can configure:

Command prefix
Log channels (general and moderation)
Welcome channel
Mute role and verified role
Automod toggles (spam, links, banned words)
Automod thresholds and scam allowlist
Strike action thresholds
AI moderation settings (enabled, sensitivity, confidence threshold, log-only, NSFW detection, NSFW-only mode)
Verification settings (type, enabled)
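
To illustrate how strike action thresholds could drive automatic escalation, here is a minimal sketch. The threshold values, the action names and the function `action_for_strikes` are all assumptions made for the example; the real thresholds are whatever each guild configures.

```python
# Hypothetical mapping of strike totals to actions. GuardDen's actual
# thresholds are per-guild settings and may look different.
EXAMPLE_THRESHOLDS = {3: "timeout", 5: "kick", 7: "ban"}


def action_for_strikes(total, thresholds=EXAMPLE_THRESHOLDS):
    """Return the action for the highest threshold reached, or None if none is reached."""
    reached = [points for points in thresholds if total >= points]
    if not reached:
        return None
    return thresholds[max(reached)]
```

With these example values, a user at 6 points would be kicked, while a user at 2 points would receive no automatic action.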

Commands

Moderation

Command	Permission	Description
`!warn <user> [reason]`	Kick Members	Warn a user
`!strike <user> [points] [reason]`	Kick Members	Add strikes to a user
`!strikes <user>`	Kick Members	View user's strikes
`!timeout <user> <duration> [reason]`	Moderate Members	Timeout a user (e.g., 1h, 30m, 7d)
`!untimeout <user>`	Moderate Members	Remove timeout
`!kick <user> [reason]`	Kick Members	Kick a user
`!ban <user> [reason]`	Ban Members	Ban a user
`!unban <user_id> [reason]`	Ban Members	Unban a user by ID
`!purge <amount>`	Manage Messages	Delete multiple messages (max 100)
`!modlogs <user>`	Kick Members	View moderation history
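
Durations such as `1h`, `30m` or `7d` need to be converted into a time span before a timeout can be applied. The following sketch shows one possible parser; `parse_duration` is a hypothetical helper, and the set of accepted units (`s`, `m`, `h`, `d`) and the support for combined values like `1h30m` are assumptions, not confirmed GuardDen behavior.

```python
import re
from datetime import timedelta

# Assumed unit suffixes; the bot may accept a different set.
_UNITS = {"s": "seconds", "m": "minutes", "h": "hours", "d": "days"}
_TOKEN = re.compile(r"(\d+)([smhd])")
_WHOLE = re.compile(r"(?:\d+[smhd])+")


def parse_duration(text):
    """Parse strings like '30m' or '1h30m' into a timedelta."""
    text = text.strip().lower()
    if not _WHOLE.fullmatch(text):
        raise ValueError(f"invalid duration: {text!r}")
    total = timedelta()
    for amount, unit in _TOKEN.findall(text):
        total += timedelta(**{_UNITS[unit]: int(amount)})
    return total
```

Rejecting anything that does not fully match the pattern avoids silently treating a typo such as `1x` as a zero-length timeout.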

Configuration (Admin only)

Command	Description
`!config`	View current configuration
`!config prefix <prefix>`	Set command prefix
`!config logchannel [#channel]`	Set general log channel
`!config modlogchannel [#channel]`	Set moderation log channel
`!config welcomechannel [#channel]`	Set welcome channel
`!config muterole [@role]`	Set mute role
`!config automod <true/false>`	Toggle automod
`!config antispam <true/false>`	Toggle anti-spam
`!config linkfilter <true/false>`	Toggle link filtering

Banned Words

Command	Description
`!bannedwords`	List all banned words
`!bannedwords add <word> [action] [is_regex]`	Add a banned word
`!bannedwords remove <id>`	Remove a banned word by ID

Managed wordlists are synced weekly by default. You can override sources with GUARDDEN_WORDLIST_SOURCES (JSON array) or disable syncing entirely with GUARDDEN_WORDLIST_ENABLED=false.
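
As an illustration of how plain and regex entries might be matched against a message, consider the sketch below. The `BannedWord` dataclass, its field names and the action strings are hypothetical stand-ins for the real model in `models/guild.py`; plain entries are assumed to match whole words, case-insensitively.

```python
import re
from dataclasses import dataclass


@dataclass
class BannedWord:
    # Hypothetical shape; the real model may use different fields.
    pattern: str
    is_regex: bool = False
    action: str = "delete"


def find_match(content, entries):
    """Return the first entry that matches the message content, or None."""
    for entry in entries:
        if entry.is_regex:
            expression = entry.pattern
        else:
            # Escape plain words and require word boundaries on both sides.
            expression = r"\b" + re.escape(entry.pattern) + r"\b"
        if re.search(expression, content, re.IGNORECASE):
            return entry
    return None
```

Escaping plain entries keeps characters such as `.` or `*` from being interpreted as regex syntax, which is why the `is_regex` flag has to be set explicitly.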

Automod

Command	Description
`!automod`	View automod status
`!automod test <text>`	Test text against filters
`!automod threshold <setting> <value>`	Update a single automod threshold
`!automod allowlist`	List allowlisted domains
`!automod allowlist add <domain>`	Add a domain to the allowlist
`!automod allowlist remove <domain>`	Remove a domain from the allowlist
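
A domain allowlist is usually checked against the host part of a URL rather than the raw text. The sketch below shows one way to do that with the standard library; `is_allowlisted` is a hypothetical helper, and treating subdomains of an allowlisted domain as allowed is an assumption about the intended behavior.

```python
from urllib.parse import urlsplit


def is_allowlisted(url, allowlist):
    """Return True if the URL's host is an allowlisted domain or one of its subdomains."""
    host = (urlsplit(url).hostname or "").lower()
    if not host:
        # URLs without a scheme have no parsed hostname; treat them as not allowlisted.
        return False
    return any(host == domain or host.endswith("." + domain) for domain in allowlist)
```

Comparing on a leading dot prevents look-alike hosts such as `example.com.evil.io` or `notexample.com` from passing as `example.com`.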

AI Moderation (Admin only)

Command	Description
`!ai`	View AI moderation settings
`!ai enable`	Enable AI moderation
`!ai disable`	Disable AI moderation
`!ai sensitivity <0-100>`	Set AI sensitivity level
`!ai threshold <0.0-1.0>`	Set AI confidence threshold
`!ai logonly <true/false>`	Toggle AI log-only mode
`!ai nsfw <true/false>`	Toggle NSFW image detection
`!ai nsfwonly <true/false>`	Toggle NSFW-only filtering mode
`!ai analyze <text>`	Test AI analysis on text

Diagnostics (Admin only)

Command	Description
`!health`	Check database and AI provider status

Verification (Admin only)

Command	Description
`!verify`	Request verification (for users)
`!verify setup`	View verification setup status
`!verify enable`	Enable verification for new members
`!verify disable`	Disable verification
`!verify role @role`	Set the verified role
`!verify type <type>`	Set verification type (button/captcha/math/emoji)
`!verify test [type]`	Test a verification challenge
`!verify reset @user`	Reset verification for a user

Dashboard

The dashboard provides owner-only visibility and configuration across all servers, including servers, users, chats, moderation logs, analytics, and settings.

Configure Entra + Discord OAuth credentials in .env.
Run with Docker: docker compose up -d dashboard (builds the dashboard UI).
For local development without Docker, build the frontend: cd dashboard/frontend && npm install && npm run build
Start the dashboard: python -m guardden.dashboard
OAuth callbacks:
- Entra: http://localhost:8080/auth/entra/callback
- Discord: http://localhost:8080/auth/discord/callback

CI (Gitea Actions)

Workflows live under .gitea/workflows/ and mirror the previous GitHub Actions pipeline for linting, tests, and Docker builds.

Project Structure

guardden/
├── src/guardden/
│   ├── bot.py              # Main bot class
│   ├── config.py           # Settings management
│   ├── cogs/               # Discord command groups
│   │   ├── admin.py        # Configuration commands
│   │   ├── ai_moderation.py # AI-powered moderation
│   │   ├── automod.py      # Automatic moderation
│   │   ├── events.py       # Event logging
│   │   ├── moderation.py   # Moderation commands
│   │   └── verification.py # Member verification
│   ├── models/             # Database models
│   │   ├── guild.py        # Guild settings, banned words
│   │   └── moderation.py   # Logs, strikes, notes
│   └── services/           # Business logic
│       ├── ai/             # AI provider implementations
│       ├── automod.py      # Content filtering
│       ├── database.py     # DB connections
│       ├── guild_config.py # Config caching
│       ├── ratelimit.py    # Rate limiting
│       └── verification.py # Verification challenges
├── tests/                  # Test suite
├── migrations/             # Database migrations
├── dashboard/              # Web dashboard (FastAPI + React)
├── docker-compose.yml      # Docker deployment
└── pyproject.toml          # Dependencies

Verification System

GuardDen includes a verification system to protect your server from bots and raids.

Challenge Types

Type	Description
`button`	Simple button click (default, easiest)
`captcha`	Text-based captcha code entry
`math`	Solve a simple math problem
`emoji`	Select the correct emoji from options

Setup

Create a verified role in your server
Configure the role permissions (verified members get full access)

Set up verification:

!verify role @Verified
!verify type captcha
!verify enable

How It Works

New member joins the server
Bot sends a verification challenge via DM (or in a channel if the member has DMs disabled)
Member completes the challenge
Bot assigns the verified role
Member gains access to the server
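
The challenge and check steps above can be sketched for the `math` type as follows. This is a simplified illustration, not the code in `services/verification.py`: the `MathChallenge` class, the number range and the question wording are all assumptions.

```python
import random
from dataclasses import dataclass


@dataclass
class MathChallenge:
    question: str
    answer: int

    def check(self, response):
        """Return True if the member's reply is the expected number."""
        try:
            return int(response.strip()) == self.answer
        except ValueError:
            return False


def make_math_challenge(rng=None):
    """Create a simple addition challenge; the 1-10 range is an arbitrary choice."""
    if rng is None:
        rng = random.Random()
    a, b = rng.randint(1, 10), rng.randint(1, 10)
    return MathChallenge(question=f"What is {a} + {b}?", answer=a + b)
```

A bot would send `challenge.question` to the new member and assign the verified role once `challenge.check(reply)` returns `True`.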

AI Moderation

GuardDen supports AI-powered content moderation using either Anthropic's Claude or OpenAI's GPT models.

Setup

Set the AI provider in your environment:

GUARDDEN_AI_PROVIDER=anthropic  # or "openai"
GUARDDEN_ANTHROPIC_API_KEY=sk-ant-...  # if using Claude
GUARDDEN_OPENAI_API_KEY=sk-...  # if using OpenAI

Enable AI moderation per server:

!ai enable
!ai sensitivity 50  # 0=lenient, 100=strict
!ai nsfw true       # Enable NSFW image detection

Content Categories

The AI analyzes content for:

Harassment - Personal attacks, bullying
Hate Speech - Discrimination, slurs
Sexual Content - Explicit material
Violence - Threats, graphic content
Self-Harm - Suicide/self-injury content
Scams - Phishing, fraud attempts
Spam - Promotional, low-quality content

How It Works

Messages are analyzed by the AI provider
Results include confidence scores and severity ratings
Actions are taken based on guild sensitivity settings
All AI actions are logged to the mod log channel
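
The flow above can be pictured as a small decision function that combines the confidence threshold, the sensitivity setting and log-only mode. The sketch below is purely illustrative: the class names, the default values and especially the rule that maps sensitivity to a minimum severity are assumptions, not GuardDen's actual logic.

```python
from dataclasses import dataclass


@dataclass
class AnalysisResult:
    category: str
    confidence: float  # 0.0-1.0, as reported by the AI provider
    severity: int      # 0-100, higher is worse


@dataclass
class GuildAISettings:
    sensitivity: int = 50              # !ai sensitivity <0-100>
    confidence_threshold: float = 0.7  # !ai threshold <0.0-1.0>; default is assumed
    log_only: bool = False             # !ai logonly <true/false>


def decide(result, settings):
    """Return 'ignore', 'log' or 'delete_and_log' for one analysis result."""
    if result.confidence < settings.confidence_threshold:
        return "ignore"
    # Hypothetical rule: higher sensitivity lowers the severity needed to act.
    if result.severity < 100 - settings.sensitivity:
        return "ignore"
    return "log" if settings.log_only else "delete_and_log"
```

Under these assumptions, raising sensitivity to 100 acts on any sufficiently confident result, while log-only mode records the finding without removing the message.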

NSFW-Only Filtering Mode

For communities that only want to filter sexual content while allowing other content types:

!ai nsfwonly true

When enabled:

Blocked: Sexual content, nude images, explicit material
Allowed: Violence, harassment, hate speech, self-harm content

When disabled (normal mode):

Blocked: All inappropriate content categories

This mode is useful for gaming communities, mature discussion servers, or communities with specific content policies that allow violence but prohibit sexual material.
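
Conceptually, the mode narrows the set of categories that trigger an action. A minimal sketch, assuming the AI provider returns a collection of category labels: the `nsfw_only_filtering` parameter mirrors the `GuildSettings` field of the same name, while the label `"sexual"` and the function `should_act` are hypothetical.

```python
# Assumed label for sexual or nude content; the real category names may differ.
SEXUAL_CATEGORIES = {"sexual"}


def should_act(flagged_categories, nsfw_only_filtering):
    """Decide whether flagged content should be acted on under the current mode."""
    flagged = set(flagged_categories)
    if nsfw_only_filtering:
        # Only sexual content triggers an action; everything else is let through.
        return bool(flagged & SEXUAL_CATEGORIES)
    # Normal mode: any flagged category triggers an action.
    return bool(flagged)
```

For example, a message flagged only for violence would be acted on in normal mode but left alone when NSFW-only mode is enabled.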

Development

Running Tests

pytest
pytest -v                           # Verbose output
pytest tests/test_automod.py        # Specific file
pytest -k "test_scam"               # Filter by name

Code Quality

ruff check src tests                # Linting
ruff format src tests               # Formatting
mypy src                            # Type checking

License

MIT License - see LICENSE file for details.

Roadmap

AI-powered content moderation (Claude/OpenAI integration)
NSFW image detection
Verification/captcha system
Rate limiting
Voice channel moderation
Web dashboard
