reddit~4 min to deploy

Reddit Moderation Safety Baseline

Toxicity, account-age, and brigading controls for automated moderation — with a signed reason for every removal.

For: Trust & safety and community ops running automated moderation

Auditable enforcement trailAppeal-ready evidence

What it enforces

Toxicity threshold

Removes content above the toxicity threshold for young accounts.

Account-age guard

Escalates removals on established accounts for human review.

Brigading signal

Restrains mass actions that match a coordinated-brigade pattern.

Run this policy in the simulator Deploy in Shadow Mode

reddit-mod-safety-baseline.yaml

# Reddit Moderation Safety Baseline
# Fork: adjust thresholds; every removal emits an appeal-ready dossier.
apiVersion: decionis.dev/v1
kind: PolicyPack
metadata:
  name: reddit-mod-safety-baseline
  surface: reddit
  standards: [auditable-enforcement]
defaults:
  mode: shadow
  emit_dossier: true
rules:
  - name: toxicity_threshold
    when: "action == 'moderation.remove'"
    decision: |
      REMOVE IF toxicity > 0.85 AND author.account_age_days < 30
      ESCALATE IF toxicity > 0.85
      ALLOW OTHERWISE
    reason_code: toxicity_over_threshold
  - name: account_age_guard
    when: "action == 'moderation.remove'"
    decision: |
      ESCALATE IF author.account_age_days >= 365
      ALLOW OTHERWISE
    reason_code: established_account_review
  - name: brigading_signal
    when: "action == 'moderation.bulk_remove'"
    decision: |
      RESTRAIN IF signals.coordinated_pattern == true
      ALLOW OTHERWISE
    reason_code: coordinated_brigade_suspected

Fork it, change the thresholds to match your environment, and deploy in shadow mode first — it defaults to listen-only so nothing in your live pipeline changes.

Loading…

Open-Policy Exchange