← スキル䞀芧に戻る
SylphxAI

trust-safety

by SylphxAI

🚀 AI development platform with MEP architecture - stop writing prompts, start building with 90% less typing

⭐ 4🍎 3📅 2026幎1月8日
GitHubで芋るManusで実行

SKILL.md


name: trust-safety description: Trust and safety - abuse prevention, rate limiting. Use when fighting bad actors.

Trust Safety Guideline

Tech Stack

  • Analytics: PostHog
  • Database: Neon (Postgres)
  • Workflows: Upstash Workflows + QStash

Non-Negotiables

  • All enforcement actions must be auditable (who/when/why)
  • Appeals process must exist for affected users
  • Graduated response levels must be defined (warn → restrict → suspend → ban)

Context

Trust & safety is about protecting users — from each other and from malicious actors. Every platform eventually attracts abuse. The question is whether you're prepared for it or scrambling to react.

Consider: what would a bad actor try to do? How would we detect it? How would we respond? What about the false positives — innocent users caught by automated systems? A good T&S system is effective against abuse AND fair to legitimate users.

Driving Questions

  • What would a motivated bad actor try to do on this platform?
  • How would we detect coordinated abuse or bot networks?
  • What happens when automated moderation gets it wrong?
  • How do affected users appeal decisions, and is it fair?
  • What abuse patterns exist that we haven't addressed?
  • What would make users trust that we're protecting them?

スコア

総合スコア

75/100

リポゞトリの品質指暙に基づく評䟡

✓SKILL.md

SKILL.mdファむルが含たれおいる

+20
✓LICENSE

ラむセンスが蚭定されおいる

+10
✓説明文

100文字以䞊の説明がある

+10
○人気

GitHub Stars 100以䞊

0/15
✓最近の掻動

3ヶ月以内に曎新

+5
○フォヌク

10回以䞊フォヌクされおいる

0/5
✓Issue管理

オヌプンIssueが50未満

+5
✓蚀語

プログラミング蚀語が蚭定されおいる

+5
✓タグ

1぀以䞊のタグが蚭定されおいる

+5

レビュヌ

💬

レビュヌ機胜は近日公開予定です