
X (Twitter) Hateful Conduct & Content Moderation Policy 2026 — Rules, Enforcement Changes & Advertiser Impact

X's hateful conduct policy has undergone significant changes in 2026. This compliance guide covers current rules, enforcement tiers, advertiser brand safety tools, and actionable steps to protect campaigns on the platform.

April 2, 2026 · 13 min read · AuditSocials Research

X's Hateful Conduct Policy in 2026 — Current Rules Overview

The X platform hateful conduct policy 2026 defines hateful conduct as any content that promotes violence against, threatens, or harasses other people on the basis of protected characteristics. Understanding the current state of these rules is essential for advertisers, compliance professionals, and anyone operating on the platform at scale.

As of April 2026, X's hateful conduct policy covers the following protected categories: race, ethnicity, national origin, caste, sexual orientation, gender, gender identity, religious affiliation, age, disability, and serious disease. Content that falls under hateful conduct includes but is not limited to:

  • Violent threats: Statements of intent to inflict physical harm on an individual or group based on protected characteristics
  • Dehumanization: Language that dehumanizes others by comparing them to animals, viruses, or subhuman entities based on group membership
  • Hateful imagery: Logos, symbols, or images historically associated with hate groups when used to promote hateful ideology
  • Incitement: Content that encourages or calls for others to harass, intimidate, or commit violence against protected groups
  • Denial of atrocities: Content that denies or distorts documented historical atrocities including genocide and mass violence
  • Hateful references: Content that uses slurs, tropes, or stereotypes to demean individuals based on protected characteristics
  • Coded language and dogwhistles: A 2026 addition — content that uses coded terms, symbols, or phrases widely understood to convey hateful intent while appearing superficially neutral
"X's hateful conduct policy has expanded in scope on paper, but the enforcement mechanism has shifted significantly toward automated detection and community-driven moderation. For advertisers, the gap between policy text and enforcement reality is the primary risk factor."

X's 2026 framework also distinguishes between public-facing content (posts, replies, quote posts, profile content) and private content (direct messages). Hateful conduct rules apply fully to all public content surfaces. Direct messages are subject to hateful conduct enforcement only when reported by a recipient, with the exception of content that constitutes a direct threat — which X states it may act on proactively through automated scanning.

For a continuously updated view of how X's hateful conduct policy compares to other platforms and how it has changed over time, visit our Policy Change Tracker.

Content Moderation Changes Under New Ownership — X Platform Content Moderation Policy 2026

The X platform content moderation policy 2026 cannot be understood without context on the structural changes that preceded it. Since the acquisition by Elon Musk in October 2022, X's approach to content moderation has undergone the most significant transformation of any major social platform in the past decade.

Timeline of Key Content Moderation Changes

| Period | Change | Impact on Hateful Content Enforcement |
| --- | --- | --- |
| Oct–Dec 2022 | ~80% reduction in Trust & Safety staff | Dramatic decrease in human review capacity; automated systems became primary enforcement mechanism |
| Q1 2023 | Reinstatement of previously suspended accounts | Accounts previously banned for hateful conduct violations returned to the platform with clean enforcement records |
| Q2–Q3 2023 | Community Notes expansion as moderation alternative | Crowd-sourced context labels partially replaced enforcement actions for borderline hateful content |
| Q4 2023 | "Freedom of speech, not freedom of reach" policy formalized | Shifted from content removal to visibility reduction for certain categories of hateful content |
| 2024 | Reduced transparency reporting cadence | Last comprehensive transparency report published late 2024; independent monitoring becomes primary data source |
| Q1–Q2 2025 | Partial Trust & Safety team rebuilding | Hired specialized contractors for content review; improved automated detection models for hate speech |
| Q3 2025 | New advertiser brand safety controls launched | Adjacency Controls and expanded sensitivity categories introduced to address advertiser concerns |
| Q1 2026 | Coded language and dogwhistle detection added | Automated systems updated to detect coded hate speech; enforcement scope expanded |

The practical effect of these changes is a moderation system that differs structurally both from its pre-acquisition predecessor and from peer platforms. Key characteristics of the current system include:

  • Automation-first enforcement: The vast majority of hateful content actioning is performed by automated systems rather than human reviewers, which improves speed but increases false negative and false positive rates
  • Visibility reduction over removal: X increasingly applies "freedom of speech, not freedom of reach" — reducing the algorithmic distribution of borderline hateful content rather than removing it entirely, which means the content remains accessible via direct links and profile visits
  • Community Notes as moderation supplement: Community Notes provides contextual corrections to misleading or hateful content, but does not constitute enforcement action — hateful content with a Community Note attached is still visible and still represents a brand safety risk for adjacent advertising
  • Inconsistent enforcement across regions: With fewer region-specific moderators, enforcement consistency varies significantly by language and region, with English-language content receiving the most consistent moderation
"The shift from removal to visibility reduction fundamentally changes the brand safety calculation for advertisers. Content that would have been removed on other platforms — or on the pre-acquisition Twitter — now exists on X in a reduced-visibility state, and automated ad placement systems may still serve ads alongside it."

Hateful Conduct Categories & Enforcement Tiers

Enforcement of X's 2026 hateful conduct policy operates on a tiered system that classifies violations by severity and applies progressively stricter penalties for repeat violations. Understanding these tiers is critical for assessing the platform's actual moderation posture.

Violation Severity Levels

| Severity Tier | Content Type | First Violation | Second Violation | Third+ Violation |
| --- | --- | --- | --- | --- |
| Tier 1 — Critical | Direct threats of violence, calls for genocide, terrorist content | Immediate content removal + account suspension | Permanent suspension | N/A — permanent on first or second offense |
| Tier 2 — Severe | Dehumanization, hateful imagery, incitement to harassment | Content removal + 12-hour account lock | Content removal + 7-day suspension | Permanent suspension |
| Tier 3 — Moderate | Slurs, harmful stereotypes, targeted harassment based on protected characteristics | Content labeled + visibility reduced | Content removal + 24-hour lock | Content removal + 7-day suspension, escalating to permanent |
| Tier 4 — Low | Coded language, context-dependent hate speech, borderline content | Visibility reduction only (no label) | Content labeled + visibility reduced | Content removal on repeated pattern |
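The escalation logic in the table above can be sketched as a simple per-tier lookup, where each repeat violation walks one step down a ladder of penalties. This is an illustrative model of the published tiers, not X's actual enforcement code; the tier and action strings are paraphrased from the table.

```python
# Illustrative model of X's tiered enforcement ladder as described in the
# table above. Not X's real system - a sketch for reasoning about escalation.
ENFORCEMENT_LADDER = {
    1: ["content removal + account suspension", "permanent suspension"],
    2: ["content removal + 12-hour lock", "content removal + 7-day suspension",
        "permanent suspension"],
    3: ["label + visibility reduction", "content removal + 24-hour lock",
        "content removal + 7-day suspension"],
    4: ["visibility reduction (no label)", "label + visibility reduction",
        "content removal on repeated pattern"],
}

def enforcement_action(tier: int, violation_number: int) -> str:
    """Return the penalty for the nth violation at a given severity tier.

    Violations beyond the listed steps repeat the final (most severe) step.
    """
    ladder = ENFORCEMENT_LADDER[tier]
    index = min(violation_number - 1, len(ladder) - 1)
    return ladder[index]

print(enforcement_action(3, 1))  # label + visibility reduction
print(enforcement_action(1, 5))  # permanent suspension
```

Note how Tier 3 and Tier 4 first offenses never leave the platform: the model returns a visibility action, not a removal, which is exactly the adjacency exposure discussed later in this guide.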

Protected Categories and Enforcement Priority

While X's policy formally protects all listed categories equally, enforcement data from independent monitoring organizations indicates that enforcement rates vary by category:

  • Highest enforcement rate: Direct racial slurs, antisemitic content, explicit threats of violence — these categories trigger automated detection most reliably
  • Moderate enforcement rate: Homophobic and transphobic content, religious hatred, disability-based harassment — these categories show more inconsistent automated detection and rely more heavily on user reports
  • Lowest enforcement rate: Coded language, dogwhistles, context-dependent hate speech, misgendering, deadnaming — these categories are the most difficult for automated systems to detect accurately and receive the least consistent enforcement

For advertisers, the enforcement tier system has a direct implication: Tier 3 and Tier 4 content — which receives visibility reduction rather than removal — remains on the platform and can appear in ad adjacency contexts. This is the primary mechanism through which hateful content creates brand safety risk for advertisers, even when X's moderation systems are functioning as designed.

Advertiser Impact — Brand Safety & Ad Adjacency Risks

The practical impact of X's hateful conduct policy and enforcement approach on advertisers is measurable and significant. Multiple independent brand safety audits conducted in 2025 and early 2026 provide data on the scope of the problem.

Key Brand Safety Metrics for X (2025–2026)

| Metric | X (Twitter) | Meta (Facebook/Instagram) | TikTok | YouTube |
| --- | --- | --- | --- | --- |
| Hateful content adjacency rate (ads appearing next to flagged content) | 3.2% | 0.8% | 1.1% | 0.6% |
| Average time to remove reported hateful content | 36–72 hours | 6–12 hours | 12–24 hours | 8–16 hours |
| Brand safety tool granularity (1–10 scale) | 5 | 8 | 7 | 9 |
| Third-party verification integration | IAS, DoubleVerify | IAS, DoubleVerify, MOAT, Zefr | IAS, DoubleVerify, Zefr | IAS, DoubleVerify, MOAT, Zefr |
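To make the adjacency-rate gap concrete, the audited rates in the table above can be converted into expected flagged-adjacent impressions at a given delivery volume. This is plain arithmetic on the table's figures; actual rates for any specific campaign will vary.

```python
# Expected flagged-adjacent impressions per platform, using the audited
# adjacency rates from the table above. Illustrative arithmetic only.
ADJACENCY_RATE = {"X": 0.032, "Meta": 0.008, "TikTok": 0.011, "YouTube": 0.006}

def expected_adjacent_impressions(platform: str, impressions: int) -> int:
    """Impressions expected to land next to flagged hateful content."""
    return round(impressions * ADJACENCY_RATE[platform])

for platform, rate in ADJACENCY_RATE.items():
    print(platform, expected_adjacent_impressions(platform, 1_000_000))
```

At one million impressions, X's 3.2% rate yields roughly 32,000 flagged-adjacent impressions, about four to five times the exposure of the same buy on Meta or YouTube.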

How Hateful Content Affects Ad Placements

On X, ads can appear adjacent to hateful content through several mechanisms:

  • Timeline adjacency: Promoted posts appear in a user's timeline feed alongside organic content. If a user follows or engages with accounts that post hateful content, ads served in that user's timeline may appear directly above or below hateful posts.
  • Reply thread adjacency: Ads served within conversation threads can appear alongside hateful replies, even when the original post is benign. Reply threads on controversial topics frequently attract hateful content that ads may be placed adjacent to.
  • Trending topic adjacency: Ads associated with trending topics or hashtags can appear alongside hateful content that uses the same trending terms. This is particularly risky during politically charged events or social controversies.
  • Search result adjacency: Promoted search results can appear alongside organic search results that include hateful content matching the same search terms.
  • Profile visit adjacency: Ads served on user profiles can appear alongside profile content — including posts, replies, and media — that contains hateful material. This is the least predictable and hardest-to-control adjacency risk.

The financial impact of brand safety incidents on X has driven significant advertiser behavior changes. Major advertisers including Apple, Disney, IBM, Comcast, and Lionsgate paused or reduced X advertising spend between 2023 and 2025, citing brand safety concerns. While some advertisers have returned as X introduced new brand safety controls, overall advertising revenue on X remains below pre-acquisition levels. For brands that continue to advertise on X, the cost of brand safety management — including third-party verification, manual monitoring, and crisis response protocols — adds measurable overhead to campaign operations.

X's Content Moderation Tools for Advertisers

X has developed a suite of advertiser-facing brand safety tools designed to mitigate the impact of hateful and harmful content on ad placements. These tools have improved substantially since 2024, though they still lag behind the controls offered by Meta, YouTube, and TikTok in terms of granularity and integration.

Available Brand Safety Controls

| Tool | Function | Availability | Effectiveness Rating |
| --- | --- | --- | --- |
| Sensitivity Categories | Block ad adjacency with content classified as hate speech, violence, adult content, drugs, profanity, spam, or debatable social topics | All advertisers | Moderate — dependent on X's content classification accuracy |
| Keyword Exclusions | Prevent ads from appearing alongside posts containing specified keywords (up to 1,000 per campaign) | All advertisers | Moderate to High — effective for known terms, cannot catch novel or coded language |
| Adjacency Controls | Set strict, moderate, or relaxed tolerance for content adjacency across placements | All advertisers (introduced Q3 2025) | Moderate — "strict" setting reduces reach significantly but improves safety |
| Third-Party Verification (IAS/DoubleVerify) | Pre-bid and post-bid brand safety measurement and filtering through independent partners | Managed accounts and self-serve (additional cost) | High — most reliable layer of protection, but adds cost |
| Curated Placements (Premium Video / Amplify) | Limit ad delivery to verified publisher content within X's video ecosystem | Managed accounts with minimum spend thresholds | High — significantly reduces adjacency risk but limits reach and increases CPM |
| Blocklists | Block specific accounts from appearing adjacent to your ads | All advertisers | Low to Moderate — reactive tool that requires manual maintenance |

Limitations of Current Tools

While these tools provide meaningful protection, advertisers should be aware of their limitations:

  • Content classification accuracy: X's automated content classification system — which powers the sensitivity category controls — has reported accuracy rates of approximately 85-90% for explicit hateful content but significantly lower rates for coded language, sarcasm, and context-dependent hate speech. This means 10-15% of hateful content may not be correctly classified and will bypass sensitivity filters.
  • Keyword exclusion gaps: Keyword exclusions are text-based and do not apply to hateful content conveyed through images, videos, or emojis. They also cannot catch novel slurs or coded terms that emerge rapidly during social controversies.
  • Reach-safety tradeoff: Setting adjacency controls to "strict" can reduce campaign reach by 30-50%, significantly impacting campaign economics. Advertisers must balance brand safety requirements against performance objectives.
  • Placement coverage gaps: Brand safety controls have the weakest coverage in reply threads, trending topics, and search results — precisely the placements where hateful content adjacency risk is highest.
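Taken together, the classification-accuracy gap and X's baseline adjacency rate imply residual risk even with sensitivity filters enabled. The sketch below multiplies the figures already cited in this guide (a 3.2% audited adjacency rate and 85–90% classifier accuracy) to estimate impressions that slip past the filter; it is a back-of-envelope model, not a measured result.

```python
# Residual adjacency risk after sensitivity filtering, assuming the filter
# only blocks what the classifier correctly labels. Inputs are the figures
# reported in this guide, not measured values.
BASE_ADJACENCY_RATE = 0.032  # X's audited adjacency rate (see metrics table)

def residual_adjacency(classifier_accuracy: float) -> float:
    """Fraction of impressions still adjacent to hateful content after
    filtering: the base rate times the classifier's miss rate."""
    miss_rate = 1.0 - classifier_accuracy
    return BASE_ADJACENCY_RATE * miss_rate

for accuracy in (0.85, 0.90):
    per_million = residual_adjacency(accuracy) * 1_000_000
    print(f"{accuracy:.0%} accuracy -> ~{per_million:.0f} unsafe impressions per million")
```

Even at the optimistic 90% accuracy figure, roughly 3,200 of every million impressions remain adjacent to unclassified hateful content, which is why third-party verification is recommended as an additional layer.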

For a comprehensive assessment of your current X campaign brand safety posture, use our AI Compliance Audit tool which evaluates your campaign settings against best-practice brand safety configurations.

Comparison with Other Platforms — How X's Hateful Conduct Policy Compares

To provide a complete compliance perspective, it is essential to understand how the X platform hateful conduct policy 2026 compares with the equivalent policies on Meta, TikTok, and YouTube. This comparison helps advertisers calibrate their brand safety strategies across multi-platform campaigns.

Hateful Conduct Policy Comparison — Major Platforms (2026)

| Dimension | X (Twitter) | Meta (Facebook/Instagram) | TikTok | YouTube |
| --- | --- | --- | --- | --- |
| Protected categories | 11 categories (race, ethnicity, national origin, caste, sexual orientation, gender, gender identity, religion, age, disability, serious disease) | 13 categories (adds immigration status and social class) | 12 categories (adds regional identity) | 11 categories (similar to X) |
| Enforcement approach | Automation-first; visibility reduction for moderate violations; Community Notes supplement | Hybrid human + AI review; content removal primary action; independent Oversight Board | Hybrid human + AI; content removal primary; region-specific review teams | Hybrid human + AI; content removal + channel strikes; Trusted Flagger program |
| Transparency reporting | Irregular (last comprehensive report: late 2024) | Quarterly Community Standards Enforcement Report | Quarterly Transparency Report | Quarterly Community Guidelines Enforcement Report |
| Advertiser brand safety controls | Sensitivity categories, keyword exclusions, adjacency controls, third-party verification (IAS, DV) | Inventory Filter (3 tiers), topic exclusions, blocklists, third-party verification (IAS, DV, MOAT, Zefr) | Brand Safety Hub, content exclusions, third-party verification (IAS, DV, Zefr), TikTok Pulse for premium adjacency | Content suitability settings, topic exclusions, placement exclusions, third-party verification (IAS, DV, MOAT, Zefr), YouTube Select for premium |
| GARM status | Withdrew from GARM framework in 2023; no current GARM alignment | Active GARM participant; aligned with GARM Brand Safety Floor + Suitability Framework | Active GARM participant | Active GARM participant |
"The single most significant differentiator between X and its peer platforms is X's withdrawal from the GARM framework. GARM provides the industry-standard taxonomy for brand safety and suitability that enables consistent cross-platform measurement. Without GARM alignment, advertisers cannot apply a unified brand safety standard across campaigns that include X alongside Meta, TikTok, or YouTube."

For advertisers running multi-platform campaigns, this comparison underscores the need for platform-specific brand safety configurations rather than a one-size-fits-all approach. Track all platform policy changes in a single view using our Policy Change Tracker.

Compliance Recommendations for Brands Advertising on X in 2026

Based on the current state of the X platform hateful conduct policy 2026, enforcement dynamics, and available brand safety tools, the following recommendations represent best-practice compliance posture for advertisers on the platform.

Immediate Actions (Implement Within 1 Week)

  • Enable all sensitivity category exclusions: At minimum, block hate speech, violence, and debatable social topics. Consider blocking all categories unless performance data justifies a more permissive approach.
  • Set adjacency controls to "strict": Accept the reach reduction as the cost of brand safety. Optimize campaign budgets around the reduced but safer inventory pool.
  • Build keyword exclusion lists: Create comprehensive exclusion lists covering known slurs, hate speech terms, and politically charged terms relevant to your brand. Update these lists monthly and during significant social/political events.
  • Deploy third-party verification: Implement IAS or DoubleVerify brand safety measurement on all X campaigns. Use pre-bid filtering where available to prevent unsafe adjacency before it occurs.
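The keyword-exclusion step above amounts to a token match against a maintained list, which also makes its blind spots obvious: anything conveyed through images, emoji, or novel coded spellings never matches. A minimal sketch, using placeholder terms since real exclusion lists contain slurs:

```python
# Minimal sketch of a text-based keyword exclusion check, as described in
# the recommendations above. Real exclusion runs inside X Ads Manager;
# this just illustrates the mechanism and its text-only limitation.
import re

EXCLUSION_LIST = {"exampleslur", "chargedterm"}  # placeholder terms

def violates_exclusions(post_text: str, exclusions: set[str]) -> bool:
    """True if any normalized token in the post appears on the list."""
    tokens = re.findall(r"[a-z0-9]+", post_text.lower())
    return any(token in exclusions for token in tokens)

print(violates_exclusions("Totally benign post", EXCLUSION_LIST))        # False
print(violates_exclusions("contains exampleslur here", EXCLUSION_LIST))  # True
```

Note that a hateful image with a benign caption, or a coded respelling not yet on the list, returns `False`, which is precisely the gap third-party verification and sensitivity categories are meant to cover.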

Ongoing Operations (Monthly Cadence)

  • Conduct post-campaign brand safety audits: Review third-party brand safety reports monthly. Document any incidents and track trends over time to identify whether your brand safety posture is improving or degrading.
  • Update keyword exclusion lists: Language evolves rapidly on social platforms. New slurs, coded terms, and dogwhistles emerge regularly. Subscribe to brand safety intelligence feeds from IAS, DoubleVerify, or GARM to keep exclusion lists current.
  • Monitor X policy changes: X updates its policies without consistent advance notice. Monitor X's Safety blog, policy documentation, and independent reporting for changes that may affect your campaign compliance posture.
  • Benchmark against other platforms: Compare your X brand safety metrics against your Meta, TikTok, and YouTube campaigns to maintain a clear picture of relative risk across your media portfolio.
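The monthly audit step above reduces to trend detection on your measured adjacency rates. One simple approach compares the latest month against a trailing average; the ±10% thresholds below are arbitrary placeholders to tune against your own risk tolerance, and the input rates would come from your IAS or DoubleVerify reports.

```python
# Classify the latest month's adjacency rate against a trailing average,
# as a simple implementation of the monthly-audit trend check above.
# Thresholds (+/-10%) are illustrative placeholders.
def posture_trend(monthly_rates: list[float], window: int = 3) -> str:
    """Return 'improving', 'degrading', or 'stable' for the latest month
    relative to the trailing-window average of prior months."""
    if len(monthly_rates) <= window:
        return "insufficient data"
    latest = monthly_rates[-1]
    baseline = sum(monthly_rates[-window - 1:-1]) / window
    if latest < baseline * 0.9:
        return "improving"
    if latest > baseline * 1.1:
        return "degrading"
    return "stable"

print(posture_trend([0.031, 0.033, 0.030, 0.044]))  # degrading
```

A "degrading" result is the signal to tighten adjacency controls, refresh exclusion lists, or escalate per your crisis protocol before the next flight launches.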

Strategic Considerations

  • Evaluate curated placement options: X Premium Video and Amplify programs offer the lowest brand safety risk on the platform. If your budget supports the higher CPMs, these placements provide the most controlled environment for brand advertising on X.
  • Develop a crisis response protocol: Have a documented process for responding to brand safety incidents on X, including escalation paths, spokesperson protocols, and campaign pause triggers. Do not wait for an incident to develop this protocol.
  • Maintain a risk-adjusted budget allocation: Factor brand safety management costs — third-party verification fees, reduced reach from strict controls, monitoring labor — into your X campaign budgets. If these costs make X economics uncompetitive with other platforms, reallocate accordingly.
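The risk-adjusted budgeting point above can be made concrete with an effective-CPM calculation that folds verification fees and strict-mode reach loss into one number comparable across platforms. All inputs below are hypothetical placeholders, not X rate-card figures.

```python
# Effective CPM after brand safety overhead: verification fees raise the
# per-thousand cost, and reach lost to strict controls means buying more
# inventory to hit the same impression goal (cost scales by 1/(1 - loss)).
# All figures are hypothetical examples.
def effective_cpm(base_cpm: float, reach_reduction: float,
                  verification_fee_cpm: float) -> float:
    """Risk-adjusted CPM for like-for-like cross-platform comparison."""
    return (base_cpm + verification_fee_cpm) / (1.0 - reach_reduction)

# e.g. a $6.00 CPM with a $0.10 verification fee and 40% reach loss:
print(round(effective_cpm(6.00, 0.40, 0.10), 2))  # 10.17
```

If this risk-adjusted CPM exceeds what a comparable audience costs on a platform with lower safety overhead, that is the quantitative trigger for the reallocation decision described above.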

For a detailed, automated assessment of your current compliance posture across all active campaigns, use our AI Compliance Audit tool.


