OCNexus is an autonomous development pipeline. Every request is classified by a safety layer before any code is written or any LLM tokens are spent. This page documents what OCNexus will and won't build, and how the classifier works.
Requests in these categories are always blocked. The user receives a specific decline message explaining why and what to do if the block was incorrect.
competitive_product Building an AI-powered development pipeline that automates issue-to-PR workflows — OCNexus's core product.
illegal_activity Systems designed to facilitate fraud, unauthorized access, data theft, or any criminal activity.
ai_policy_violation Tools designed to circumvent Anthropic, OpenAI, or other AI provider usage policies.
malware Malicious code, vulnerability exploits, ransomware, spyware.
deceptive_systems Applications designed to deceive users about their nature or harvest data without disclosure.
destructive_operations Requests to delete production data, destroy infrastructure, or cause irreversible harm.
Requests in these categories are not blocked but require human approval before the pipeline runs. The user sees a gate message and must explicitly approve.
high_risk_financial Changes to payment processing, payout logic, or subscription billing in production.
auth_system Modifications to authentication, authorization, or session management.
data_export Any operation that exports or permanently deletes significant user data.
production_infrastructure Railway, Supabase, DNS, or SSL changes in production environment.
The safety classifier is a lightweight Claude pass that runs before the main pipeline. It takes less than 2 seconds and costs a fraction of a cent per classification.
User submits an issue or chat message.
Claude classifies the request against prohibited and restricted categories. The request text is hashed (SHA-256) and logged.
Safe: Pipeline runs normally. Restricted: Human gate required. Prohibited: Request blocked with decline message.
If an account triggers 5+ blocks in 24 hours, the account is flagged for manual review.
If you believe a request was incorrectly blocked:
We review appeals within 24 hours and update the classifier if the block was incorrect.
All safety classifications are logged and visible in the OCNexus dashboard under Safety → Recent Logs. Each log entry includes:
We do not store the full text of classified requests. Only the hash is retained for audit purposes.