Governed, performant, deployable anywhere
Deploy in your environment, manage policies centrally, audit every request. No shadow AI, no data leakage.
SOC2 Type II, HIPAA-ready architecture, GDPR DPA. Evidence export for audits, BAA available.
Reduce LLM costs by 30–50%, cut p95 latency by 60%, and track spend per department or project in real time.
| Criterion | SaaS (Dnyana-managed) | Private VPC (Your AWS/Azure/GCP) | On-Premises (Your datacenter) |
|---|---|---|---|
| Time to Deploy | < 1 day (API keys ready) | 3–5 days (Terraform/Helm) | 2–4 weeks (dependencies, air gap) |
| Data Residency | US/EU/APAC regions | Your chosen region | Your datacenter |
| Network Isolation | Shared tenancy, encrypted | Private VPC, your firewall rules | Air-gapped / private network |
| Ops Responsibility | Dnyana (24/7 SRE) | Shared (Dnyana support + your ops) | Your team (Dnyana assists) |
| Compliance Posture | SOC2, GDPR-ready, BAA available | Inherit your VPC compliance + ours | Full control for HIPAA, PCI, FedRAMP |
| Upgrade Cadence | Weekly (automated) | Monthly (coordinated) | Quarterly (scheduled with you) |
| Cost Model | Token-based + SaaS fee | Token-based + infra passthrough | Annual license + token-based |
Recommendation: Start with SaaS for the pilot, then migrate to Private VPC or On-Premises for production if your security or compliance posture requires it.
| Model | Risk Tier | Use Cases | Restrictions | Approval |
|---|---|---|---|---|
| GPT-4o-mini, Claude 3.5 Haiku | Low | General Q&A, summaries, tagging | No PII, 30-day retention | Auto-approved |
| GPT-4o, Claude 3.5 Sonnet | Medium | Analysis, code review, reports | PII redaction required | Manager approval |
| o1, o1-pro (reasoning) | High | Strategic planning, research | Zero retention, audit all | CISO approval |
| Fine-tuned / custom models | Restricted | Special projects only | Case-by-case review | Security committee |
User requests access to a restricted model via the UI
Manager/CISO receives a notification with context
Decision is logged, and access is granted or denied
All usage is tracked in an immutable log
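The approval flow above can be sketched in a few lines of Python. This is a minimal illustration, not Dnyana's implementation: the `APPROVER_BY_TIER` mapping mirrors the Approval column of the model table, while `AccessLog`, `decide`, and the role strings are hypothetical names, and the in-memory list only stands in for a real immutable log.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone

# Risk tier -> role that must approve (None means auto-approved).
# Values follow the Approval column of the model table; the string
# identifiers themselves are assumptions made for this sketch.
APPROVER_BY_TIER = {
    "low": None,
    "medium": "manager",
    "high": "ciso",
    "restricted": "security_committee",
}


@dataclass
class AccessLog:
    """Append-only list standing in for an immutable log (hypothetical)."""
    entries: list = field(default_factory=list)

    def record(self, **event):
        event["timestamp"] = datetime.now(timezone.utc).isoformat()
        self.entries.append(event)


def decide(log, user, model, tier, approved_by=None):
    """Grant access when the tier is auto-approved or the required role approved.

    Every decision, granted or denied, is written to the log.
    """
    needed = APPROVER_BY_TIER[tier.lower()]
    granted = needed is None or approved_by == needed
    log.record(user=user, model=model, tier=tier, approver=needed, granted=granted)
    return granted
```

A Medium-tier request without manager approval is denied and still logged, so denials stay auditable alongside grants.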
{
  "request_id": "req_7f3a9b2c",
  "timestamp": "2025-10-27T14:32:18Z",
  "user": "alice@company.com",
  "org": "acme-corp",
  "model": "gpt-4o",
  "input_tokens": 245,
  "output_tokens": 512,
  "cost_usd": 0.0189,
  "latency_ms": 1240,
  "status": "success",
  "pii_redacted": true,
  "policy_checks": ["approved_model", "no_phi"]
}
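Records shaped like the sample above are straightforward to roll up into spend reports. The following sketch assumes each record arrives as a JSON string; the field names come from the sample, while `parse_record`, `spend_by`, and the choice of required fields are assumptions made for illustration.

```python
import json
from collections import defaultdict

# Fields this sketch expects in every record (taken from the sample log entry).
REQUIRED_FIELDS = {
    "request_id", "timestamp", "user", "org", "model",
    "input_tokens", "output_tokens", "cost_usd", "latency_ms", "status",
}

SAMPLE = """
{
  "request_id": "req_7f3a9b2c",
  "timestamp": "2025-10-27T14:32:18Z",
  "user": "alice@company.com",
  "org": "acme-corp",
  "model": "gpt-4o",
  "input_tokens": 245,
  "output_tokens": 512,
  "cost_usd": 0.0189,
  "latency_ms": 1240,
  "status": "success",
  "pii_redacted": true,
  "policy_checks": ["approved_model", "no_phi"]
}
"""


def parse_record(text):
    """Parse one audit record and reject it if expected fields are missing."""
    record = json.loads(text)
    missing = REQUIRED_FIELDS - record.keys()
    if missing:
        raise ValueError(f"audit record missing fields: {sorted(missing)}")
    return record


def spend_by(records, key):
    """Sum cost_usd grouped by any record field, e.g. 'user', 'org', 'model'."""
    totals = defaultdict(float)
    for record in records:
        totals[record[key]] += record["cost_usd"]
    return dict(totals)


record = parse_record(SAMPLE)
per_model = spend_by([record], "model")
```

Grouping by `org` or `user` instead of `model` gives the per-department or per-person view, provided those fields carry the grouping you need.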
| Scenario | Direct LLM API | Dnyana.cloud | Savings |
|---|---|---|---|
| Mixed workload (GPT-4o + mini) | $2,500 | $1,650 | -$850 (34%) |
| + Dnyana platform fee | — | $500 | — |
| Total | $2,500 | $2,150 | -$350 (14%) |
*Savings increase with volume; typical enterprise customers save 30–50% at scale thanks to caching and smart routing.
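The percentages in the comparison table follow from simple arithmetic, reproduced here so the figures can be rerun with your own numbers. The break-even helper rests on an assumption not stated in the table: that the usage reduction stays at a fixed percentage regardless of volume (the footnote suggests it actually grows). Function names are illustrative.

```python
def savings(direct_usd, dnyana_usd):
    """Return (absolute savings, savings as a whole-number percent of direct cost)."""
    saved = direct_usd - dnyana_usd
    return saved, round(100 * saved / direct_usd)


def break_even_direct_spend(fee_usd, reduction_pct):
    """Direct spend at which a fixed-percentage reduction exactly covers the fee.

    Assumes the reduction percentage is constant across volumes.
    """
    return fee_usd * 100 / reduction_pct


usage_only = savings(2500, 1650)       # (850, 34): usage costs alone
with_fee = savings(2500, 1650 + 500)   # (350, 14): after the $500 platform fee
threshold = break_even_direct_spend(500, 34)  # roughly $1,470 of direct spend
```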
Supports SAML 2.0, OAuth 2.0, and OIDC, with SCIM provisioning for automatic user sync.
| Role | Permissions | Typical User |
|---|---|---|
| Viewer | Read logs, view dashboards | Auditors, analysts |
| User | Call API, use approved models | Developers, end users |
| Manager | Approve exceptions, manage quotas | Team leads, PMs |
| Admin | Configure policies, manage users | Platform team, SRE |
| Owner | All permissions, billing | CTO, CISO |
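A permission check over the roles above might look like the sketch below. The permission identifiers are invented for illustration, and because the table does not say whether roles are cumulative, each role is given only the permissions in its own row, with Owner receiving everything.

```python
# Permissions per role, one row of the table each. Identifier strings are
# hypothetical; whether higher roles inherit lower ones is not stated in the
# table, so no inheritance is modeled here except for Owner.
ROLE_PERMISSIONS = {
    "viewer": {"read_logs", "view_dashboards"},
    "user": {"call_api", "use_approved_models"},
    "manager": {"approve_exceptions", "manage_quotas"},
    "admin": {"configure_policies", "manage_users"},
}

# Owner: all permissions listed above, plus billing.
ROLE_PERMISSIONS["owner"] = set().union(*ROLE_PERMISSIONS.values()) | {"billing"}


def can(role, permission):
    """Return True if the role holds the permission; unknown roles hold none."""
    return permission in ROLE_PERMISSIONS.get(role.lower(), set())
```

Unknown roles fall through to an empty permission set, so the check denies by default.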
US: us-east-1 (Virginia), us-west-2 (Oregon)
EU: eu-central-1 (Frankfurt), eu-west-1 (Dublin)
APAC: ap-southeast-1 (Singapore), ap-northeast-1 (Tokyo)
On-Premises: Your datacenter (air-gapped or VPN)
Data sovereignty: All request data, logs, and configs stay in your chosen region. No cross-border transfers without explicit consent.
| Data Type | Default Retention | Options | Deletion |
|---|---|---|---|
| Request prompts | 30 days | 0 / 7 / 30 / 90 / 365 days | Auto-purge + manual |
| LLM responses | 30 days | 0 / 7 / 30 / 90 / 365 days | Auto-purge + manual |
| Audit logs | 365 days | 90 / 365 / 2555 days (7 yrs) | Manual only (compliance) |
| Metrics / analytics | 90 days | 30 / 90 / 365 days | Auto-aggregate |
| User PII | Until deleted | GDPR: right to deletion | Immediate on request |
User-initiated: Self-service deletion via UI/API → data purged within 24 hours → confirmation email
GDPR request: Submit request → identity verification → deletion within 30 days → certificate of deletion
Automated: Retention policy expires → data auto-purged → logged in audit trail
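The automated path above (retention expires, data is purged) reduces to a date comparison. The sketch below uses the default retention periods from the table; the data-type keys and function names are hypothetical, and only prompts and responses are treated as auto-purgeable because the table marks audit logs as manual-only and metrics as auto-aggregated.

```python
from datetime import datetime, timedelta, timezone

# Default retention in days, from the retention table. Keys are illustrative.
DEFAULT_RETENTION_DAYS = {
    "request_prompts": 30,
    "llm_responses": 30,
    "audit_logs": 365,
    "metrics": 90,
}

# Only these types list "Auto-purge" in the Deletion column.
AUTO_PURGE_TYPES = {"request_prompts", "llm_responses"}


def is_expired(data_type, created_at, now, retention_days=None):
    """True once the item is at least as old as its retention period."""
    days = DEFAULT_RETENTION_DAYS[data_type] if retention_days is None else retention_days
    return now - created_at >= timedelta(days=days)


def should_auto_purge(data_type, created_at, now, retention_days=None):
    """Expired items are purged automatically only for auto-purge data types."""
    return data_type in AUTO_PURGE_TYPES and is_expired(
        data_type, created_at, now, retention_days
    )


now = datetime(2025, 10, 27, tzinfo=timezone.utc)
old_prompt = now - timedelta(days=31)
purge_default = should_auto_purge("request_prompts", old_prompt, now)
purge_extended = should_auto_purge("request_prompts", old_prompt, now, retention_days=90)
```

With a retention of 0 days, an item counts as expired the moment it is created, which matches the zero-retention option in the table.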
| Tier | Base Fee | Token Pricing | Support | SLA |
|---|---|---|---|---|
| Pilot | $500/month | 20% markup on LLM costs | Email, 24-hour response | 99.5% uptime |
| Professional | $2,500/month | 15% markup on LLM costs | Dedicated CSM, 4-hour response | 99.9% uptime |
| Enterprise | Custom (typically $10K+/month) | 10% markup + volume discounts | TAM, 1-hour response, on-call | 99.95% uptime + credits |
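To compare tiers for a given workload, the pricing table can be turned into a small calculator. The sketch assumes the markup is added on top of pass-through LLM costs, so the bill is the base fee plus LLM cost times (1 + markup); the table does not spell out the formula. Enterprise is omitted because its base fee and discounts are custom.

```python
# (base fee in USD per month, markup in percent), from the pricing table.
TIERS = {
    "pilot": (500, 20),
    "professional": (2500, 15),
}


def monthly_bill(tier, llm_cost_usd):
    """Base fee plus LLM cost with the tier's markup applied (assumed formula)."""
    base, markup_pct = TIERS[tier]
    return base + llm_cost_usd * (100 + markup_pct) / 100


def crossover_llm_cost(tier_a, tier_b):
    """LLM spend at which both tiers cost the same under the assumed formula."""
    base_a, markup_a = TIERS[tier_a]
    base_b, markup_b = TIERS[tier_b]
    return (base_b - base_a) * 100 / (markup_a - markup_b)


pilot_bill = monthly_bill("pilot", 1000)                 # 500 + 1,200
crossover = crossover_llm_cost("pilot", "professional")  # 2,000 / 0.05
```

Under this reading, Professional only becomes cheaper than Pilot on price alone at about $40,000 of monthly LLM spend; below that, the case for Professional rests on support and SLA.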
| Metric | Professional | Enterprise |
|---|---|---|
| Uptime | 99.9% (43 min/month) | 99.95% (22 min/month) |
| API Latency (p95) | < 500ms | < 350ms |
| Support Response | 4 hours (business) | 1 hour (24/7) |
| Credits (breach) | 10% monthly fee | 25% monthly fee |
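The downtime figures in the SLA table can be rederived from the uptime percentages, assuming a 30-day month. The credit helper is a sketch under a further assumption: that the credit is triggered when measured uptime falls below the target, which the table implies but does not define.

```python
MINUTES_PER_MONTH = 30 * 24 * 60  # 43,200, assuming a 30-day month


def downtime_budget_minutes(uptime_pct):
    """Minutes of downtime per month permitted at the given uptime target."""
    return MINUTES_PER_MONTH * (100 - uptime_pct) / 100


def sla_credit(monthly_fee_usd, uptime_target_pct, measured_uptime_pct, credit_pct):
    """Credit owed when measured uptime misses the target (assumed trigger)."""
    if measured_uptime_pct < uptime_target_pct:
        return monthly_fee_usd * credit_pct / 100
    return 0.0


professional_budget = round(downtime_budget_minutes(99.9))   # 43
enterprise_budget = round(downtime_budget_minutes(99.95))    # 22
credit = sla_credit(2500, 99.9, 99.8, 10)                    # 250.0
```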
Industry: Financial Services | Size: 15,000 employees | Region: Global
"Dnyana.cloud gave us the control and visibility our CISO needed, while unblocking our teams. We passed our SOC2 audit with zero findings related to LLM usage." — VP of Information Security
Share architecture docs, SOC2 report (under NDA), complete security questionnaire
Provision SaaS sandbox, onboard 5–10 users, provide API keys and documentation
Integrate use cases, configure policies, measure results, get CISO sign-off
Finalize commercial terms, execute MSA/DPA/BAA, plan production deployment