Product Strategy (designed)

AI Cloud is a secure self-service GPU platform for discovering capacity, provisioning compute, accessing running allocations, monitoring usage, and paying based on consumption. The product direction is to evolve from GPU node self-service into an app platform with clear developer, operator, and security surfaces.

Product Goals

| Goal | Meaning |
| --- | --- |
| Fast time-to-compute | Users should get from intent to usable GPU capacity in minutes |
| Transparent usage and billing | Users and operators can understand cost, credits, burn rate, and billing outcomes |
| Safe multi-user operation | Tenant/project scope, roles, audit, policy, and isolation are first-class |
| Operator-friendly control | Admin and ops teams can manage inventory, releases, incidents, and evidence |
| App platform path | Developers can package apps and use GPUaaS as the runtime/product shell |

Personas

| Persona | Primary need | Portal path |
| --- | --- | --- |
| End user | Launch and operate GPU-backed workloads | Use AI Cloud |
| Tenant/customer admin | Manage projects, access, usage, and billing posture | Use AI Cloud |
| Platform admin | Manage nodes, users, allocations, audit, payments, and operational posture | Operators |
| App developer | Build, package, test, and promote apps on AI Cloud | Build on AI Cloud |
| Security reviewer | Understand controls, gaps, evidence, and release discipline | Security & Production Readiness |
| Architect/engineer | Understand system boundaries, contracts, and implementation model | Architecture |

Current Product Scope

| Area | Included now |
| --- | --- |
| Auth and access | OIDC-backed auth, role-aware authorization, tenant/project direction, token refresh/logout patterns |
| Catalog and inventory | SKU/catalog availability, node inventory, admin node workflows |
| Allocation lifecycle | Request, provisioning, active, releasing, released, failed, release-failed states |
| Runtime access | Browser terminal, SSH/access posture, terminal token/session flow |
| Billing and payments | Usage accrual, immutable ledger, balance warnings, Stripe checkout/webhooks, payment sessions |
| Storage | Object-storage-backed user storage operations and path safety |
| Admin/Ops | Users, nodes, allocations, audit, payment sessions, operational telemetry direction |
| App platform | App catalog, manifest, SDK, artifact trust, and promotion path direction |
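
The allocation lifecycle states listed in the table above can be read as a small state machine. The sketch below is illustrative only: the state names come from the table, but the transition map, the `AllocationState` enum, and the `can_transition` and `advance` helpers are hypothetical and are not taken from the AI Cloud contracts. In particular, the retry edge from release-failed back to releasing is an assumption.

```python
from enum import Enum


class AllocationState(Enum):
    """Lifecycle states as named in the scope table."""
    REQUEST = "request"
    PROVISIONING = "provisioning"
    ACTIVE = "active"
    RELEASING = "releasing"
    RELEASED = "released"
    FAILED = "failed"
    RELEASE_FAILED = "release-failed"


# Assumed transition map: the source lists the states but not the edges.
ALLOWED_TRANSITIONS = {
    AllocationState.REQUEST: {AllocationState.PROVISIONING, AllocationState.FAILED},
    AllocationState.PROVISIONING: {AllocationState.ACTIVE, AllocationState.FAILED},
    AllocationState.ACTIVE: {AllocationState.RELEASING},
    AllocationState.RELEASING: {AllocationState.RELEASED, AllocationState.RELEASE_FAILED},
    # Assumption: a failed release can be retried.
    AllocationState.RELEASE_FAILED: {AllocationState.RELEASING},
    # Terminal states have no outgoing transitions.
    AllocationState.RELEASED: set(),
    AllocationState.FAILED: set(),
}


def can_transition(current: AllocationState, target: AllocationState) -> bool:
    """Return True if the assumed map allows moving from current to target."""
    return target in ALLOWED_TRANSITIONS[current]


def advance(current: AllocationState, target: AllocationState) -> AllocationState:
    """Return the target state, or raise ValueError on a disallowed move."""
    if not can_transition(current, target):
        raise ValueError(f"illegal transition: {current.value} -> {target.value}")
    return target
```

Keeping the edges in one explicit map makes it easy to compare the sketch against the canonical lifecycle contract and to correct any edge that differs.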

Product Maturity

| Strength | Gap to keep visible |
| --- | --- |
| Contract-first API/event foundation | Generated API reference needs to become first-class in portal |
| Clear v3 IA and user journey model | UI migration and route consolidation are still active work |
| Billing, allocation, terminal, and storage flows are documented | Operator diagnostics and monetization depth need more product surface |
| App SDK direction is strong | Published SDK artifacts and internal developer onboarding need a formal release path |
| Security and production readiness are explicit | Enforcement separation and environment/ring maturity still need implementation |

Product Operating Principle

The portal should explain the product without becoming the product backlog. Canonical specs, contracts, and implementation plans remain in source docs; the portal gives each team a maintained path through them.