Overall Platform Architecture designed
This is the top-down architecture ladder for AI Cloud. A reviewer should be able to start here, understand the product shape, then choose the right next packet without reverse-engineering the repo.
The System In One Sentence
AI Cloud is a shared control plane for GPU runtimes, apps, and future platform products, with reusable IAM, billing, policy, evidence, registry, and runtime access services underneath product-specific workflows.
The Architecture Ladder
What Each Layer Answers
| Layer | Main question | Primary page |
|---|---|---|
| Vision | What are we building and why is it a platform, not just one app? | Architecture and Design Principles |
| Shared model | Which capabilities are shared across products vs product-owned? | Platform Shared Services |
| Runtime topology | How does traffic, control, async work, and node access actually flow? | System Overview |
| Code/layer ownership | Where does code belong and how is deployment separation supposed to evolve? | Code Structure And Layer Model |
| Deep engineering | What is the detailed model for each major service family? | Detailed Design Index |
| Product reality | How does this show up in user, admin, operator, and release workflows? | Use AI Cloud, Operators, Security & Production Readiness |
Platform Shape
Core Design Claim
The repo is not organized around “frontend vs backend” or “one service per feature.” It is organized around a stable control-plane model:
- contracts first;
- shared business authority in platform services;
- product-specific behavior in product domains;
- provider and environment specifics behind adapters and config;
- production as a profile of the same system, not a different product.
What Is Easy To Miss
The codebase already contains more than a thin provisioning app:
- shared IAM and access posture;
- immutable billing and metering model;
- audit, evidence, and release-readiness posture;
- terminal and workload runtime access surfaces;
- app-platform and SDK hooks;
- environment and deployment thinking that reaches beyond local demo mode.
That is why the architecture section needs both overview pages and deep service packets.
Recommended Reading Order
- Architecture and Design Principles
- Overall Platform Architecture
- System Overview
- Code Structure And Layer Model
- Platform Shared Services
- Detailed Design Index
Use This Page As The Jump Point For Reviews
- ARB/JAD: start here, then open Architecture Review Pack
- Product/platform alignment: continue to Product
- Security/control review: continue to Security & Production Readiness
- Developer design/implementation: continue to Developer Implementation Map
Canonical sources