Enterprises with Cutting-Edge AI
Sora Cloud is an LLM marketplace that allows enterprises to train and deploy models and manage GPU compute from a single platform. Built at a time when model access and infrastructure were typically handled separately, it brought both together to reduce the overhead of working across multiple vendors.

Efficient AI Training
Train and deploy models faster with automated workflows, reducing the need to manage infrastructure directly.
Custom AI Solutions
Use pre-configured workflows to set up deployments quickly, with common patterns and best practices built in.
Dynamic GPU Resources
Scale GPU compute based on demand, with flexible configurations designed to balance performance and cost.
Pick Your Model
Browse available LLMs and choose the right one for your use case. Filter by model size, GPU requirements, and pricing.


Deploy & Monitor
Launch a deployment and track real-time metrics across your setup, including GPU allocation, memory usage, and active jobs.
Scale Your Resources
Add GPU machines as needed, or enable auto-scaling to adjust compute dynamically based on workload.
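To make the auto-scaling idea concrete, here is a minimal sketch of one common approach: proportional scaling toward a target utilization. This is not Sora Cloud's actual logic. The function name, parameters, and default limits are all hypothetical and chosen only for illustration.

```python
import math


def desired_gpu_machines(
    current: int,
    utilization: float,
    target: float = 0.7,
    min_machines: int = 1,
    max_machines: int = 8,
) -> int:
    """Return a machine count that moves average GPU utilization toward `target`.

    All names and defaults here are illustrative assumptions,
    not part of any real Sora Cloud API.
    """
    if current < 1:
        raise ValueError("current must be at least 1")
    if not 0 < target <= 1:
        raise ValueError("target must be in (0, 1]")
    if utilization < 0:
        raise ValueError("utilization must be non-negative")

    # Scale the fleet in proportion to how far utilization is from the target.
    # Rounding first avoids floating-point noise pushing ceil() up by one.
    raw = math.ceil(round(current * utilization / target, 6))

    # Clamp to the configured limits so cost stays bounded.
    return max(min_machines, min(max_machines, raw))


if __name__ == "__main__":
    # Heavily loaded fleet: the sketch suggests adding machines.
    print(desired_gpu_machines(current=4, utilization=0.95))
    # Mostly idle fleet: the sketch suggests scaling down.
    print(desired_gpu_machines(current=4, utilization=0.2))
```

The upper and lower limits stand in for the cost-versus-performance trade-off: the fleet grows under load but never beyond a fixed budget, and never shrinks to zero.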


Monitor Performance
Track queries per second (QPS) and latency in real time, and use live dashboards to balance performance with cost.
Test & Iterate
Use the Playground to test deployments with live requests, refine parameters, and validate outputs before production.

All work showcased here is my own and created in collaboration with respective teams and organisations. Some details, visuals, and identifiers may have been adapted or anonymised for confidentiality. Please do not reproduce, reuse, or distribute any part of this work without prior permission.