Enterprises with Cutting-Edge AI

Sora Cloud is an LLM marketplace that allows enterprises to train and deploy models and manage GPU compute from a single platform. Built at a time when model access and infrastructure were typically handled separately, it brought both together to reduce the overhead of working across multiple vendors.

Efficient AI Training

Train and deploy models faster with automated workflows, reducing the need to manage infrastructure directly.

Custom AI Solutions

Use pre-configured workflows to set up deployments quickly, with common patterns and best practices built in.

Dynamic GPU Resources

Scale GPU compute based on demand, with flexible configurations designed to balance performance and cost.

Pick Your Model

Browse available LLMs and choose the right one for your use case. Filter by model size, GPU requirements, and pricing.
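As a rough illustration of the filtering described above, the sketch below narrows a catalog by model size, GPU requirement, and price. The `ModelListing` fields, the `filter_models` helper, and the sample entries are all hypothetical and do not reflect the platform's actual data model or API.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class ModelListing:
    # Hypothetical catalog entry; field names are illustrative only.
    name: str
    params_billion: float
    min_gpu_memory_gb: int
    price_per_hour_usd: float


def filter_models(
    catalog: Iterable[ModelListing],
    max_params_billion: Optional[float] = None,
    max_gpu_memory_gb: Optional[int] = None,
    max_price_per_hour_usd: Optional[float] = None,
) -> List[ModelListing]:
    """Keep listings that satisfy every limit that was supplied."""
    matches = []
    for model in catalog:
        if max_params_billion is not None and model.params_billion > max_params_billion:
            continue
        if max_gpu_memory_gb is not None and model.min_gpu_memory_gb > max_gpu_memory_gb:
            continue
        if max_price_per_hour_usd is not None and model.price_per_hour_usd > max_price_per_hour_usd:
            continue
        matches.append(model)
    # Return the cheapest options first.
    return sorted(matches, key=lambda model: model.price_per_hour_usd)


# Made-up sample data for demonstration.
catalog = [
    ModelListing("small-7b", 7, 16, 1.20),
    ModelListing("medium-13b", 13, 40, 2.50),
    ModelListing("large-70b", 70, 160, 9.80),
]

fits_on_40gb = filter_models(catalog, max_gpu_memory_gb=40)
```

Leaving a limit as `None` skips that filter, so the same helper covers any combination of the three criteria.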

Deploy & Monitor

Launch a deployment and track real-time metrics across your setup, including GPU allocation, memory usage, and active jobs.

Scale Your Resources

Add GPU machines as needed, or enable auto-scaling to adjust compute dynamically based on workload.
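One common way to express workload-based auto-scaling is a target-throughput rule: provision enough machines that each one stays at or below a chosen request rate, clamped between a floor and a ceiling. The sketch below assumes that rule; the function name, parameters, and default limits are hypothetical, and the platform's real scaling policy is not described in the source.

```python
import math


def desired_gpu_count(
    current_qps: float,
    target_qps_per_gpu: float,
    min_gpus: int = 1,
    max_gpus: int = 8,
) -> int:
    """Return how many GPU machines a simple target-throughput rule asks for."""
    if target_qps_per_gpu <= 0:
        raise ValueError("target_qps_per_gpu must be positive")
    # Round up so that no machine is pushed above its target rate.
    needed = math.ceil(current_qps / target_qps_per_gpu)
    # Clamp to the configured floor and ceiling.
    return max(min_gpus, min(max_gpus, needed))
```

For instance, 250 requests per second at a target of 100 per machine rounds up to three machines, while an idle deployment stays at the configured minimum.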

Monitor Performance

Track queries per second (QPS) and latency in real time, and use live dashboards to balance performance with cost.
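To make the two metrics concrete, the sketch below computes a request rate over a time window and a nearest-rank latency percentile from raw samples. It is a minimal illustration under assumed inputs; the helper names are hypothetical and the platform's own aggregation method is not stated in the source.

```python
import math
from typing import Sequence


def queries_per_second(request_count: int, window_seconds: float) -> float:
    """Average request rate over one observation window."""
    if window_seconds <= 0:
        raise ValueError("window_seconds must be positive")
    return request_count / window_seconds


def latency_percentile(latencies_ms: Sequence[float], pct: int) -> float:
    """Nearest-rank percentile: the smallest sample with at least pct% of samples at or below it."""
    if not latencies_ms:
        raise ValueError("latencies_ms must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in the range 1..100")
    ordered = sorted(latencies_ms)
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[rank - 1]
```

A dashboard would typically show a high percentile such as p95 alongside the median, since tail latency tends to rise before the average does when a deployment is under-provisioned.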

Test & Iterate

Use the Playground to test deployments with live requests, refine parameters, and validate outputs before production.

All work showcased here is my own, created in collaboration with the respective teams and organisations. Some details, visuals, and identifiers may have been adapted or anonymised for confidentiality. Please do not reproduce, reuse, or distribute any part of this work without prior permission.
