Digital Planet

WPP HEX · Generative · Multimodal

A personal project built during the WPP HEX Creative Tech Apprenticeship, framed primarily as R&D. The goal was to validate multimodal orchestration (live Gemini vision, OpenAI Speech-to-Text, and a RAG / LLM stage) driving scene and material updates in NVIDIA Omniverse. Assets were authored in Substance, and the Blender API was used where real-time Substance API access was not available.

Concept

The early creative north star was Solaris-inspired: a sentient, conversational planet. The build itself prioritized integration proof (multi-model connectivity and real-time tooling) over a single polished narrative arc.

The pipeline ingests live camera and speech input. Gemini analyzes the vision stream and OpenAI Speech-to-Text transcribes the voice. The fused intent is then routed through the OpenAI RAG / LLM stage, which selects, arranges, and places content in Omniverse. The Blender API and Substance Designer (Smart Materials) handle terrain and material updates.
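As a rough illustration of that flow, the sketch below wires the stages together with plain Python stand-ins. It is not the project's code: the class and function names (`VisionObservation`, `SpeechTranscript`, `SceneCommand`, `fuse_intent`, `plan_commands`, `apply_commands`) and the asset catalog are hypothetical, the model calls are replaced by hand-written inputs, the RAG / LLM stage is approximated by a keyword lookup, and the "scene" is a dictionary rather than an Omniverse stage.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class VisionObservation:
    """Hypothetical summary of one camera frame (would come from the vision model)."""
    labels: list[str]


@dataclass
class SpeechTranscript:
    """Hypothetical transcript of one utterance (would come from speech-to-text)."""
    text: str


@dataclass
class SceneCommand:
    """Hypothetical instruction for the scene layer (Omniverse / Blender in the project)."""
    action: str
    target: str
    params: dict = field(default_factory=dict)


def fuse_intent(vision: VisionObservation, speech: SpeechTranscript) -> str:
    """Merge both modalities into one text prompt for the planning stage."""
    seen = ", ".join(vision.labels) if vision.labels else "nothing"
    return f"User said: {speech.text}. Camera shows: {seen}."


def plan_commands(prompt: str, catalog: dict[str, str]) -> list[SceneCommand]:
    """Stand-in for the RAG / LLM stage: match catalog keywords in the prompt."""
    lowered = prompt.lower()
    return [
        SceneCommand("place_asset", asset, {"keyword": keyword})
        for keyword, asset in catalog.items()
        if keyword in lowered
    ]


def apply_commands(commands: list[SceneCommand], scene: dict) -> dict:
    """Stand-in for the scene update: record each target under its action."""
    for command in commands:
        scene.setdefault(command.action, []).append(command.target)
    return scene


# Hand-written inputs in place of live camera and microphone data.
vision = VisionObservation(labels=["hand", "window"])
speech = SpeechTranscript(text="Grow a forest next to the ocean")
catalog = {"forest": "forest_patch", "ocean": "ocean_material"}  # assumed asset names

prompt = fuse_intent(vision, speech)
scene = apply_commands(plan_commands(prompt, catalog), {})
print(scene)
```

Keeping each stage behind a small function boundary like this is one way to swap a stub for a real model or scene API call without touching the rest of the flow, which suits an integration-proof build.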

Pipeline overview

Gemini · Pro Vision API
OpenAI · Speech-to-text
OpenAI · RAG / LLM
NVIDIA Omniverse
Blender · API
Substance Designer · Smart materials

Visuals

Initial planet surface before multimodal interaction.

Digital Planet — initial state

Work in progress: terrain generation, materials, and pipeline integration.

Work in progress — mesh and Blender height pipeline

Work in progress — Substance Designer landscape graph

Work in progress — gradient mapping and Omniverse viewport

Tech

Gemini Pro Vision · GCP · OpenAI Speech-to-Text · OpenAI RAG / LLM · NVIDIA Omniverse · Blender API · Substance Designer · WPP HEX