Sora Concept: Visual Analyzer (Zero-Shot Image Classification)

This tool illustrates the visual understanding component that advanced systems like Sora 2 require. Upload an image and enter candidate labels; the tool scores how well each label matches the image (zero-shot classification). Note: Sora 2 (text-to-video) is too computationally demanding to run directly in the browser via transformers.js, so this demo uses a lightweight CLIP model (Xenova/clip-vit-base-patch32) for fast in-browser image analysis.
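For readers curious how CLIP-style zero-shot classification arrives at its scores, here is a minimal sketch of the scoring step only. It assumes the image and each label have already been embedded; the three-dimensional vectors are made-up toy values, the function names are hypothetical, and the logit scale of 100 is an assumed stand-in for CLIP's learned temperature. It is not the code this page runs.

```javascript
// Toy sketch of CLIP-style zero-shot scoring. All vectors are invented.
function dot(a, b) {
  return a.reduce((sum, x, i) => sum + x * b[i], 0);
}

// Cosine similarity between two embedding vectors.
function cosineSimilarity(a, b) {
  return dot(a, b) / (Math.sqrt(dot(a, a)) * Math.sqrt(dot(b, b)));
}

// Turn raw logits into probabilities that sum to 1.
function softmax(logits) {
  const max = Math.max(...logits); // subtract the max for numerical stability
  const exps = logits.map((x) => Math.exp(x - max));
  const total = exps.reduce((s, x) => s + x, 0);
  return exps.map((x) => x / total);
}

// Score one image embedding against one embedding per candidate label.
// logitScale = 100 is an assumption, not a value read from the model.
function zeroShotScores(imageEmbedding, labelEmbeddings, logitScale = 100) {
  const logits = labelEmbeddings.map(
    (t) => logitScale * cosineSimilarity(imageEmbedding, t)
  );
  return softmax(logits);
}

const labels = ["cat", "dog", "car"];
const imageEmbedding = [0.9, 0.1, 0.0]; // toy image vector
const labelEmbeddings = [
  [1, 0, 0], // toy vector for "cat"
  [0, 1, 0], // toy vector for "dog"
  [0, 0, 1], // toy vector for "car"
];

const scores = zeroShotScores(imageEmbedding, labelEmbeddings);
const ranked = labels
  .map((label, i) => ({ label, score: scores[i] }))
  .sort((a, b) => b.score - a.score);
console.log(ranked); // the label closest to the image vector ranks first
```

In transformers.js, the embedding and scoring steps are normally wrapped together by the `zero-shot-image-classification` pipeline, so a page like this one would not need to implement them by hand.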

1. Settings

Compute Device:
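The device options offered here are not visible in this text. As a rough sketch, and assuming the choices are WebGPU and WASM (an assumption, not something the page states), a page could fall back to WASM when the browser does not expose WebGPU. The helper name `pickDevice` is hypothetical.

```javascript
// Hypothetical helper: choose a compute device with a safe fallback.
// Assumes the two options are "webgpu" and "wasm".
function pickDevice(requested, nav) {
  // navigator.gpu is defined only in browsers that expose WebGPU.
  const hasWebGPU = nav != null && "gpu" in nav && nav.gpu != null;
  return requested === "webgpu" && hasWebGPU ? "webgpu" : "wasm";
}

// Outside a browser there may be no navigator, so use an empty object.
const nav = typeof navigator !== "undefined" ? navigator : {};
console.log(pickDevice("webgpu", nav));
```

In transformers.js v3, a value like this is typically passed as the `device` option when creating a pipeline; whether this page does so is not stated.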

2. Input Image & Labels

Choose Image

Candidate Labels (one per line; 5 or fewer recommended):
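One way to turn the text box contents into a label list is sketched below. It assumes labels are separated by newlines, trims whitespace, drops blank and duplicate lines, and flags (rather than truncates) input that exceeds the recommended count. The function name `parseCandidateLabels` and the returned shape are hypothetical.

```javascript
// Hypothetical helper: parse one-label-per-line text into a clean list.
function parseCandidateLabels(text, recommendedMax = 5) {
  const labels = [
    ...new Set(
      text
        .split(/\r?\n/) // handle both Unix and Windows line endings
        .map((line) => line.trim())
        .filter((line) => line.length > 0) // skip blank lines
    ),
  ];
  // Only a warning flag: the limit is a recommendation, not a hard cap.
  return { labels, exceedsRecommended: labels.length > recommendedMax };
}

const parsed = parseCandidateLabels("cat\n  dog  \n\ncat\nbird");
console.log(parsed.labels, parsed.exceedsRecommended);
```

Flagging instead of truncating keeps every label the user typed while still letting the page warn that more labels mean more text embeddings to compute.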

3. Analysis Result

Image Preview Area

Ready. Upload an image and provide labels.