OpenAI launches Sora 2 and a new app that puts video generation in your pocket

Sora 2

OpenAI has released Sora 2, its newest video and audio generation model. The company says it is more accurate, realistic, and controllable than the first version. It can now handle complex physical interactions, and it adds synchronized dialogue and sound effects. That makes it capable of generating Olympic gymnastics routines, paddleboard backflips that respect buoyancy, or anime battles that hold together without collapsing into chaos.

Earlier models often cheated physics to satisfy prompts. A missed basketball shot might suddenly teleport into the hoop. With Sora 2, the ball actually rebounds off the backboard. Even the mistakes feel more natural, resembling the kind of errors a real human performer might make. This makes the output more believable and shows that the model is starting to understand failure as well as success.

The system also does a better job of keeping continuity across multiple shots. That allows users to build sequences that feel coherent rather than stitched together. It works across styles too, moving from cinematic realism to anime while maintaining detail. Sora 2 can even drop real people, animals, or objects into generated environments. With just a short recording, it can capture likeness and voice with surprising accuracy.

Alongside the model, OpenAI is launching a new iOS app called Sora. The app is built around social creation. Users can make clips, remix others, and discover new content in a personalized feed. The cameo feature is central. It lets you record yourself once, then appear inside generated scenes that your friends can share and build upon.

OpenAI says this makes the app feel like a natural next step in communication. We went from text to emojis to voice notes. Now it is moving into video with your own likeness in the starring role.

The company is also trying to avoid the usual problems with feeds and algorithms. Recommendations can be shaped directly with natural language instructions. There are built-in wellbeing checks. Teens get stricter limits on how many generations they can see, and parents have controls to manage personalization and cameo use.

Safety is emphasized across the board. OpenAI has policies for likeness consent, provenance, and harmful content. Human moderators are being scaled up to quickly review problems if they arise.

Sora is rolling out as invite-only in the U.S. and Canada. Other regions will follow. Generations are free within limits, though OpenAI hints that small payments may eventually be needed to cover compute. ChatGPT Pro users will also get access to an experimental Sora 2 Pro version that produces higher quality output. Developers can expect an API soon.

To me, this looks like a game changer. OpenAI’s announcements are arriving fast and furiously, and it feels a lot like early Google when something new seemed to drop every week. If Sora 2 really is the GPT-3.5 moment for video, it may be our first glimpse of how generative AI could eventually simulate the physical world itself.

Avatar of Brian Fagioli
Written by

Brian Fagioli

Technology journalist and founder of NERDS.xyz

Brian Fagioli is a technology journalist and founder of NERDS.xyz. A former BetaNews writer, he has spent over a decade covering Linux, hardware, software, cybersecurity, and AI with a no nonsense approach for real nerds.

Leave a Comment