OpenAI’s Sora: A Groundbreaking New Text-to-Video AI Generator Announced

Feb 16, 2024

OpenAI has introduced a new text-to-video AI generator called Sora. Sora can create videos up to 60 seconds long with highly detailed scenes, complex camera motion, and multiple characters. According to OpenAI, the model understands not only what a prompt asks for but also how those things exist in the physical world. Sora can also generate a video from a still image by animating the image’s contents, extend an existing video, or fill in missing frames.

Sora is a diffusion model. It was trained on publicly available videos and on copyrighted videos licensed for the purpose, although OpenAI has not disclosed the exact sources. Sora generates an entire video at once, which helps keep a subject consistent even when it temporarily goes out of view. Sora also uses the recaptioning technique from DALL·E 3, which interprets a user’s prompt and creates a highly descriptive caption that the model then uses to generate the video.

Sora has not yet been released to the public. It is currently available only to a limited number of visual artists, designers, and filmmakers, and OpenAI plans to use their feedback to improve the model.

Screenshot from a Sample Video Generated by OpenAI’s Sora

Steps Taken by OpenAI to Ensure Safety with Sora

Misuse has always been a concern with AI video generators, so OpenAI is taking several safety steps to keep Sora from being used for harmful purposes.

To test the model, OpenAI is working with red teamers, experts in areas such as misinformation, hateful content, and bias, who will adversarially test it to identify risks. OpenAI is also building tools that can detect whether a video was generated by Sora, and it plans to add C2PA metadata in the future.

As with DALL·E 3, Sora will use existing safety methods such as a text classifier that checks input prompts and rejects those that violate OpenAI’s usage policies, including prompts that request extreme violence, sexual content, hateful imagery, or celebrity likenesses.

OpenAI has also developed robust image classifiers that will check the frames of every generated video to make sure it adheres to their existing usage policies.
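
The two checks described above, one on the prompt before generation and one on the frames after it, can be pictured as a simple gate around the video generator. The Python sketch below is only an illustration of that flow and is not OpenAI’s implementation: the keyword list stands in for a trained text classifier, and the names check_prompt, check_frames, moderate, generate, and frame_is_safe are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Iterable

# Hypothetical stand-in for a trained text classifier: a plain keyword list
# built from the categories named in the article.
BLOCKED_TERMS = (
    "extreme violence",
    "sexual content",
    "hateful imagery",
    "celebrity likeness",
)


@dataclass
class ModerationResult:
    allowed: bool
    reason: str


def check_prompt(prompt: str) -> ModerationResult:
    """Stage 1: reject a prompt before any video is generated."""
    lowered = prompt.lower()
    for term in BLOCKED_TERMS:
        if term in lowered:
            return ModerationResult(False, f"prompt matched blocked category: {term}")
    return ModerationResult(True, "prompt accepted")


def check_frames(
    frames: Iterable[object], frame_is_safe: Callable[[object], bool]
) -> ModerationResult:
    """Stage 2: check every generated frame; one failing frame blocks the video."""
    for index, frame in enumerate(frames):
        if not frame_is_safe(frame):
            return ModerationResult(False, f"frame {index} failed the image check")
    return ModerationResult(True, "all frames passed")


def moderate(
    prompt: str,
    generate: Callable[[str], Iterable[object]],
    frame_is_safe: Callable[[object], bool],
) -> ModerationResult:
    """Run the prompt check first and only generate frames if it passes."""
    prompt_result = check_prompt(prompt)
    if not prompt_result.allowed:
        return prompt_result
    return check_frames(generate(prompt), frame_is_safe)
```

Here generate and frame_is_safe are passed in as plain functions, so the gate can be tried with stand-ins, for example a generate that returns a list of strings and a frame_is_safe that flags one of them. A real system would put trained classifiers behind both checks.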

Current Weaknesses of OpenAI’s Sora

Because the model is still in testing and has not been released, it has some known weaknesses. OpenAI describes them in its blog post: “It may struggle with accurately simulating the physics of a complex scene, and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterward, the cookie may not have a bite mark.” OpenAI also says that the model sometimes confuses the spatial details of a prompt, for example by mixing up left and right.

Videos Generated by OpenAI’s Sora

OpenAI posted a series of videos generated solely by Sora on X. More sample videos, each shown with its prompt, are available on OpenAI’s website. The samples have drawn widespread attention: the generated videos look highly realistic, and it is difficult to tell that they were created by AI.

Prompt: “A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. she wears a black leather jacket, a long red dress, and black boots, and carries a black purse. she wears sunglasses and red lipstick. she walks confidently and casually.… pic.twitter.com/cjIdgYFaWq
— OpenAI (@OpenAI) February 15, 2024

Conclusion

Sora is likely to change how filmmakers and content creators make their videos. The AI-generated videos look highly realistic, and it is difficult to tell that they are AI-generated. Sora opens up many creative possibilities, but it can also be used to generate misleading videos, and preventing that kind of misuse will be a major challenge for OpenAI. It is therefore important for OpenAI to establish clear security and privacy norms before making the model available to everyone. It is only February, but OpenAI’s Sora already looks like one of the biggest things of 2024.
