Αποτελέσματα Αναζήτησης
[2024.05.27] 🎉 We are launching Open-Sora Plan v1.1.0, which significantly improves video quality and length, and is fully open source! Please check out our latest report . Thanks to ShareGPT4Video's capability to annotate long videos.
- Releases · PKU-YuanGroup/Open-Sora-Plan · GitHub
In version 1.3.0, Open-Sora-Plan introduced the following...
- Releases · PKU-YuanGroup/Open-Sora-Plan · GitHub
Although this version is experimental, it advances video generation architecture to a new realm, leading us to release it as v1.2.0. Compared to previous video generation models, Open-Sora-Plan v1.2.0 offers the following improvements: 1. **Better compressed visual representations**.
Open-Sora Plan. We are thrilled to present Open-Sora-Plan v1.0.0, which significantly enhances video generation quality and text control capabilities. See our report. We are training for higher resolution (>1024) as well as longer duration (>10s) videos, here is a preview of the next release.
In version 1.3.0, Open-Sora-Plan introduced the following five key features: A more powerful and cost-efficient WFVAE. We decompose video into several sub-bands using wavelet transforms, naturally capturing information across different frequency domains, leading to more efficient and robust VAE learning. Prompt Refiner.
17 Ιουν 2024 · Open-Sora 1.0 supports a full pipeline of video data preprocessing, training with ColossalAI acceleration, inference, and more. Our provided checkpoints can produce 2s 512x512 videos with only 3 days training.
Today, we are thrilled to launch a project called Open-Sora plan, aiming to reproduce OpenAI's video generation model. Therefore, we introduce our framework, which is comprised of the following components. Video VQ-VAE. This Compress video into latent in time and space dimensions. Denoising Diffusion Transformer. Condition Encoder.
11 Απρ 2024 · What is Open-Sora-Plan v1.0.0? Open-Sora-Plan v1.0.0 is a groundbreaking framework designed to advance video generation technology while empowering precise text control capabilities.