AI Speed Box

AI News and Information Link Collection (Mint Bear's rough SNS clippings, grown into Visual AI News)
2025 AI Era Human Intelligence Conference
  1. AI
 
 
2025/01/12
Limited Release (Partial Release)
Voice Cursor
  1. AI Sound
  1. ETC sound
 
 
 
 
2024/12/22
Available Now (Available)
Photoshop (Beta) New Feature: Select Body Parts
  1. AI Image
  1. Adobe Photoshop
 
 
 
 
2024/12/19
Available Now (Available)
Report: AIBRAHAM
Kling 1.6
  1. AI Video
  1. Kling
 
 
 
 
2024/12/19
Available Now (Available)
Ideogram Batch Generation
  1. AI Image
  1. Ideogram
 
2024/12/18
Available Now (Available)
Midjourney Office Hours (2024-12-18)
  1. AI Image
  1. Midjourney
Recent updates: Patchwork, Moodboards, and multiple Profiles / V7 scheduled for January 2025 / Batch 8, for mass image generation, in preparation.
https://twitter.com/blackowl777/status/1869499353045324127
2024/12/18
Coming Soon
Veo 2
  1. AI Video
  1. _Google
 
 
 
 
2024/12/17
Coming Soon
Midjourney Moodboards
  1. AI Image
  1. Midjourney
2024/12/17
Available Now (Available)
Google's New AI Glasses (Android XR)
  1. AR, XR, VR
  1. _Google
 
 
2024/12/16
Coming Soon
Video watermarking technology, Meta Video Seal
  1. AI Video
  1. _Meta
 
 
 
 
2024/12/15
Available Now (Available)
Pika 2.0 Update
  1. AI Video
  1. Pika
 
 
 
2024/12/15
Available Now (Available)
Motivo by Meta
  1. AI 3D
  1. _Meta
 
 
 
2024/12/15
Available Now (Available)
Leffa by Meta
  1. AI Image
  1. _Meta
 
 
 
2024/12/14
Available Now (Available)
Gemini 2.0
  1. AI LLM
  1. Gemini
 
 
 
 
2024/12/13
Limited Release (Partial Release)
Krea Editor Updates
  1. AI Image
  1. Krea
 
 
2024/12/13
Trellis 3D
  1. AI 3D
  1. 3D
 
 
2024/12/12
Available Now (Available)
Rodin
  1. AI 3D
  1. ETC
 
 
2024/12/12
Available Now (Available)
Midjourney Patchwork
  1. AI Image
  1. Midjourney
 
 
2024/12/12
Available Now (Available)
DiffSensei
  1. AI Toons
  1. ETC toons
 
 
 
 
2024/12/11
Available Now (Available)
MMAudio: Video-to-Audio Synthesis
  1. AI Sound
  1. ETC sound
 
 
 
 
2024/12/11
Available Now (Available)
Sora v2 showing in London
  1. AI Video
  1. Sora
 
 
 
2024/12/09
Available Now (Available)
Leonardo - FlowState
  1. AI Image
  1. Leonardo
 
 
 
 
2024/12/07
Available Now (Available)
ElevenLabs - Conversational AI
  1. AI Sound
  1. ElevenLabs
 
 
2024/12/06
Coming Soon
OpenAI, 12 days of live streams
  1. AI
  1. OpenAI
  2. OpenAI o1
  3. Sora
 
 
2024/12/05
Limited Release (Partial Release)
Google DeepMind just dropped Genie 2
  1. AI Video
 
 
 
2024/12/05
Available Now (Available)
Swift-Edit
  1. AI Image
  1. ETC Image
 
 
 
2024/12/05
Available Now (Available)
Midjourney Office Hours (2024-12-04)
  1. AI Image
  1. Midjourney
 
 
2024/12/04
Coming Soon
Gen3 KeyFraming (Prototype)
  1. AI Video
  1. Gen-3
 
 
2024/12/03
Coming Soon
Hunyuan Video by Tencent
  1. AI Video
  2. AI Sound
  1. Hunyuan
 
 
 
 
2024/12/03
Available Now (Available)
Motion Prompting (Google DeepMind)
  1. AI Video
  1. _Google
 
 
 
 
2024/12/03
Coming Soon

Midjourney Office Hours (2024-12-18)

Category
  1. AI Image
Gen
  1. Midjourney
Date
2024/12/18
Summary 🍀🧸
Recent updates: Patchwork, Moodboards, and multiple Profiles / V7 scheduled for January 2025 / Batch 8, for mass image generation, in preparation.
URL
https://twitter.com/blackowl777/status/1869499353045324127
Release
Coming Soon

Reference
Midjourney is reportedly preparing a Batch feature for mass image generation, and Ideogram has just released Batch Generation.

Original

Midjourney Office Hours 2024-12-18
(Source: JamesGriffing, https://discord.com/channels/662267976984297473/1037743153471553618/1319048499441958982)

Recent Feature Releases

Mood boards and multiple personalization profiles have been released
Encouragement to use custom models for improved results over baseline models
Introduction of an experimental research feature called "network" for world-building and storytelling exploration

Future Sharing and Exploration Tools

Plans to enhance exploration of S-REFs and mood boards for broader community sharing
A long-term goal: enable aesthetic exploration beyond all prior human history
Desire to coordinate community creativity to achieve an "aesthetic singularity"

Upcoming Batch Image Features

Consideration of "batch 8" features to manage and manipulate larger sets of images
Prioritization of mood board sharing before batch features
Ongoing debate on image resolution versus batch size optimization

Version 7 (V7) Model Development

V7 training is ongoing and may be ready by end of January
Focus of V7 is on character consistency and improved character references
Potential enhancements for style references, object references, and character-based storytelling
Exploration of optimal resolution, batch sizes, and upscaling strategies
After V7 release, plan to quickly iterate with follow-up models (e.g., 7.1) and more frequent updates
Emphasis on decoupling architecture, data, and scaling releases to achieve a steady, frequent release cadence

Video Model Considerations

Current video models show mixed results
Trade-offs between quality, speed, and cost remain challenging
Plans to release some video functionality by January to gauge community interest
Evaluating whether to develop in-house video models, partner with third parties, or offer multiple options
Acknowledgment that high-quality video models may not be cost-effective yet

World-Building and Interactive Features

Long-term interest in world-building, storytelling, and immersive experiences
Possibility of future “walk-around” or real-time interactive features
Exploration of comic book-like storytelling and multiple character integration scenarios

Infrastructure and Server Capacity

Current surplus of server capacity due to early hardware arrivals
Introduction of a "holiday relaxathon" period to provide relaxed mode to all users
Intention to reduce or remove relax mode wait times, enabling near-unlimited image creation
Use of this period to gather data on server usage and community interest in high-volume generation

Emphasis on Fundamentals and Ongoing Improvement

Recognition that many users primarily use basic features rather than advanced tools
Commitment to improving core aspects: speed, resolution, quality, and prompt comprehension
Efforts to maintain a balance between new feature innovation and refining the basics

Aesthetics and Beauty in Model Output

Pursuit of models that not only produce realistic results but also more beautiful images
Continued encouragement for users to personalize models for better aesthetic outcomes
Plans to develop methods to improve overall visual appeal and engage more deeply with design aspects

Data, Scaling, and Release Cadence for 2025

New approach to model versioning to ensure more frequent and focused updates
Multiple planned models following V7 will incorporate different improvements (data, scaling, architecture)
Goal to continually learn from community feedback and usage patterns to guide future directions

Translation

Midjourney Office Hours: December 18, 2024

Recent feature updates

Moodboards and multiple personalization profiles launched
Custom models recommended for better results than the baseline models
Introduction of an experimental research feature for world-building and storytelling exploration: Patchwork

Future sharing and exploration tools

Plans to expand community sharing of S-REFs and moodboards
Long-term goal of extending aesthetic exploration beyond all prior human history
Aim of reaching an "aesthetic singularity" by coordinating community creativity

Batch image features

"Batch 8" features under consideration for managing and manipulating larger sets of images
Moodboard sharing prioritized ahead of batch features
Ongoing debate on image resolution versus batch size optimization

V7 model development

V7 training in progress; may be ready by the end of January
Focus on character consistency and improved character references
Potential enhancements to style references, object references, and character-based storytelling
Exploration of optimal resolution, batch sizes, and upscaling strategies
Rapid follow-up models (e.g., 7.1) and more frequent updates planned after the V7 release

Video Model

Current video models show mixed results
Trade-offs between quality, speed, and cost remain challenging
Some video functionality planned for release by January to gauge community interest
Evaluating in-house development, third-party partnerships, or offering multiple options
High-quality video models may not be cost-effective yet

Worldbuilding and interactive features

Long-term interest in world-building, storytelling, and immersive experiences
Possible future "walk-around" or real-time interactive features
Exploration of comic book-style storytelling and multi-character integration

Infrastructure and server capacity

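Current surplus of server capacity due to early hardware arrivals
"Holiday relaxathon" period providing relax mode to all users
Intention to reduce or remove relax mode wait times
Use of this period to gather data on server usage and interest in high-volume generation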

Fundamentals and ongoing improvement

Commitment to improving speed, resolution, quality, and prompt comprehension
Maintaining a balance between refining core features and innovating new ones

Aesthetics of model output

Pursuit of models that produce images that are both realistic and beautiful
Better aesthetic results through model personalization
Plans to improve overall visual appeal and engage more deeply with design aspects

Data, Scaling, and Release Cadence for 2025

New approach to model versioning for more frequent, focused updates
Multiple models planned after V7, each incorporating different improvements (data, scaling, architecture)
Direction guided by community feedback and usage patterns