AI Speed Box

A collection of AI news and information links (Mintbear's off-the-cuff scrapbook, which has grown into Visual AI News)

FLUX.1 Kontext

AI Image

FLUX

2025/05/28

Available Now

Kling March Update - UI, SFX, Assets, Extend, Bloom Effect

AI Video

Kling

2025/03/27

Available Now

Midjourney: folder feature upgraded (Midjourney FOLDER!)

AI Image

Midjourney

2025/02/21

Available Now

Report: AIBRAHAM

Hailuo Effects

AI Video

Hailuo

2025/02/14

Available Now

Adobe Video AI: FIREFLY! Adobe has launched its video AI.

AI Video

Adobe Firefly

2025/02/12

Report: Picassong

OpenAI Rebranding

AI LLM

OpenAI
ChatGPT

2025/02/05

Available Now

Report: Euncheol Kwak

Kling - Creating model images/videos (by Picassong)

AI Video

Kling

2025/02/01

Available Now

Report: Picassong

🎥 Hailuo T2V-01-Director : Camera Control

AI Video

Hailuo

2025/01/28

Available Now

"2025, Get AI in the New Year!"

AI LLM
AI Image
AI Video
AI Sound

Mintbear
OpenAI
ChatGPT
DALLE
Midjourney
Hailuo

2025/01/26

Available Now

OpenAI Agent: 'Operator'

OpenAI

2025/01/24

Limited Release

OpenAI

Kling - KOLORS Image Reference

AI Video

Kling

2025/01/24

Available Now

Kling Elements! (Multiple characters appearing at the same time!)

AI Video

Kling

2025/01/21

Limited Release

Report: Picassong

Edits: Instagram AI Video (Meta, MovieGen, scheduled for March 3)

AI Sound
AI SNS

Edits
_Meta
MovieGen

2025/01/20

Coming Soon

Report: Picassong

Krea 3D... amazing! (real-time rendering of 3D characters)

AI Image
AI 3D

Krea

2025/01/17

Limited Release

Kling - Prompt dictionary and preset features introduced

AI Video

Kling

2025/01/15

2025 AI Era Human Intelligence Conference

AI
AI Image
AI Video
AI Sound

Mintbear

2025/01/12

Event Ended

Kling Effects

AI Video

Kling

2025/01/10

AI Talk YouTube: OpenAI Workflow

AI
AI LLM
AI Image
AI Video

OpenAI o1
Sora

2025/01/07

Available Now

Kling - KOLORS 1.5 image update, plus EndFrame, Virtual Try-On, and LipSync updates

AI Video
AI Image

Kling

2024/12/27

Available Now

Voice Cursor

AI Sound

ETC sound

2024/12/22

Available Now

Photoshop (Beta) New Feature: Select Body Parts

AI Image

Adobe Photoshop

2024/12/19

Available Now

Report: AIBRAHAM

Kling 1.6 Update

AI Video

Kling

2024/12/19

Available Now

Ideogram Batch Generation

AI Image

Ideogram

2024/12/18

Available Now

Midjourney Office Hours (2024-12-18)

AI Image

Midjourney

2024/12/18

Coming Soon

Veo 2

AI Video

_Google

2024/12/17

Coming Soon

Midjourney Moodboards

AI Image

Midjourney

2024/12/17

Available Now

Google's New AI Glasses (Android XR)

AR, XR, VR

_Google

2024/12/16

Coming Soon

Meta Video Seal: video watermarking technology

AI Video

_Meta

2024/12/15

Available Now

Pika 2.0 Update

AI Video

Pika

2024/12/15

Available Now

Motivo by Meta

AI 3D

_Meta

2024/12/15

Available Now

Google DeepMind just dropped Genie 2

Genie 2: A large-scale foundation world model

Generating unlimited diverse training environments for future general agents

deepmind.google

Summary by GPT

This is a summary of Genie 2, a new large-scale world model developed by Google DeepMind. The key takeaways are:

Genie 2 Introduction and Overview

Genie 2 is a model that generates action-controllable 3D environments that a human or an AI agent can play using keyboard and mouse input.

Given a single prompt image, it creates a new interactive virtual world and simulates the consequences of the actions that the human or AI player takes in it.

Key features of Genie 2

Diverse environments: Genie 2 generates a wide variety of 3D environments, providing an effectively unlimited curriculum for training and evaluating general agents. This addresses the training bottleneck that arises when agents only have a limited set of environments to learn from.
Rapid prototyping: Genie 2 can quickly prototype interactive experiences, letting AI researchers experiment in new environments without building them by hand.
Action control: The generated world responds to keyboard input, for example moving a robot or interacting with objects using the arrow keys.
Physical interactions and character animation: Genie 2 models object interactions (e.g. opening doors, popping balloons), character animation, gravity, lighting, reflections, water effects, and more.

Technological developments and applications

Autoregressive latent diffusion model: Genie 2 is an autoregressive latent diffusion model trained on video data. It generates the world frame by frame, with each new frame conditioned on the current action and on past frames.
SIMA agent: SIMA is an agent that carries out tasks in 3D game worlds by following natural-language instructions. Running SIMA inside environments generated by Genie 2 lets researchers test whether it performs the specified actions, which makes Genie 2 useful for agent evaluation.
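The frame-by-frame, autoregressive generation described above can be pictured as a simple loop: each new frame is produced from a window of past frames plus the current action, then fed back in as context for the next step. The sketch below illustrates only that loop structure. It is not DeepMind's implementation and contains no diffusion model; `toy_dynamics`, `rollout`, `context_len`, and the 2-number "frame" are hypothetical names and simplifications chosen for illustration.

```python
from collections import deque
from typing import Callable, List, Sequence

Frame = List[float]  # hypothetical stand-in for a latent frame: just an (x, y) position


def toy_dynamics(history: Sequence[Frame], action: str) -> Frame:
    """Hypothetical stand-in for a learned dynamics model.

    A real world model would predict the next latent frame from the
    history and action; here we simply move a point on a grid.
    """
    dx = {"left": -1.0, "right": 1.0}.get(action, 0.0)
    dy = {"up": 1.0, "down": -1.0}.get(action, 0.0)
    x, y = history[-1]
    return [x + dx, y + dy]


def rollout(
    first_frame: Frame,
    actions: Sequence[str],
    step: Callable[[Sequence[Frame], str], Frame] = toy_dynamics,
    context_len: int = 4,
) -> List[Frame]:
    """Autoregressive rollout: one new frame per action."""
    # Only the most recent `context_len` frames are kept as conditioning context.
    history = deque([first_frame], maxlen=context_len)
    frames = [first_frame]
    for action in actions:
        next_frame = step(list(history), action)
        history.append(next_frame)  # the generated frame becomes context for the next step
        frames.append(next_frame)
    return frames


frames = rollout([0.0, 0.0], ["right", "right", "up", "noop"])
print(frames[-1])  # [2.0, 1.0]
```

The point of the loop is that generated frames are appended to the history, so later frames depend on earlier generated ones rather than on a fixed input video. The bounded `deque` is an assumption made here to keep the context window finite.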

Responsible technology development

Responsible development: DeepMind states that it is committed to developing Genie 2 and its large-scale world models responsibly, and that it is continuing research so that AI agents can perform useful tasks both online and in the real world.

Future development potential

Genie 2 is described as a major step toward AGI (artificial general intelligence) and is expected to play a key role in solving the structural problem of training embodied agents safely.

Summary

Genie 2 is a new world model that generates a variety of 3D environments from a single prompt image, which both AI agents and humans can interact with.

It supports training and evaluating AI through game-like environments, and includes a range of physical-interaction and action-control capabilities.
It also aims to speed up AI research and creative workflows through rapid environment prototyping and large-scale learning.

Although this research is still in its early stages, it shows a wide range of possible applications and considerable potential to advance AI research.
