AI Speed Box

AI News and Information Link Collection (Mint Bear's ignorant SNS scraps, matured to become Visual AI News)
2025 AI Era Human Intelligence Conference
  1. AI
 
 
2025/01/12
Limited Release (Partial Release)
Voice Cursor
  1. AI Sound
  1. ETC sound
 
 
 
 
2024/12/22
Available Now (Available)
Photoshop (Beta) New Feature: Select Body Parts
  1. AI Image
  1. Adobe Photoshop
 
 
 
 
2024/12/19
Available Now (Available)
Report: AIBRAHAM
Kling 1.6
  1. AI Video
  1. Kling
 
 
 
 
2024/12/19
Available Now (Available)
Ideogram Batch Generation
  1. AI Image
  1. Ideogram
 
2024/12/18
Available Now (Available)
Midjourney Office Hours (2024-12-18)
  1. AI Image
  1. Midjourney
 
 
2024/12/18
Coming Soon
Veo 2
  1. AI Video
  1. _Google
 
 
 
 
2024/12/17
Coming Soon
Midjourney Moodboards
  1. AI Image
  1. Midjourney
2024/12/17
Available Now (Available)
Google's New AI Glasses (Android XR)
  1. AR, XR, VR
  1. _Google
 
 
2024/12/16
Coming Soon
Video watermarking technology, Meta Video Seal
  1. AI Video
  1. _Meta
 
 
 
 
2024/12/15
Available Now (Available)
Pika 2.0 Update
  1. AI Video
  1. Pika
 
 
 
2024/12/15
Available Now (Available)
Motivo by Meta
  1. AI 3D
  1. _Meta
 
 
 
2024/12/15
Available Now (Available)
Leffa by Meta
  1. AI Image
  1. _Meta
 
 
 
2024/12/14
Available Now (Available)
The Gemini 2.0
  1. AI LLM
  1. Genmini
 
 
 
 
2024/12/13
Limited Release (Partial Release)
Krea Editor Updates
  1. AI Image
  1. Krea
 
 
2024/12/13
Trellis Trellis 3D
  1. AI 3D
  1. 3D
 
 
2024/12/12
Available Now (Available)
Rodin
  1. AI 3D
  1. ETC
 
 
2024/12/12
Available Now (Available)
Midjourney Patchwork
  1. AI Image
  1. Midjourney
 
 
2024/12/12
Available Now (Available)
DiffSensei
  1. AI Toons
  1. ETC toons
 
 
 
 
2024/12/11
Available Now (Available)
MMAudio: Video-to-Audio Synthesis
  1. AI Sound
  1. ETC sound
 
 
 
 
2024/12/11
Available Now (Available)
Sora v2 showing in London
  1. AI Video
  1. Sora
 
 
 
2024/12/09
Available Now (Available)
Leonardo - FlowState
  1. AI Image
  1. Leonardo
 
 
 
 
2024/12/07
Available Now (Available)
ElevenLabs _ Conversational AI
  1. AI Sound
  1. ElevenLabs
 
 
2024/12/06
Coming Soon
Open AI, 12 days of live
  1. AI
  1. OpenAI
  2. OpenAI o1
  3. Sora
 
 
2024/12/05
Limited Release (Partial Release)
Google DeepMind just dropped Genie 2
  1. AI Video
Google DeepMind just released Genie 2, a game world simulator that allows AI to create rich, interactive 3D worlds from a single image or text.
https://x.com/minchoi/status/1864439424794198291
https://deepmind.google/discover/blog/genie-2-a-large-scale-foundation-world-model/
2024/12/05
Available Now (Available)
Swift-Edit
  1. AI Image
  1. ETC Image
 
 
 
2024/12/05
Available Now (Available)
Midjourney Office Hours (2024-12-04)
  1. AI Image
  1. Midjourney
 
 
2024/12/04
Coming Soon
Gen3 KeyFraming (Prototype)
  1. AI Video
  1. Gen-3
 
 
2024/12/03
Coming Soon
Hunyuan Video by Tencent
  1. AI Video
  2. AI Sound
  1. Hunyuan
 
 
 
 
2024/12/03
Available Now (Available)
Motion Prompting (Google DeepMind)
  1. AI Video
  1. _Google
 
 
 
 
2024/12/03
Coming Soon

Google DeepMind just dropped Genie 2

Category
  1. AI Video
Gen
Empty
Date
2024/12/05
Summary 🍀🧸
Google DeepMind just released Genie 2, a game world simulator that allows AI to create rich, interactive 3D worlds from a single image or text.
URL
https://x.com/minchoi/status/1864439424794198291
URL
https://deepmind.google/discover/blog/genie-2-a-large-scale-foundation-world-model/
Release
Available Now (Available)

Sample Videos

Genie 2: A large-scale foundation world model

Summary by GPT

Here’s a description of a new large-scale world model called Genie 2 developed by Google DeepMind. Here are the key takeaways:

Genie 2 Introduction and Overview

Genie 2 is a model that creates a 3D environment that can control various actions, allowing a human or AI agent to play via keyboard and mouse.

It creates new interactive virtual worlds based on specific prompt images, simulating AI or human behavior.

Key features of Genie 2

Generate a variety of environments: Genie 2 generates a variety of 3D environments, providing an infinite curriculum for training and evaluating general agents. This solves the bottleneck of agent training that can occur in limited environments.
Rapid Prototyping: Genie 2 can rapidly prototype interactive experiences, enabling AI researchers to quickly experiment in new environments.
Action Control: Perform actions via keyboard input, such as manipulating a robot or interacting with objects using the arrow keys.
Physical Interactions and Character Animation: Model object interactions (e.g. opening doors, popping balloons), character animation, gravity and lighting effects, reflections, water effects, and more.

Technological developments and applications

Autoregressive latent diffusion model: Genie 2 is an autoregressive latent diffusion model that learns from video data and simulates frame-by-frame actions and past frames.
SIMA Agent: Train an agent called SIMA to perform tasks in a 3D game world using natural language instructions in an environment generated by Genie 2. SIMA performs specified actions in an environment generated by Genie 2 and assists in evaluation.

Responsible technology development

Responsible Development: Genie 2 is committed to ethical use in generating diverse 3D environments based on large-scale world models, and is conducting research to enable AI agents to perform tasks in a useful way both online and in the real world.

Future development potential

Genie 2 is considered a major step forward toward AGI (generalized artificial intelligence) and is expected to play a key role in solving structural problems.

Summation

Genie 2 is a new world model that generates a variety of 3D environments based on a single prompt, allowing AI and humans to interact with each other.

It supports AI learning and evaluation through games, and includes various physical interaction and action control functions.
We also pursue innovation in AI research and creation processes through rapid environmental prototyping and large-scale learning.

Although this research is still in its early stages, it shows a wide range of potential applications and great potential for advancement in AI research.
👍