AI Speed Box

AI News and Information Link Collection (Mintbear's ignorant scraps, matured to become Visual AI News)

All

AI Image

AI Video

AI Sound

AI LLM

AI 3D

AR, XR, VR

AI Toons

AI SNS

FLUX.Context

AI Image

FLUX

2025/05/28

Available Now (Available)

Kling March Update - UI, SFX, Assets, Extend, Bloom Effect

AI Video

Kling

2025/03/27

Available Now (Available)

Midjourney, folder function upgraded Midjourney FOLDER!

AI Image

Midjourney

2025/02/21

Available Now (Available)

Report: AIBRAHAM

Hailuo Effects

AI Video

Hello

2025/02/14

Available Now (Available)

Adobe Video AI: FIREFLY! Adobe has launched Video AI.

AI Video

Adobe Firefly

2025/02/12

Report: Picassong

OpenAI Rebranding

AI LLM

OpenAI
ChatGPT

2025/02/05

Available Now (Available)

Report: Euncheol Kwak

Kling - Creating model images/videos (by Picasso)

AI Video

Kling

2025/02/01

Available Now (Available)

Report: Picassong

🎥 Hailuo T2V-01-Director : Camera Control

AI Video

Hello

2025/01/28

Available Now (Available)

"2025, Get AI in the New Year!"

AI LLM
AI Image
AI Video
AI Sound

Mintbear
OpenAI
ChatGPT
DALLE
Midjourney
Hello

2025/01/26

Available Now (Available)

OpenAI Agent: 'Operator'

OpenAI

2025/01/24

Limited Release (Partial Release)

OpenAI

Kling - KOLORS Image Reference

AI Video

Kling

2025/01/24

Available Now (Available)

Kling Elements! (Multiple characters, appearing at the same time!)

AI Video

Kling

2025/01/21

Limited Release (Partial Release)

Report: Picassong

Edits: Instagram AI Video (Meta, MovieGen, scheduled for March 3)

AI Sound
AI SNS

Edits
_Meta
MovieGen

2025/01/20

Coming Soon

Report: Picassong

Krea 3D .. daebak! (real-time rendering of 3D characters)

AI Image
AI 3D

Krea

2025/01/17

Limited Release (Partial Release)

Kling - Prompt dictionary, preset feature introduced

AI Video

Kling

2025/01/15

2025 AI Era Human Intelligence Conference

AI
AI Image
AI Video
AI Sound

Mintbear

2025/01/12

Event End

Kling Effects

AI Video

Kling

2025/01/10

AI Talk YouTube: OpenAI Workflow

AI
AI LLM
AI Image
AI Video

OpenAI o1
Sora

2025/01/07

Available Now (Available)

Kling - Image KOLORS 1.5 update & EndFrame & Virtual Try-On & LipSync updates

AI Video
AI Image

Kling

2024/12/27

Available Now (Available)

Voice Cursor

AI Sound

ETC sound

2024/12/22

Available Now (Available)

Photoshop (Beta) New Feature: Select Body Parts

AI Image

Adobe Photoshop

2024/12/19

Available Now (Available)

Report: AIBRAHAM

Kling 1.6 Update

AI Video

Kling

2024/12/19

Available Now (Available)

Ideogram Batch Generation

AI Image

Ideogram

2024/12/18

Available Now (Available)

Midjourney Office Hours (2024-12-18)

AI Image

Midjourney

2024/12/18

Coming Soon

Veo 2

AI Video

_Google

2024/12/17

Coming Soon

Midjourney Moodboards

AI Image

Midjourney

2024/12/17

Available Now (Available)

Google's New AI Glasses (Android XR)

AR, XR, VR

_Google

2024/12/16

Coming Soon

Video watermarking technology, Meta Video Seal

AI Video

_Meta

2024/12/15

Available Now (Available)

Pika 2.0 Update

AI Video

Pika

2024/12/15

Available Now (Available)

Motivo by Meta

AI 3D

_Meta

2024/12/15

Available Now (Available)

Hunyuan Video by Tencent

Reference Tencent Official Etc Tech

Reference

Tencent Official

GitHub - Tencent/HunyuanVideo: HunyuanVideo: A Systematic Framework For Large Video Generation Model

HunyuanVideo: A Systematic Framework For Large Video Generation Model - Tencent/HunyuanVideo

github.com

tencent/HunyuanVideo · Discussions

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

huggingface.co

https://yuanbao.tencent.com/chat

How to run HunyuanVideo on a single 24gb VRAM card

How to run HunyuanVideo on a single 24gb VRAM card.

t.co

Etc

Tech

HunyuanVideo is a large-scale model for text-based video generation, adopting the “Dual-stream to Single-stream” hybrid model design to effectively process text and video data.

Dual-stream stage: Text and video tokens are processed independently through multiple Transformer blocks, allowing each modality to learn its own appropriate representation.

Single-stream stage: Effective fusion of multimodal information is achieved by combining text and video tokens and feeding them to subsequent Transformer blocks.

Made with Slashpage