AI Speed Box

AI News and Information Link Collection (Mint Bear's ignorant SNS scraps, matured to become Visual AI News)
2025 AI Era Human Intelligence Conference
  1. AI
 
 
2025/01/12
Limited Release (Partial Release)
Voice Cursor
  1. AI Sound
  1. ETC sound
 
 
 
 
2024/12/22
Available Now (Available)
Photoshop (Beta) New Feature: Select Body Parts
  1. AI Image
  1. Adobe Photoshop
 
 
 
 
2024/12/19
Available Now (Available)
Report: AIBRAHAM
Kling 1.6
  1. AI Video
  1. Kling
 
 
 
 
2024/12/19
Available Now (Available)
Ideogram Batch Generation
  1. AI Image
  1. Ideogram
 
2024/12/18
Available Now (Available)
Midjourney Office Hours (2024-12-18)
  1. AI Image
  1. Midjourney
 
 
2024/12/18
Coming Soon
Veo 2
  1. AI Video
  1. _Google
 
 
 
 
2024/12/17
Coming Soon
Midjourney Moodboards
  1. AI Image
  1. Midjourney
2024/12/17
Available Now (Available)
Google's New AI Glasses (Android XR)
  1. AR, XR, VR
  1. _Google
 
 
2024/12/16
Coming Soon
Video watermarking technology, Meta Video Seal
  1. AI Video
  1. _Meta
 
 
 
 
2024/12/15
Available Now (Available)
Pika 2.0 Update
  1. AI Video
  1. Pika
 
 
 
2024/12/15
Available Now (Available)
Motivo by Meta
  1. AI 3D
  1. _Meta
 
 
 
2024/12/15
Available Now (Available)
Leffa by Meta
  1. AI Image
  1. _Meta
 
 
 
2024/12/14
Available Now (Available)
The Gemini 2.0
  1. AI LLM
  1. Genmini
(Coming soon) Google has announced its latest AI model, Gemini 2.0. It can process a variety of input data, including text, images, video, and audio, with improved multimodal capabilities. Native image generation and adjustable text-to-speech (TTS) capabilities have been added.

You can edit images with dialogue. This is a really crazy feature...
https://x.com/GoogleDeepMind/status/1867261817791427026
https://gemini.google.com/app
https://aistudio.google.com/u/1/prompts/new_chat
2024/12/13
Limited Release (Partial Release)
Krea Editor Updates
  1. AI Image
  1. Krea
 
 
2024/12/13
Trellis Trellis 3D
  1. AI 3D
  1. 3D
 
 
2024/12/12
Available Now (Available)
Rodin
  1. AI 3D
  1. ETC
 
 
2024/12/12
Available Now (Available)
Midjourney Patchwork
  1. AI Image
  1. Midjourney
 
 
2024/12/12
Available Now (Available)
DiffSensei
  1. AI Toons
  1. ETC toons
 
 
 
 
2024/12/11
Available Now (Available)
MMAudio: Video-to-Audio Synthesis
  1. AI Sound
  1. ETC sound
 
 
 
 
2024/12/11
Available Now (Available)
Sora v2 showing in London
  1. AI Video
  1. Sora
 
 
 
2024/12/09
Available Now (Available)
Leonardo - FlowState
  1. AI Image
  1. Leonardo
 
 
 
 
2024/12/07
Available Now (Available)
ElevenLabs _ Conversational AI
  1. AI Sound
  1. ElevenLabs
 
 
2024/12/06
Coming Soon
Open AI, 12 days of live
  1. AI
  1. OpenAI
  2. OpenAI o1
  3. Sora
 
 
2024/12/05
Limited Release (Partial Release)
Google DeepMind just dropped Genie 2
  1. AI Video
 
 
 
2024/12/05
Available Now (Available)
Swift-Edit
  1. AI Image
  1. ETC Image
 
 
 
2024/12/05
Available Now (Available)
Midjourney Office Hours (2024-12-04)
  1. AI Image
  1. Midjourney
 
 
2024/12/04
Coming Soon
Gen3 KeyFraming (Prototype)
  1. AI Video
  1. Gen-3
 
 
2024/12/03
Coming Soon
Hunyuan Video by Tencent
  1. AI Video
  2. AI Sound
  1. Hunyuan
 
 
 
 
2024/12/03
Available Now (Available)
Motion Prompting (Google DeepMind)
  1. AI Video
  1. _Google
 
 
 
 
2024/12/03
Coming Soon

The Gemini 2.0

Category
  1. AI LLM
Gen
  1. Genmini
Date
2024/12/13
Summary 🍀🧸
(Coming soon) Google has announced its latest AI model, Gemini 2.0. It can process a variety of input data, including text, images, video, and audio, with improved multimodal capabilities. Native image generation and adjustable text-to-speech (TTS) capabilities have been added.

You can edit images with dialogue. This is a really crazy feature...
URL
https://x.com/GoogleDeepMind/status/1867261817791427026
URL
https://gemini.google.com/app
URL
https://aistudio.google.com/u/1/prompts/new_chat
Release
Limited Release (Partial Release)
The Gemini 2.0 era has begun, and we look forward to you creating something new with it.
Here’s a quick recap of what’s been announced recently ⏪
Gemini 2.0 Flash ⚡
Lower latency and improved performance.
🔵 @GeminiApp is now available as an experimental version on the web, where Gemini Advanced users can try out our new AI research assistant, Deep Research.
🔵 Developers can get started developing with the Gemini API via @Google AI Studio and Vertex AI.
2.0 also enables new AI agent research prototypes.
🔵 Project Astra: Exploring the future possibilities of general-purpose AI assistants
🔵 Project Mariner: Showcasing the potential for human-agent interaction, starting with the browser
🔵 Jules: Experimental AI-based coding agent
Finally, 2.0 is exploring how agents can be used in a variety of domains, from exploring virtual worlds in video games to applying spatial reasoning abilities to robotics. 🤖
The Gemini 2.0 era is here. And we're excited for you to start building with it.
A quick rewind of what we just released ⏪
Gemini 2.0 Flash ⚡ comes with low latency and better performance.
🔵You can now access an experimental version in
@GeminiApp
on the web, while Gemini Advanced users can try Deep Research, a new AI research assistant.
🔵 Developers can begin building through the Gemini API in
@Google
AI Studio and Vertex AI
2.0 is also enabling new research prototypes of AI agents, including:
🔵 Project Astra, which explores future capabilities of a universal AI assistant
🔵 Project Mariner, which shows what's possible for human-agent interaction, starting with your browser
🔵 Jules, an experimental AI-powered coding agent
Finally, we're exploring how 2.0 can be used in agents across domains — from navigating the virtual world of video games to applying its spatial reasoning capabilities to robotics. 🤖
👍