#213 - Midjourney video, Gemini 2.5 Flash-Lite, LiveCodeBench Pro - Last Week in AI Recap

Podcast: Last Week in AI

Published: 2025-06-26

Duration: 37 min

Guests: Daniel Bashir

Summary

The episode delves into significant advancements in AI technology, such as Midjourney's new video generation model, Google's Gemini 2.5 Flash-Lite for efficient AI workloads, and the introduction of the LiveCodeBench Pro benchmark. These developments highlight the ongoing evolution and challenges in AI, particularly in reasoning and efficiency.

What Happened

Midjourney has launched its first AI video generation model, allowing users to create short videos from images and text prompts. This marks Midjourney's expansion beyond text-to-image generation, offering affordable access to video generation technology.

Google has unveiled updates to its Gemini AI family, including the Gemini 2.5 Flash-Lite model, which focuses on cost-effective AI workloads. These models aim to balance performance with efficiency and are designed to cater to different computational needs, indicating Google's strategic approach to maintaining competitiveness in the AI market.

YouTube plans to integrate Google's VO3 video model into its Shorts feature, aiming to enhance content creation on the platform. This integration is expected to boost the quality and virality of user-generated content on YouTube, leveraging advanced AI capabilities.

A new online resource, the OpenAI Files, aggregates critical information about OpenAI's practices and controversies. This resource, developed by tech watchdog organizations, aims to provide transparency and accountability for OpenAI's operations and decisions.

OpenAI has decided to part ways with Scale AI as a data provider, following Scale's CEO's move to Meta. This decision reflects the competitive and strategic dynamics within the AI industry, as companies reassess partnerships based on leadership changes.

The episode highlights the introduction of LiveCodeBench Pro, a benchmark that assesses LLMs' capabilities in competitive programming tasks. The benchmark reveals that current AI models struggle with observation-heavy problems, indicating areas for improvement in AI's reasoning capabilities.

Research from Waymo introduces scaling laws for autonomous vehicle planning, revealing that smaller models can perform well with larger datasets. This finding suggests potential shifts in data collection and model training strategies for self-driving technology development.

Key Insights