midjourney

30 articles about Midjourney in AI news

Midjourney Plans 60-Second Ultrasound Spa in SF by 2027

Midjourney plans a San Francisco spa for 2027 offering 60-second ultrasound scans, which it aims to make 100x faster than MRI.

Jun 18, 2026 | 75% relevant

Prompt Master: Free, Open-Source Claude Skill Generates Optimized Prompts for 18+ AI Tools

A new, free, and open-source Claude skill called Prompt Master generates optimized prompts for over 18 AI tools—including ChatGPT, Midjourney, and Cursor—on the first attempt, aiming to reduce wasted credits and re-prompts.

Mar 27, 2026 | 85% relevant

Google's Gemini 3.1 Flash Image: A New Contender in the AI Visual Generation Race

Google is reportedly developing Gemini 3.1 Flash Image, a specialized image generation model that could challenge Midjourney and DALL-E 3. This lightweight variant promises faster, more efficient visual creation while expanding Google's multimodal AI ecosystem.

Feb 25, 2026 | 85% relevant

ByteDance Open-Sources BAGEL: 7B Multimodal Model for Image Gen, Editing, Understanding

ByteDance open-sourced BAGEL, a 7B multimodal model for image generation, editing, style transfer, and understanding, under the Apache 2.0 license.

May 28, 2026 | 95% relevant

PerfectSquashBench Tests Image Model Anchoring Bias vs. Text Models

Wharton professor Ethan Mollick released PerfectSquashBench, a test showing image generation models exhibit stronger anchoring bias than text models, getting 'stuck' on initial directions and requiring context window clearing.

Apr 22, 2026 | 85% relevant

GPT-ImageGen-2 Likely Uses AI Models as Prompt Generators

Evidence suggests OpenAI's upcoming image model, GPT-ImageGen-2, operates as a tool where AI models generate the prompts, not users. This marks a shift from the transparent prompt display seen in DALL-E 3.

Apr 22, 2026 | 85% relevant

GPT-5.4 LLM Choice Drastically Impacts GPT-ImageGen-2 Output Quality

The quality of images generated by GPT-ImageGen-2 is heavily dependent on the underlying LLM used for reasoning. GPT-5.4 'Thinking' and 'Pro' models produce superior outputs, especially for complex concepts, a non-intuitive finding not documented by OpenAI.

Apr 22, 2026 | 85% relevant

Free-Claude-Code Proxy Routes Anthropic API to Free NVIDIA NIM Models

A developer released free-claude-code, a proxy that intercepts Claude Code's API calls and routes them to free NVIDIA NIM endpoints, unlocking free access to models like Kimi K2 and GLM 4.7. This bypasses Anthropic's subscription fees and adds remote execution via a Telegram bot.

Apr 22, 2026 | 91% relevant

GPT ImageGen-2 Passes 'Otter Test', Generates Academic Papers

Wharton professor Ethan Mollick reports OpenAI's GPT ImageGen-2 now reliably generates complex text within images, including academic papers and slides, marking a significant leap in multimodal AI capability.

Apr 21, 2026 | 83% relevant

GPT-Image-2 Adds Self-Review Loop for Iterative Image Correction

A new capability in GPT-Image-2 allows the model to review and iteratively correct its own image generations, aiming for higher accuracy before final output.

Apr 21, 2026 | 85% relevant

GPT-5.5 Generates Complex SVG in Single Prompt, User Reports

A developer shared that OpenAI's GPT-5.5 produced a sophisticated SVG image from a single prompt. This suggests improvements in the model's ability to generate precise, structured visual code.

Apr 19, 2026 | 85% relevant

Claude AI Generates Weekly Meal Plans with Nutrition Goals

A prompt library demonstrates Claude's ability to create personalized weekly meal plans that meet specific nutrition targets, potentially saving users hundreds on groceries and dietitian fees.

Apr 19, 2026 | 89% relevant

Ethan Mollick on AI's Impact: 'Everything Is Someone's Life Work' No Longer True

AI researcher Ethan Mollick notes the foundational assumption that 'everything around me is somebody's life work' is being invalidated by generative AI, signaling a profound shift in how we value human output.

Apr 18, 2026 | 85% relevant

OpenAI Codex Gains Screen Control, Long-Run Agents, and 90+ Plugins

OpenAI has upgraded Codex from a code-completion tool to an agentic macOS assistant that can see and click on-screen elements, run autonomously for weeks, and integrate with 90+ dev tools. This marks a strategic move into persistent, multimodal coding agents.

Apr 16, 2026 | 86% relevant

Google's PaperBanana AI Generates Academic Diagrams, Beats Human Designs 3:1

Google released PaperBanana, an AI system that transforms raw methodology text into publication-ready academic diagrams using a 5-agent creative pipeline. In blind evaluations, humans preferred its outputs nearly 3 out of 4 times over manually designed figures.

Apr 16, 2026 | 95% relevant

Google Launches Gemini 3.1 Flash TTS with Prompt-Controlled Speech

Google has launched Gemini 3.1 Flash TTS, a text-to-speech model featuring prompt-based voice control and support for over 70 languages. This release expands Google's multimodal AI offerings directly to developers.

Apr 15, 2026 | 93% relevant

Anthropic Opus 4.7, ChatGPT Image 2 Rumored for Imminent Release

Analyst speculation suggests Anthropic's Claude Opus 4.7 and OpenAI's ChatGPT Image 2 could launch imminently, with DeepSeek's expected release next week creating competitive urgency.

Apr 15, 2026 | 89% relevant

ChatGPT App Code Hints at Upcoming Image Feature Announcement

A developer found new strings in the ChatGPT app's code referencing an 'image announcement,' signaling a likely upcoming feature reveal from OpenAI.

Apr 15, 2026 | 85% relevant

AI-Generated Content Surpasses Human Content Online, Per New Study

For the first time, the volume of newly published AI-generated content online has surpassed human-generated content, according to a study cited by AI researcher Rohan Paul. This represents a fundamental shift in the composition of the public internet.

Apr 14, 2026 | 87% relevant

Inflection's MAI-Image-2-Efficient: 22% Faster, 4x More Efficient

Inflection AI has released MAI-Image-2-Efficient, a production-ready image generation model claimed to be 22% faster and 4x more efficient than its predecessor while maintaining quality.

Apr 14, 2026 | 85% relevant

HeyGen Launches CLI Tool for AI Video Generation from Terminal

AI video platform HeyGen has launched a CLI tool, allowing users to generate videos with avatars, voice, and script via terminal commands. This moves video synthesis from a web dashboard into developer workflows.

Apr 13, 2026 | 85% relevant

Demis Hassabis: AI Tools Enable Billion-Dollar Startups by 'Kids'

Demis Hassabis stated that current AI tools are so powerful that young entrepreneurs could build multi-billion-dollar businesses by discovering novel applications, because labs focus on model development rather than on exhausting use cases.

Apr 10, 2026 | 75% relevant

VoxCPM2 Open-Source Voice AI Outperforms ElevenLabs on Key Benchmarks

Researchers from OpenBMB and Tsinghua University released VoxCPM2, a 2B-parameter open-source voice AI that clones voices from short clips and creates voices from text descriptions. It outperforms ElevenLabs on the Minimax-MLS benchmark and runs locally with no API costs.

Apr 10, 2026 | 95% relevant

PixVerse V6 Launches: 15-Second 1080P Video with Full Audio

AI video startup PixVerse launched its V6 model, capable of generating 15-second, 1080p videos with full audio from text prompts. This marks a significant upgrade in output length and quality for the platform.

Apr 9, 2026 | 89% relevant

Gamma Launches 'Imagine' AI for Instant Presentation Visuals

Gamma launched an AI feature called 'Imagine' that creates presentation-ready visuals from text descriptions. The tool aims to reduce reliance on designers for routine business slides.

Apr 6, 2026 | 87% relevant

Chamath Palihapitiya: AI's Biggest Profits Won't Go to Model Makers

VC Chamath Palihapitiya posits that the greatest financial winners in AI will be application builders with unique distribution, not the foundational model creators, drawing a parallel to refrigeration and Coca-Cola.

Apr 5, 2026 | 75% relevant

NemoVideo AI Automates Video Editing Based on Text Prompts

A video creator states NemoVideo AI now automates complex editing tasks like cuts and transitions from simple text descriptions, reducing a 5-hour manual process to a prompt-driven workflow.

Apr 5, 2026 | 85% relevant

Meta Halts Mercor Work After Supply Chain Breach Exposes AI Training Secrets

A supply chain attack via compromised software updates at data-labeling vendor Mercor has forced Meta to pause collaboration, risking exposure of core AI training pipelines and quality metrics used by top labs.

Apr 4, 2026 | 97% relevant

OpenAI Image Generation V2 Release Imminent, Per Leak

A post from a known leaker indicates OpenAI's next image generation model, potentially DALL-E 4, is about to be released. This would mark a major competitive move in the rapidly evolving text-to-image space.

Apr 4, 2026 | 87% relevant

OpenAI Testing New Image Model in ChatGPT, User Reports 'Very Good'

A user reports OpenAI is testing a new image generation model in ChatGPT, describing its output as 'very good.' This signals ongoing internal development of visual AI capabilities.

Apr 4, 2026 | 85% relevant
