pop up
30 articles about pop up in AI news
Gustaf Westman Pop-Up Opens at Selfridges Through July 25
The Gustaf Westman pop-up at the Selfridges Corner Shop is open until July 25 and showcases colorful, chunky designs.
Ollama Now Supports Apple MLX Backend for Local LLM Inference on macOS
Ollama, the popular framework for running large language models locally, has added support for Apple's MLX framework as a backend. This enables more efficient execution of models like Llama 3.2 and Mistral on Apple Silicon Macs.
Ollama Now Runs Codex Locally: DeepSeek V4, Gemma 4, Qwen 3.6 Supported
Ollama has integrated Codex support for DeepSeek V4, Gemma 4, and Qwen 3.6, enabling free local code generation and challenging OpenAI's API model.
GPT-5.5 Launches: The Super App Strategy, Not the Model
OpenAI released GPT-5.5, codenamed Spud, 48 days after GPT-5.4. The model itself is less interesting than the super app strategy, the 35x cost reduction on GB200 hardware, and a 48-day release cadence that signals a deliberate acceleration.
NVIDIA Nemotron 3 Super: 120B Hybrid Mamba-Transformer MoE with 1M Context
NVIDIA has released Nemotron 3 Super, a 120B parameter open hybrid Mamba-Transformer Mixture of Experts model with 12B active parameters and 1M token context length. The company claims it delivers up to 7.5x higher throughput than similar open models.
Pinterest's Request-Level Deduplication
Pinterest's engineering blog details 'request-level deduplication,' a critical efficiency technique for modern recommendation systems. By eliminating redundant processing of massive user sequences, they achieve 10-50x storage compression and significant training speedups, while solving novel training challenges like batch correlation.
Kimi 2.6 Code Model Teased in Leaked Image, Suggesting Moonshot AI Update
A screenshot circulating online appears to show a 'Kimi 2.6' code model interface, suggesting Moonshot AI is preparing an update to its Kimi Chat platform focused on coding tasks.
AI Compute Crisis: GPU Prices Up 48%, Anthropic API at 98.95% Uptime
The AI industry faces a severe compute capacity crisis, with GPU prices up 48%, Anthropic API uptime falling to 98.95%, and OpenAI shutting down Sora to reallocate resources. Demand for agentic AI is outstripping supply, forcing rationing and product cancellations.
MiniMax M2.7 Model Deploys on NVIDIA NIM Endpoints with OpenClaw Support
Chinese AI firm MiniMax has made its M2.7 model available through NVIDIA's GPU-accelerated NIM endpoints. This deployment includes support for the OpenClaw and NemoClaw frameworks, integrating it into a major AI development ecosystem.
Coupang Eats Secures Patent for Budget-Based Food Recommendation System
Coupang Eats has been granted a patent for a food recommendation engine that factors in a user's defined budget. This system aims to provide more relevant suggestions than basic price filters by integrating budget as a core ranking signal. It represents a strategic move to enhance user experience and conversion in the competitive delivery market.
MLX-LM v0.9.0 Adds Better Batching, Supports Gemma 4 on Apple Silicon
Apple's MLX-LM framework released version 0.9.0 with enhanced server batching and support for Google's Gemma 4 model, improving local LLM inference efficiency on Apple Silicon. This update addresses a key performance bottleneck for developers running models locally on Mac hardware.
mlx-vlm v0.4.4 Launches with Falcon-Perception 300M, TurboQuant Metal Kernels & 1.9x Decode Speedup
The mlx-vlm library v0.4.4 adds support for TII's Falcon-Perception 300M vision model and introduces TurboQuant Metal kernels, achieving up to 1.9x faster decoding with 89% KV cache savings on Apple Silicon.
Axios Supply Chain Attack Highlights AI-Powered Social Engineering Threat to Open Source
The recent Axios npm package supply chain attack was initiated by highly sophisticated social engineering targeting a developer. This incident signals a dangerous escalation in the targeting of open source infrastructure, where AI tools could amplify attacker capabilities.
LimX's Oli Robot Demonstrates Autonomous Unboxing and Boot-Up via 31-DoF System
LimX's Oli robot autonomously exited its shipping container, powered up its 31-degree-of-freedom system, and began moving. The demo highlights progress in self-contained robotic deployment without human setup.
Axios NPM Package Under Active Supply Chain Attack, Potentially Impacts 100M+ Weekly Installs
The widely used JavaScript HTTP client library Axios may be compromised via a malicious dependency in its latest release that exhibits malware-like behavior, including shell execution and artifact cleanup. With over 100 million weekly downloads, this represents a critical software supply chain threat.
The 5-Minute Claude Code Setup: What 'Bare Minimum' Actually Means in 2026
Forget complex configs. The 'bare minimum' Claude Code setup is now just three files: CLAUDE.md, .claudeignore, and a skills registry. Here's how to structure them.
PyPI Quarantines LiteLLM Package After Supply Chain Attack Compromises AI Integration Tool
The Python Package Index (PyPI) has quarantined the LiteLLM package after a supply chain attack distributed a malicious update. The action prevents automatic installation of the compromised version via pip.
POP.STORE Launches ECHO-ME, an Agentic AI Platform to Run Creator Businesses
POP.STORE has launched ECHO-ME, an 'agentic AI commerce platform' designed to autonomously manage business operations for creators. It monitors social DMs, detects brand deals, ranks followers, and drives sales, aiming to act as an intelligent operating layer for 15,000 onboarded creators.
Stuart Russell Warns of Rapid AI Self-Improvement: An AI with IQ 150 Could Upgrade Itself to 250
UC Berkeley's Stuart Russell warns that an AI system with human-level intelligence could rapidly self-improve to superintelligent levels, leaving humans behind. A recent Meta paper echoes concerns about the risks of autonomous self-improving systems worsening alignment problems.
Minimax to Release Open Weights in Two Weeks, Highlighting Chinese Startup Momentum
Chinese AI startup Minimax announced it will release open weights within two weeks. This follows a pattern of rapid open-source releases from Chinese firms, contrasting with Meta's more controlled approach.
POP.STORE Launches ECHO-ME: An Agentic AI Commerce Platform for Creators
POP.STORE announced ECHO-ME, an agentic AI platform designed to autonomously run a creator's business operations. It monitors social channels, detects brand deals, and converts fan interactions into revenue, launching with 15,000 creators. This represents a shift from task automation to full business operation for the solo creator economy.
Projection-Augmented Graph (PAG): A New ANNS Framework Claiming 5x Speedup Over HNSW
Researchers propose PAG, a new Approximate Nearest Neighbor Search framework that integrates projection techniques into graph indexes. It claims up to 5x faster query performance than HNSW while meeting six practical demands of modern AI workloads.
AIVideo Agent Emerges: The Fully Autonomous Content Creation System That Requires Zero Setup
A new AI video production system called AIVideo Agent has launched, promising to run entire content pipelines autonomously 24/7 without API keys, technical setup, or configuration screens. Users simply describe what they want, and the system delivers finished video content.
Graph Neural Networks Revolutionize Energy System Modeling with Self-Supervised Spatial Allocation
Researchers have developed a novel Graph Neural Network approach that solves critical spatial resolution mismatches in energy system modeling. The self-supervised method integrates multiple geographical features to create physically meaningful allocation weights, significantly improving accuracy and scalability over traditional methods.
AI Disruption Accelerates: How Claude's New Feature Decimated a Startup Overnight
An AI startup founder reports that their business was devastated overnight when Anthropic released a competing feature in Claude, causing their close rate to plummet from 70% to 20%. The incident highlights the accelerating pace of AI disruption and the platform risk facing startups that build on top of AI models.
Balancing Empathy and Safety: New AI Framework Personalizes Mental Health Support
Researchers have developed a multi-objective alignment framework for AI therapy systems that better balances patient preferences with clinical safety. The approach uses direct preference optimization across six therapeutic dimensions, achieving superior results compared to single-objective methods.
TrapDoor supply-chain attack hits npm, PyPI, Crates.io — weaponizes AI config files
TrapDoor planted 34 malicious packages on npm, PyPI, and Crates.io, and injected poisoned AI config files into repos to weaponize Claude Code and Cursor.
Prefill-as-a-Service Paper Claims to Decouple LLM Inference Bottleneck
A research paper proposes a 'Prefill-as-a-Service' architecture to separate the heavy prefill computation from the lighter decoding phase in LLM inference. This could enable new deployment models where resource-constrained devices handle only the decoding step.
Prince Canuma's M3 Ultra 512GB & RTX Pro 6000 Setup for MLX Research
Independent developer Prince Canuma has assembled a powerful, community-sponsored home compute cluster for MLX research and model porting, featuring an M3 Ultra with 512GB RAM and an RTX Pro 6000.
Alibaba's Qwen Team Teases Qwen 3.6 Model, Signaling Major Open-Source LLM Update
Alibaba's Qwen team has teased the imminent release of Qwen 3.6, the next major version of its open-source large language model series. This follows the release of Qwen 2.5 in late 2024 and signals continued aggressive competition in the open-weight model space.