azure
30 articles about azure in AI news
OpenAI-Microsoft Deal: Loosens Ties, Boosts Azure AI Revenue
A new deal between OpenAI and Microsoft reduces Microsoft's grip on OpenAI while significantly boosting Azure AI revenue, with no profit-sharing for Microsoft on OpenAI's earnings.
Microsoft's Fairwater AI Data Center Launches Early, Boosts Azure Capacity
Microsoft has launched its Fairwater AI data center ahead of schedule. The facility adds significant high-performance computing capacity to Azure's AI infrastructure, crucial for training and running large models.
Azure ML Workspace with Terraform: A Technical Guide to Infrastructure-as-Code for ML Platforms
The source is a technical tutorial on Medium explaining how to deploy an Azure Machine Learning workspace—the central hub for experiments, models, and pipelines—using Terraform for infrastructure-as-code. This matters for teams seeking consistent, version-controlled, and automated cloud ML infrastructure.
Building Sequential AI Workflows with Microsoft Agent Framework and Azure AI Foundry
A technical walkthrough of implementing a sequential agent workflow for security incident triage using Microsoft's Agent Framework and Azure AI Foundry. Demonstrates how to structure multi-stage AI processes where each agent builds on previous outputs with full conversation history.
Fine-Tuning Strategies for AI Agents on Azure: Balancing Accuracy, Cost, and Performance
A technical guide explores strategies for fine-tuning AI agents on Microsoft Azure, focusing on the critical trade-offs between model accuracy, operational cost, and system performance. This is essential for teams deploying autonomous AI systems in production environments.
How to Build a Remote MCP Server for Azure Data Explorer (Kusto)
Connect Claude Code directly to your Kusto database using a custom Remote MCP Server, enabling natural language queries without manual SQL/KQL.
AWS Launches 'Generative AI on AWS' Developer Hub
AWS has launched 'Generative AI on AWS,' a new central portal for its AI services, SDKs, and tutorials. This move consolidates its offerings to better compete with Google's Vertex AI and Microsoft's Azure AI Studio.
Why I Skipped LLMs to Extract Data From 100,000 Wills: A System Design Story
An engineer details a deterministic, high-accuracy document processing pipeline for legal wills using Azure's Content Understanding model, rejecting LLMs due to hallucination risk and cost. A masterclass in pragmatic AI system design.
Anthropic Publishes Zero-Trust Architecture for AI Agents
Anthropic released a zero-trust architecture framework for AI agents addressing four threat vectors across three implementation tiers.
Oracle Builds Custom MCP Server for OCI Cloud Management via Natural Language
Oracle released a custom MCP server for OCI, enabling natural-language cloud management. First major cloud provider to ship a first-party MCP server.
Microsoft Open-Sources AI Engineer Coach, a Fitbit for Dev Workflows
Microsoft open-sourced AI Engineer Coach, a VS Code extension that scores developer AI workflow quality across 5 categories with 45 anti-pattern rules.
Compute Shortage to Split AI Market: Rich Get Agents, Poor Get Chatbots
Mollick warns compute shortage makes agents expensive while chatbots cheapen, splitting AI market by company resources.
Vibe-Coding Bottleneck: CPU Box Rental Gets Harder
SemiAnalysis flags that vibe-coding wave makes cheap CPU box rentals less routine, bottlenecking developers who need quick cloud compute for AI prototyping.
Cerebras WSE-3 Claims 10x Training Speed Over Nvidia H100 on GPT-Scale Model
Cerebras claims 10x training speed over Nvidia H100 for GPT-3-scale models using WSE-3. Benchmark lacks power and cost data, limiting independent verification.
NHN Deploys 7,656-GPU AI Cluster in Seoul
NHN launched a 7,656-GPU cluster in Seoul, South Korea, for domestic enterprise AI workloads. The cluster targets inference and training, competing with Naver and Kakao.
12-Metric Agent Eval Framework From 100+ Deployments Hits Production
12-metric evaluation framework for production AI agents from 100+ deployments targets task success, cost, latency, tool use, and safety.
Agentic Commerce: 50% of Online Transactions by 2027, Google Cloud Leads
Agents projected to handle 50% of online transactions by 2027. Payment reliability determines winners in agentic commerce, with Google Cloud leading enterprise rollouts.
GitHub Secret Scanning Now Supports MCP Server in GA
GitHub GA'd its Secret Scanning MCP Server, letting AI agents automate credential leak remediation via Anthropic's protocol.
CoreWeave Tops Kimi K2.6 Inference Speed
CoreWeave tops 10 other providers on speed and price-performance for Moonshot AI's Kimi K2.6 in Artificial Analysis benchmark.
Hermes Agent Gets Desktop App for Autonomous AI Workflows
Nous Research released a desktop app for Hermes Agent, moving from CLI to native UI with multi-agent management and persistent memory.
AWS Builds First Payment API for Agentic AI — Agents Can Now Checkout
AWS launched first payment API for autonomous agents, enabling agent-initiated transactions. Closes critical gap for enterprise retail agentic AI workflows.
Anthropic Doubles Claude Code Rate Limits, Leases All of SpaceX's Colossus 1
Anthropic doubled Claude Code's 5-hour rate limits and removed peak-hour throttling for Pro, Max, Team, and seat-based Enterprise plans, then disclosed the source of the new capacity: a lease on the entire Colossus 1 data center — 300 MW and ~220,000 NVIDIA GPUs in Memphis — that SpaceX absorbed when it took over xAI.
Anthropic Ships 10 Finance AI Agents as IPO Race with OpenAI Heats Up
Anthropic released 10 finance AI agents with Moody's data connectors. The launch intensifies the IPO race with OpenAI, backed by a $1.5B private equity JV.
Amazon's SageMaker Agentic Fine-Tuning Supports Llama, Qwen, DeepSeek, Nova
Amazon launched an AI agent on SageMaker that automates fine-tuning of Llama, Qwen, DeepSeek, and Nova models via plain-language instructions, abstracting API fragmentation.
Nscale to Deploy 66K+ Rubin GPUs for Microsoft in Portugal
Nscale will deploy 66,000+ NVIDIA Rubin GPUs for Microsoft at Portugal's Start Campus. The deal is a first for Rubin and signals Microsoft's geographic diversification.
OpenAI Claims 10GW AI Infrastructure Capacity Ahead of 2029 Target
OpenAI claims 10GW AI infrastructure capacity secured, adding 3GW in 90 days, ahead of 2029 target.
Hyperscalers' $90B+ Quarterly Capex Shows AI Demand Outstrips Supply
Amazon, Google, Meta spent $90B+ quarterly on AI infra; Google Cloud backlog hit $460B. Demand outstrips supply, pre-selling capacity not yet built.
Microsoft AI Run Rate Hits $37B as $627B Backlog Reveals Capacity Gap
Microsoft's $37B AI run rate and $627B backlog show AI demand outpacing data center capacity, with delivery windows stretching to 18 months.
Microsoft: LLMs Corrupt 25% of Docs in Long Edits
Microsoft paper shows LLMs corrupt ~25% of documents across 52 domains during 20-edit sessions, with failures compounding silently.
Open-Weight 1T Model Inference Margins Hit 88% on Rented GPUs
Renting a 128 GPU cluster to serve a 1T open model yields ~88% margin on tokens sold at $0.002/1K, exposing a structural arbitrage over proprietary APIs.