gentic.news — AI News Intelligence Platform

azure

30 articles about Azure in AI news

OpenAI-Microsoft Deal: Loosens Ties, Boosts Azure AI Revenue

A new deal between OpenAI and Microsoft reduces Microsoft's grip on OpenAI while significantly boosting Azure AI revenue, with no profit-sharing for Microsoft on OpenAI's earnings.

82% relevant

Microsoft's Fairwater AI Data Center Launches Early, Boosts Azure Capacity

Microsoft has launched its Fairwater AI data center ahead of schedule. The facility adds significant high-performance computing capacity to Azure's AI infrastructure, crucial for training and running large models.

92% relevant

Azure ML Workspace with Terraform: A Technical Guide to Infrastructure-as-Code for ML Platforms

The source is a technical tutorial on Medium explaining how to deploy an Azure Machine Learning workspace—the central hub for experiments, models, and pipelines—using Terraform for infrastructure-as-code. This matters for teams seeking consistent, version-controlled, and automated cloud ML infrastructure.

76% relevant

Building Sequential AI Workflows with Microsoft Agent Framework and Azure AI Foundry

A technical walkthrough of implementing a sequential agent workflow for security incident triage using Microsoft's Agent Framework and Azure AI Foundry. Demonstrates how to structure multi-stage AI processes where each agent builds on previous outputs with full conversation history.

94% relevant

Fine-Tuning Strategies for AI Agents on Azure: Balancing Accuracy, Cost, and Performance

A technical guide explores strategies for fine-tuning AI agents on Microsoft Azure, focusing on the critical trade-offs between model accuracy, operational cost, and system performance. This is essential for teams deploying autonomous AI systems in production environments.

95% relevant

How to Build a Remote MCP Server for Azure Data Explorer (Kusto)

Connect Claude Code directly to your Kusto database using a custom Remote MCP Server, enabling natural language queries without manual SQL/KQL.

87% relevant

AWS Launches 'Generative AI on AWS' Developer Hub

AWS has launched 'Generative AI on AWS,' a new central portal for its AI services, SDKs, and tutorials. This move consolidates its offerings to better compete with Google's Vertex AI and Microsoft's Azure AI Studio.

85% relevant

Why I Skipped LLMs to Extract Data From 100,000 Wills: A System Design Story

An engineer details a deterministic, high-accuracy document processing pipeline for legal wills using Azure's Content Understanding model, rejecting LLMs due to hallucination risk and cost. A masterclass in pragmatic AI system design.

85% relevant

Anthropic Publishes Zero-Trust Architecture for AI Agents

Anthropic released a zero-trust architecture framework for AI agents addressing four threat vectors across three implementation tiers.

85% relevant

Oracle Builds Custom MCP Server for OCI Cloud Management via Natural Language

Oracle released a custom MCP server for OCI, enabling natural-language cloud management. Oracle is billed as the first major cloud provider to ship a first-party MCP server.

80% relevant

Microsoft Open-Sources AI Engineer Coach, a Fitbit for Dev Workflows

Microsoft open-sourced AI Engineer Coach, a VS Code extension that scores developer AI workflow quality across 5 categories with 45 anti-pattern rules.

95% relevant

Compute Shortage to Split AI Market: Rich Get Agents, Poor Get Chatbots

Mollick warns that a compute shortage is making agents expensive while chatbots get cheaper, splitting the AI market by company resources.

75% relevant

Vibe-Coding Bottleneck: CPU Box Rental Gets Harder

SemiAnalysis flags that the vibe-coding wave is making cheap CPU box rentals less routine, bottlenecking developers who need quick cloud compute for AI prototyping.

75% relevant

Cerebras WSE-3 Claims 10x Training Speed Over Nvidia H100 on GPT-Scale Model

Cerebras claims 10x training speed over Nvidia H100 for GPT-3-scale models using WSE-3. The benchmark lacks power and cost data, limiting independent verification.

64% relevant

NHN Deploys 7,656-GPU AI Cluster in Seoul

NHN launched a 7,656-GPU cluster in Seoul, South Korea, for domestic enterprise AI workloads. The cluster targets inference and training, competing with Naver and Kakao.

90% relevant

12-Metric Agent Eval Framework From 100+ Deployments Hits Production

A 12-metric evaluation framework for production AI agents, drawn from 100+ deployments, targets task success, cost, latency, tool use, and safety.

74% relevant

Agentic Commerce: 50% of Online Transactions by 2027, Google Cloud Leads

Agents are projected to handle 50% of online transactions by 2027. Payment reliability determines winners in agentic commerce, with Google Cloud leading enterprise rollouts.

94% relevant

GitHub Secret Scanning Now Supports MCP Server in GA

GitHub made its Secret Scanning MCP Server generally available, letting AI agents automate credential leak remediation via Anthropic's Model Context Protocol (MCP).

90% relevant

CoreWeave Tops Kimi K2.6 Inference Speed

CoreWeave tops 10 other providers on speed and price-performance for Moonshot AI's Kimi K2.6 in an Artificial Analysis benchmark.

81% relevant

Hermes Agent Gets Desktop App for Autonomous AI Workflows

Nous Research released a desktop app for Hermes Agent, moving from CLI to native UI with multi-agent management and persistent memory.

93% relevant

AWS Builds First Payment API for Agentic AI — Agents Can Now Checkout

AWS launched the first payment API for autonomous agents, enabling agent-initiated transactions. This closes a critical gap for enterprise retail agentic AI workflows.

88% relevant

Anthropic Doubles Claude Code Rate Limits, Leases All of SpaceX's Colossus 1

Anthropic doubled Claude Code's 5-hour rate limits and removed peak-hour throttling for Pro, Max, Team, and seat-based Enterprise plans, then disclosed the source of the new capacity: a lease on the entire Colossus 1 data center — 300 MW and ~220,000 NVIDIA GPUs in Memphis — that SpaceX absorbed when it took over xAI.

100% relevant

Anthropic Ships 10 Finance AI Agents as IPO Race with OpenAI Heats Up

Anthropic released 10 finance AI agents with Moody's data connectors. The launch intensifies the IPO race with OpenAI, backed by a $1.5B private equity JV.

98% relevant

Amazon's SageMaker Agentic Fine-Tuning Supports Llama, Qwen, DeepSeek, Nova

Amazon launched an AI agent on SageMaker that automates fine-tuning of Llama, Qwen, DeepSeek, and Nova models via plain-language instructions, abstracting API fragmentation.

90% relevant

Nscale to Deploy 66K+ Rubin GPUs for Microsoft in Portugal

Nscale will deploy 66,000+ NVIDIA Rubin GPUs for Microsoft at Portugal's Start Campus. The deal is a first for Rubin and signals Microsoft's geographic diversification.

80% relevant

OpenAI Claims 10GW AI Infrastructure Capacity Ahead of 2029 Target

OpenAI claims it has secured 10GW of AI infrastructure capacity, adding 3GW in 90 days, ahead of its 2029 target.

100% relevant

Hyperscalers' $90B+ Quarterly Capex Shows AI Demand Outstrips Supply

Amazon, Google, and Meta spent $90B+ quarterly on AI infrastructure, and Google Cloud's backlog hit $460B. Demand outstrips supply, with providers pre-selling capacity that is not yet built.

98% relevant

Microsoft AI Run Rate Hits $37B as $627B Backlog Reveals Capacity Gap

Microsoft's $37B AI run rate and $627B backlog show AI demand outpacing data center capacity, with delivery windows stretching to 18 months.

90% relevant

Microsoft: LLMs Corrupt 25% of Docs in Long Edits

A Microsoft paper shows that LLMs corrupt ~25% of documents across 52 domains during 20-edit sessions, with failures compounding silently.

90% relevant

Open-Weight 1T Model Inference Margins Hit 88% on Rented GPUs

Renting a 128-GPU cluster to serve a 1T-parameter open-weight model yields ~88% margin on tokens sold at $0.002 per 1K, exposing a structural arbitrage over proprietary APIs.

85% relevant