fastapi
18 articles about fastapi in AI news
Building Production-Ready Agentic AI Systems with Docker and FastAPI
Towards AI published a practical guide on deploying production-ready agentic AI systems with FastAPI and Docker. The article covers scalable architecture, orchestration, and enterprise considerations for AI agents.
FastAPI-FullStack: Production-Ready Template for AI Agent Apps with FastAPI, Next.js, and Framework Choice
A new open-source template, fastapi-fullstack, provides a pre-built foundation for deploying AI agent applications. It integrates FastAPI, Next.js, and multiple agent frameworks with WebSocket streaming, authentication, and database support out of the box.
From DIY to MLflow: A Developer's Journey Building an LLM Tracing System
A technical blog details the experience of creating a custom tracing system for LLM applications using FastAPI and Ollama, then migrating to MLflow Tracing. The author discusses practical challenges with spans, traces, and debugging before concluding that established MLOps tools offer better production readiness.
10 Claude Code Skills That Actually Work: A Solo Developer's Vetted List
A curated list of the most effective Claude Code skills for developers, based on hands-on testing, focusing on practical MCP servers and workflow enhancements.
Catching Drift Before It Catches You
The author details implementing the open-source Evidently AI library to monitor a Kafka-powered movie recommender for data drift. This is a hands-on guide to a fundamental MLOps task for maintaining live AI systems.
Stop Using Claude Code for Small Edits
Claude Code users should stop using it for small edits and adopt a hybrid workflow: Cursor for quick fixes, Claude Code for agentic tasks.
Claude Code's Security Defaults: What It Ships When You Don't Ask
When building auth, uploads, and admin features, Claude Code defaults to importing bcrypt/JWT libraries, while Codex uses standard library functions. Neither adds rate limiting or security headers without explicit prompting.
KIMM's AI-Powered Wheels Adjust Stiffness in Real-Time for Terrain
Researchers at KIMM created wheels that autonomously adjust their stiffness based on terrain. On smooth ground, they stay rigid for efficiency; on rough terrain, they soften and deform to conform to obstacles.
How This Developer's PTC Pattern Cuts Financial Data Token Burn by 90%
Learn the PTC pattern that wraps MCP servers in Python modules, letting Claude Code process financial data in-workspace instead of in-context.
Building a Multimodal Product Similarity Engine for Fashion Retail
A practical guide to constructing a product similarity engine for fashion retail. It uses multimodal embeddings from text and images to find similar items, a core capability for recommendations and search.
Dify AI Workflow Platform Hits 136K GitHub Stars as Low-Code AI App Builder Gains Momentum
Dify, an open-source platform for building production-ready AI applications, has reached 136K stars on GitHub. The platform combines RAG pipelines, agent orchestration, and LLMOps into a unified visual interface, eliminating the need to stitch together multiple tools.
How Weaviate Agent Skills Let Claude Code Build Vector Apps in Minutes
Weaviate's official Agent Skills give Claude Code structured access to vector databases, eliminating guesswork when building semantic search and RAG applications.
How to Manage Skills Across Multiple Repos with a Centralized Registry
Use symlinks and a dispatch skill to share Claude Code skills across projects without plugin namespacing overhead.
Stop Letting Claude Code Write Repetitive Code—Make It Write Generators Instead
The most effective token-saving technique isn't cheaper models or tiny prompts—it's making Claude Code write small scripts that generate repetitive code for you.
What 19M+ Claude Code Commits Tell Us About Real-World Usage
A new dashboard tracking Claude Code's GitHub footprint reveals TypeScript dominance, massive net code growth, and how developers are using it to ship.
Starlette 1.0 Released: Python ASGI Framework Hits First Stable Version After 8 Years
The Starlette ASGI framework for Python has released version 1.0, its first stable release after nearly eight years of development. The package has been downloaded nearly 10 million times.
Three Agents, One Mission: A Multi-Agent Architecture for Real-Time Fraud Detection
A technical walkthrough of a multi-agent system built with Mesa and XGBoost for real-time fraud detection. It moves beyond a simple classifier to a complete, observable, and actionable pipeline.
Why Companies End Up Using Triton Inference Server: A Simple Case Study
A case study explains the common journey from a simple ML experiment to a production system requiring a robust inference server like NVIDIA's Triton, highlighting its role in managing multi-model, multi-framework deployments at scale.