fastapi

18 articles about fastapi in AI news

Building Production-Ready Agentic AI Systems with Docker and FastAPI

Towards AI published a practical guide on deploying production-ready agentic AI systems with FastAPI and Docker. The article covers scalable architecture, orchestration, and enterprise considerations for AI agents.

Jun 26, 202666% relevant

FastAPI-FullStack: Production-Ready Template for AI Agent Apps with FastAPI, Next.js, and Framework Choice

A new open-source template, fastapi-fullstack, provides a pre-built foundation for deploying AI agent applications. It integrates FastAPI, Next.js, and multiple agent frameworks with WebSocket streaming, authentication, and database support out of the box.

Mar 20, 202685% relevant

From DIY to MLflow: A Developer's Journey Building an LLM Tracing System

A technical blog details the experience of creating a custom tracing system for LLM applications using FastAPI and Ollama, then migrating to MLflow Tracing. The author discusses practical challenges with spans, traces, and debugging before concluding that established MLOps tools offer better production readiness.

Apr 23, 202684% relevant

10 Claude Code Skills That Actually Work: A Solo Developer's Vetted List

A curated list of the most effective Claude Code skills for developers, based on hands-on testing, focusing on practical MCP servers and workflow enhancements.

Apr 21, 2026100% relevant

Catching Drift Before It Catches You

The author details implementing the open-source Evidently AI library to monitor a Kafka-powered movie recommender for data drift. This is a hands-on guide to a fundamental MLOps task for maintaining live AI systems.

Apr 20, 202696% relevant

Stop Using Claude Code for Small Edits

Claude Code users should stop using it for small edits and adopt a hybrid workflow: Cursor for quick fixes, Claude Code for agentic tasks.

Apr 17, 2026100% relevant

Claude Code's Security Defaults: What It Ships When You Don't Ask

When building auth, uploads, and admin features, Claude Code defaults to importing bcrypt/JWT libraries while Codex uses standard library functions—neither adds rate limiting or security headers without explicit prompting.

Apr 15, 2026100% relevant

KIMM's AI-Powered Wheels Adjust Stiffness in Real-Time for Terrain

Researchers at KIMM created wheels that autonomously adjust their stiffness based on terrain. On smooth ground, they stay rigid for efficiency; on rough terrain, they soften and deform to conform to obstacles.

Apr 11, 202687% relevant

How This Developer's PTC Pattern Cuts Financial Data Token Burn by 90%

Learn the PTC pattern that wraps MCP servers in Python modules, letting Claude Code process financial data in-workspace instead of in-context.

Apr 8, 2026100% relevant

Building a Multimodal Product Similarity Engine for Fashion Retail

The source presents a practical guide to constructing a product similarity engine for fashion retail. It focuses on using multimodal embeddings from text and images to find similar items, a core capability for recommendations and search.

Apr 5, 202696% relevant

Dify AI Workflow Platform Hits 136K GitHub Stars as Low-Code AI App Builder Gains Momentum

Dify, an open-source platform for building production-ready AI applications, has reached 136K stars on GitHub. The platform combines RAG pipelines, agent orchestration, and LLMOps into a unified visual interface, eliminating the need to stitch together multiple tools.

Apr 4, 202687% relevant

How Weaviate Agent Skills Let Claude Code Build Vector Apps in Minutes

Weaviate's official Agent Skills give Claude Code structured access to vector databases, eliminating guesswork when building semantic search and RAG applications.

Mar 27, 202695% relevant

How to Manage Skills Across Multiple Repos with a Centralized Registry

Use symlinks and a dispatch skill to share Claude Code skills across projects without plugin namespacing overhead.

Mar 25, 202672% relevant

Stop Letting Claude Code Write Repetitive Code—Make It Write Generators Instead

The most effective token-saving technique isn't cheaper models or tiny prompts—it's making Claude Code write small scripts that generate repetitive code for you.

Mar 24, 202696% relevant

What 19M+ Claude Code Commits Tell Us About Real-World Usage

A new dashboard tracking Claude Code's GitHub footprint reveals TypeScript dominance, massive net code growth, and how developers are using it to ship.

Mar 24, 202691% relevant

Starlette 1.0 Released: Python ASGI Framework Hits First Stable Version After 8 Years

The Starlette ASGI framework for Python has released version 1.0, its first stable release after nearly eight years of development. The package has been downloaded nearly 10 million times.

Mar 22, 202685% relevant

Three Agents, One Mission: A Multi-Agent Architecture for Real-Time Fraud Detection

A technical walkthrough of a multi-agent system built with Mesa and XGBoost for real-time fraud detection. It moves beyond a simple classifier to a complete, observable, and actionable pipeline.

Mar 18, 202672% relevant

Why Companies End Up Using Triton Inference Server: A Simple Case Study

A case study explains the common journey from a simple ML experiment to a production system requiring a robust inference server like NVIDIA's Triton, highlighting its role in managing multi-model, multi-framework deployments at scale.

Mar 16, 202675% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety