Building Generative AI Services with Fastapi: A Practical Approach to Developing Context-Rich Generative AI Applications - Paperback

Building Generative AI Services with Fastapi: A Practical Approach to Developing Context-Rich Generative AI Applications - Paperback

$84.65
Sale price  $84.65 Regular price 
Skip to product information
Building Generative AI Services with Fastapi: A Practical Approach to Developing Context-Rich Generative AI Applications - Paperback

Building Generative AI Services with Fastapi: A Practical Approach to Developing Context-Rich Generative AI Applications - Paperback

$84.65
Sale price  $84.65 Regular price 

by Alireza Parandeh (Author)

Ready to build applications using generative AI? This practical book outlines the process necessary to design and build production grade AI services with a FastAPI web server that communicate seamlessly with databases, payment systems, and external APIs. You'll learn how to develop autonomous generative AI agents that stream outputs in real-time and interact with other models. Web developers, data scientists, and DevOps engineers will learn to implement end-to-end production-ready services that leverage generative AI.

You'll learn design patterns to manage software complexity, implement FastAPI lifespan for AI model integration, handle long-running generative tasks, perform content filtering, cache outputs, implement retrieval augmented generation (RAG) with a vector database, implement usage/cost monitoring and tracking, protect services with your own authentication and authorization mechanisms, and effectively control stream outputs directly from GenAI models. You'll explore efficient testing methods for AI outputs, validation against databases, and deployment patterns using Docker for robust microservices in the cloud.

  • Build generative services that interact with databases, external APIs, and more
  • Learn how to load AI models into a FastAPI lifecycle memory
  • Monitor and log model requests and responses within services
  • Use authentication and authorization patterns hooked with generative models
  • Handle and cache long-running inference tasks
  • Stream model outputs via streaming events and WebSockets into browsers or files
  • Automate the retraining process of generative models by exposing event-driven endpoints

Ali Parandeh is a Chartered Engineer with the UK Engineering Council and a Microsoft and Google certified developer, data engineer, and data scientist.

Number of Pages: 528
Dimensions: 1.07 x 9.19 x 7 IN
Publication Date: May 20, 2025

Intentional design

We make things that work better and last longer. Our products solve real problems with clean design.

Quality first

We obsess over the details and strive to deliver the best products at the best prices, every time.

Customer care

We're always on your side: keeping our loyal customers happy is our top priority and number one goal.

Feature 1

Made with care and unconditionally loved by our customers, this signature bestseller exceeds all expectations.

Feature 2

Made with care and unconditionally loved by our customers, this signature bestseller exceeds all expectations.

At the heart of every product lies a unique story, driven by our passion for quality and innovation. Each item enhances your everyday life and sparks joy.