Ready to build production-grade applications with generative AI? This practical guide takes you through designing and deploying AI services using the FastAPI web framework. Learn how to integrate models that process text, images, audio, and video while seamlessly interacting with databases, filesystems, websites, and APIs. Whether you’re a web developer, data scientist, or DevOps engineer, this book equips you with the tools to build scalable, real-time AI applications.
Author Alireza Parandeh provides clear explanations and hands-on examples covering authentication, concurrency, caching, and retrieval-augmented generation (RAG) with vector databases. You’ll also explore best practices for testing AI outputs, optimizing performance, and securing microservices. With containerized deployment using Docker, you’ll be ready to launch AI-powered applications confidently in the cloud.
Build generative AI services that interact with databases, filesystems, websites, and APIs
Manage concurrency in AI workloads and handle long-running tasks
Stream AI-generated outputs in real time via WebSocket and server-sent events
Secure services with authentication, content filtering, throttling, and rate limiting
Optimize AI performance with caching, batch processing, and fine-tuning techniques
Visit the Book’s Website.
From the brand
Machine Learning, AI & more
Machine Learning
Artificial Intelligence
Deep Learning
Language Processing (NLP, LLM)
Sharing the knowledge of experts
O’Reilly’s mission is to change the world by sharing the knowledge of innovators. For over 40 years, we’ve inspired companies and individuals to do new things (and do them better) by providing the skills and understanding that are necessary for success.
Our customers are hungry to build the innovations that propel the world forward. And we help them do just that.
ASIN : B0F4ZX7Y21
Publisher : O’Reilly Media
Accessibility : Learn more
Publication date : April 15, 2025
Edition : 1st
Language : English
File size : 11.0 MB
Simultaneous device usage : Unlimited
Enhanced typesetting : Enabled
X-Ray : Not Enabled
Word Wise : Not Enabled
Print length : 840 pages
ISBN-13 : 978-1098160265
Page Flip : Enabled
Best Sellers Rank: #829,554 in Kindle Store (See Top 100 in Kindle Store) #16 in Generative AI #23 in Web Services #92 in Web Services & APIs
Customer Reviews: 3.9 3.9 out of 5 stars 12 ratings

