{"product_id":"building-generative-ai-services-with-fastapi-a-practical-approach-to-developing-context-rich-generative-ai-applications-9781098160302","title":"Building Generative AI Services with FastAPI: A Practical Approach to Developing Context-Rich Generative AI Applications","description":"\u003cp\u003eReady to build applications using generative AI? This practical book outlines the process necessary to design and build production-grade AI services with a FastAPI web server that communicate seamlessly with databases, payment systems, and external APIs. You'll learn how to develop autonomous generative AI agents that stream outputs in real time and interact with other models. Web developers, data scientists, and DevOps engineers will learn to implement end-to-end production-ready services that leverage generative AI. \u003c\/p\u003e\u003cp\u003e You'll learn design patterns to manage software complexity, implement the FastAPI lifespan for AI model integration, handle long-running generative tasks, perform content filtering, cache outputs, implement retrieval-augmented generation (RAG) with a vector database, implement usage\/cost monitoring and tracking, protect services with your own authentication and authorization mechanisms, and effectively control stream outputs directly from GenAI models. You'll explore efficient testing methods for AI outputs, validation against databases, and deployment patterns using Docker for robust microservices in the cloud. 
\u003c\/p\u003e\u003cul\u003e \u003cli\u003eBuild generative services that interact with databases, external APIs, and more \u003c\/li\u003e\n\u003cli\u003eLearn how to load AI models into memory during the FastAPI lifecycle \u003c\/li\u003e\n\u003cli\u003eMonitor and log model requests and responses within services \u003c\/li\u003e\n\u003cli\u003eUse authentication and authorization patterns integrated with generative models \u003c\/li\u003e\n\u003cli\u003eHandle and cache long-running inference tasks \u003c\/li\u003e\n\u003cli\u003eStream model outputs via streaming events and WebSockets into browsers or files \u003c\/li\u003e\n\u003cli\u003eAutomate the retraining process of generative models by exposing event-driven endpoints \u003c\/li\u003e\n\u003c\/ul\u003e \u003cp\u003eAli Parandeh is a Chartered Engineer with the UK Engineering Council and a Microsoft and Google certified developer, data engineer, and data scientist.\u003cbr\u003e\u003cbr\u003e\u003cbr\u003e\u003cb\u003eAbout the Author\u003c\/b\u003e\u003cbr\u003e\u003cb\u003e\u003ci\u003eParandeh, Alireza:\u003c\/i\u003e\u003c\/b\u003e - Alireza Parandeh is a chartered engineer (CEng) with the UK engineering council, a Microsoft and Google Certified Developer, Data Engineer and Data Scientist. He has a strong background in web development, data science, and machine learning, having led engineering teams at large multinational consultancies and tech startups in London. Ali's portfolio of clients includes Network Rail, High-Speed Train 2, Transport for London, International Fertilizer's Association and the Department for Transport. \u003c\/p\u003e\u003cp\u003e As a passionate educator, Ali dedicates his free time to teaching data science and web development through meetups and online platforms. 
In 2019, he founded London's Beginners Machine Learning (BML) group, a Microsoft-sponsored meetup that helps professionals break into the field of Data Science \u0026amp; AI and obtain cloud certifications. The group has since grown to over 1,500 members.\u003c\/p\u003e","brand":"O'Reilly Media","offers":[{"title":"Default Title","offer_id":51276805898514,"sku":"9781098160302","price":50.99,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0831\/4771\/8930\/files\/img_4e09b07f-5db3-4baf-b170-aac3497d41f5.jpg?v=1747140034","url":"https:\/\/surprise-castle.myshopify.com\/products\/building-generative-ai-services-with-fastapi-a-practical-approach-to-developing-context-rich-generative-ai-applications-9781098160302","provider":"Surprise Castle","version":"1.0","type":"link"}