Publisher's Synopsis
Learn how to put Large Language Model-based applications into production safely and efficiently.
Large Language Models (LLMs) are the foundation of AI tools like ChatGPT, LLaMA and Bard. This practical book offers clear, example-rich explanations of how LLMs work, how you can interact with them, and how to integrate them into your own applications.
In LLMs in Production, you will:
- Grasp the fundamentals of LLMs and the technology behind them
- Evaluate when to use a premade LLM and when to build your own
- Efficiently scale up an ML platform to handle the needs of LLMs
- Train LLM foundation models and finetune an existing LLM
- Deploy LLMs to the cloud and edge devices, including models built with complex techniques like RLHF
- Build applications leveraging the strengths of LLMs while mitigating their weaknesses
LLMs in Production delivers vital insights into MLOps for LLMs. You'll learn how to operationalize these powerful AI models for chatbots, coding assistants, and more. Find out what makes LLMs so different from traditional software and ML, discover best practices for working with them outside the lab, and dodge common pitfalls with advice from experienced practitioners.