Designing Production LLM Architectures

By Coursera on Coursera · Design
Price
Free

About This Course

This course is for ML engineers, solutions architects, and senior developers who build robust infrastructure powering large language models. This course teaches you how to design, deploy, and maintain the complex, interconnected systems required for scalable, resilient, and cost-effective LLM applications in the real world. You will learn to think like an architect, starting with foundational design choices. Using sequence diagrams and structured analysis, you will compare synchronous and asynchronous architectures and evaluate the critical trade-offs between self-hosting open-source models and using managed APIs, considering total cost of ownership, latency, and data privacy. The course then dives deep into building for resilience and scale, applying the 12-factor app methodology to design stateless, configurable microservices. You’ll learn to analyze multi-region deployment strategies for fault tolerance and to use container orchestration manifests like Helm to deploy scalable applications capable of handling production workloads. Finally, you’ll master the data backbone of your system by designing automated data pipelines with tools like Airflow and learning to manage the complexities of schema evolution.

Instructor

Professionals from the Industry

Frequently Asked Questions

How much does Designing Production LLM Architectures cost?
Visit the Designing Production LLM Architectures course page for current pricing and available discounts.
Who teaches Designing Production LLM Architectures?
Designing Production LLM Architectures is taught by Professionals from the Industry, Coursera.
What skill level is Designing Production LLM Architectures for?
This course is designed for all levels learners.