Giới thiệu AI Deployment

Giới thiệu AI Deployment | MinAI Learning

🛠️ Công nghệ sử dụng

TB5 min

Thành phần	Công nghệ	Mục đích
API Framework	FastAPI	REST API server
Container hóa	Docker	Đóng gói & triển khai
Điều phối	Docker Compose / K8s	Quản lý đa container
Cache	Redis	Cache phản hồi
Giám sát	LangSmith / W&B	Quan sát LLM
Bảo mật	API keys, rate limiting	Bảo vệ
CI/CD	GitHub Actions	Triển khai tự động

Checkpoint

Bạn đã biết các technologies chính cần thiết cho AI deployment chưa?

Task 3

💻 Bắt đầu nhanh: FastAPI + LangChain

TB5 min

python.py

1from fastapi import FastAPI
2from langchain_openai import ChatOpenAI
3from langchain_core.prompts import ChatPromptTemplate
4
5app = FastAPI()
6llm = ChatOpenAI(model="gpt-4o-mini")
7
8chain = (
9    ChatPromptTemplate.from_messages([
10        ("system", "Ban la AI assistant."),
11        ("human", "{question}")
12    ])
13    | llm
14)
15
16@app.post("/chat")
17async def chat(question: str):
18    response = await chain.ainvoke({"question": question})
19    return {"answer": response.content}

Bash

1# Run
2uvicorn main:app --host 0.0.0.0 --port 8000

Checkpoint

Bạn đã chạy thử FastAPI server với LangChain chưa?

Task 4

🛠️ Điều kiện tiên quyết

TB5 min

Bash

1# Tools can thiet
2python --version  # 3.10+
3docker --version  # 24+

Bash

1pip install fastapi uvicorn langchain langchain-openai
2pip install redis python-dotenv pydantic

Checkpoint

Bạn đã cài đặt đầy đủ các tools và packages cần thiết chưa?

Task 6

Trục	Cần chứng minh	Anti-pattern thường gặp
Architecture	Có phân tách rõ API, inference, storage, monitoring	Nhét tất cả vào 1 service monolith
Reliability	Có health checks, retry, timeout, fallback	Chỉ dựa vào happy path
Security	Secret management, auth, rate limit, input validation	Hard-code keys, thiếu validation
Operations	Log/metrics/alert + on-call ownership	Deploy xong mới nghĩ đến monitoring

Nguồn	Nội dung	Link
Docker	Container hóa ứng dụng	Docker Documentation
FastAPI	Framework API nhanh cho Python	FastAPI Documentation
Google Cloud Run	Triển khai container serverless	Cloud Run Documentation
AWS Lambda	Serverless compute từ AWS	AWS Lambda Documentation
Streamlit	Framework tạo web app cho ML/AI	Streamlit Documentation
Hugging Face Inference	Triển khai models trên HF Spaces	HF Inference Endpoints

Khóa học

Mentor & Hỗ trợ

Blog

Giới thiệu

Giới thiệu AI Deployment

🎯 Mục tiêu bài học

Sau bài này, bạn sẽ:

🔍 Khoảng cách Production

Checkpoint

📐 Kiến trúc Deployment

Checkpoint

🛠️ Công nghệ sử dụng

Checkpoint

💻 Bắt đầu nhanh: FastAPI + LangChain

Checkpoint

📝 Lộ trình khóa học

Checkpoint

🛠️ Điều kiện tiên quyết

Checkpoint

🎯 Tổng kết

Bài tập

Câu hỏi tự kiểm tra

🚀 Bài tiếp theo

🧠 Góc Nhìn Chuyên Gia: Deployment Readiness Framework

4 trục readiness trước production

Mốc triển khai khuyến nghị

📚 Tài liệu tham khảo