Oy 5 — Mashqlar to'plami
🟢 Easy
LLM Fundamentals
tiktokenbilan inglizcha va o'zbekcha matnda token solishtirish.- 5 ta modelni (GPT-4o-mini, Claude Haiku, Gemini Flash, Llama 3.1, Mistral) bir xil savol bilan.
- Temperature 0, 0.5, 1.5 — javob farqlarini ko'rish.
Prompt Engineering
- Zero-shot, few-shot, CoT — bir xil masala uchun.
- Instructor bilan structured Pydantic output.
- JSON output uchun prompt + validation.
APIs
- OpenAI streaming chat.
- Anthropic prompt caching.
- Function calling — 3 ta tool.
Vector DB
- ChromaDB'da 100 ta hujjat.
- Qdrant Docker setup.
- pgvector Postgres extension.
RAG
- Naive RAG — 10 hujjat, query.
- Citation —
[Source N]format. - Chunking strategiyalari solishtirish.
Agents
- Pydantic AI agent + 3 tool.
- CrewAI hello world.
- LangGraph oddiy workflow.
Fine-tuning
- Pretrained Llama 1B yuklash.
- 50 ta sintetik dataset (GPT bilan).
- LoRA config sintaksis tushunish.
🟡 Medium
Real loyihalar
- Multi-turn chatbot: history saqlash, context window manage.
- RAG over Wikipedia: 100 ta o'zbek Wikipedia maqolasi.
- PDF Q&A bot: PyPDF + Qdrant + Streamlit.
- Code review agent: GitHub PR diff → suggestions.
- Email summarizer: 50 ta email → daily digest.
Advanced techniques
- Multi-query RAG: query expansion bilan.
- HyDE: hypothetical embeddings.
- Hybrid search: dense + BM25.
- Reranking: cross-encoder bilan.
- Multi-agent: CrewAI 3 agentli tizim.
Fine-tuning
- TinyLlama: o'zbek instruction dataset bilan QLoRA (Colab).
- OpenAI fine-tuning: customer support classifier ($1 budget).
- Sintetik data: GPT-4 yordamida 500+ training pairs.
🔴 Hard (Production)
1. Documentation Q&A Bot
Talab:
- 100+ ta hujjat (PDF, markdown, websites) ingestion
- Qdrant + FastAPI + Celery
- Multi-query + reranking
- Citation va source links
- Streamlit UI
- Langfuse observability
- Cost tracking per user
2. AI Customer Support Agent
Talab:
- Telegram bot (aiogram)
- Multi-turn conversation
- Tools: FAQ search, order lookup, refund process, escalate to human
- LangGraph workflow
- Postgres memory
- Sentiment-based routing
- Admin dashboard
3. RAG Evaluation Framework
Talab:
- Test set yaratish (100+ Q&A pairs)
- RAGAS bilan automated evaluation
- A/B testing framework
- Continuous improvement loop
- Grafana dashboard
4. Domain-specific Fine-tuning Pipeline
Talab:
- Data collection + cleaning
- Synthetic data augmentation
- QLoRA fine-tuning (Llama 3.1 8B)
- vLLM serving
- Benchmark (vs base model)
- Production rollout strategy
Mini-loyihalar
Mini-loyiha 1: Voice-to-Text Meeting Assistant
- Whisper (audio transcription)
- LLM summarization
- Action items extraction
- Slack integration
Mini-loyiha 2: Code Review Bot
- GitHub webhook
- Diff parsing
- LLM analysis (security, performance)
- Inline PR comments
Mini-loyiha 3: Personal Knowledge Base
- Notion + Obsidian export
- Vector DB ingestion
- "Second brain" chatbot
- Smart search
Mini-loyiha 4: O'zbek Tilidagi Hukumat Hujjatlari Chatbot
- lex.uz, data.gov.uz scraping
- Multi-language (uz/ru)
- Citation
- Legal disclaimer
Quiz
LLM
- Token, context window, temperature, top_p — har birini tushuntiring.
- Pretraining, SFT, RLHF — qanday navbat?
- Hallucination nima va qanday kamaytirish?
- Proprietary vs Open Source LLM — tanlov mezonlari?
- Prompt caching qanday ishlaydi?
Prompt Engineering
- Zero-shot, few-shot, CoT qachon qaysi?
- Structured output (JSON) uchun pattern'lar?
- Prompt injection — xavf va himoya?
- Self-consistency texnikasi?
- ReAct pattern intuition?
RAG
- RAG vs Fine-tuning farqi?
- Chunking strategiyalari trade-off?
- HNSW algoritm qanday ishlaydi?
- Hybrid search nima?
- Cross-encoder reranking nima uchun yaxshilanish keltiradi?
Agents
- Agent va LLM call farqi?
- ReAct pattern — Thought/Action/Observation?
- Multi-agent qachon kerak?
- MCP (Model Context Protocol) nima?
- Agent xavfsizligi — sandbox patternlar?
Fine-tuning
- LoRA mathematik intuition?
- QLoRA — nima uchun 4-bit?
- Sintetik data generation strategiyalari?
- RAG vs Fine-tuning — qachon birinchisini, qachon ikkinchisini?
- vLLM nima uchun production'da tez?
✅ Oy 5 oxiri checklist
- LLM API'larni (OpenAI, Anthropic) ishlataman
- Prompt engineering texnikalarini bilaman
- Structured output (Instructor, Pydantic AI)
- Vector DB (kamida 2 ta) bilan tanish
- To'liq RAG pipeline yaratdim
- AI Agent (tool use) yozdim
- LoRA bilan kichik fine-tuning sinab ko'rdim
- Production'ga olib chiqdim (FastAPI + Docker)
- Langfuse / observability
- Capstone loyiha (chatbot/RAG)
- LinkedIn'ga post
Tabriklayman! Oy 6 — MLOps va Production — sizning asosiy maqsadingiz uchun eng muhim oy.