Oy 4 — Mashqlar to'plami
🟢 Easy
Computer Vision
- OpenCV bilan rasm yuklang, RGB/HSV/Grayscale ga aylantiring.
- Canny edge detection + contour topish.
- YOLOv8n pretrained model bilan rasm uchun inference.
- Pretrained EfficientNet bilan rasm uchun top-5 classification.
- Tesseract bilan oddiy matnli rasm uchun OCR.
NLP
- NLTK bilan tokenization, stop words olib tashlash, lemmatization.
- spaCy bilan NER va POS tagging.
- TF-IDF + Logistic Regression baseline (Spam SMS dataset).
- HuggingFace
pipeline("sentiment-analysis")10 ta gap uchun. - Sentence Transformers bilan 5 ta gap orasidagi cosine similarity matrix.
🟡 Medium
CV — Real loyihalar
- Document Scanner: telefon rasmidan hujjatni "tekislash" (contour + perspective).
- Custom YOLO training: 100-200 ta rasmni Roboflow'da label qiling, YOLOv8 fine-tune (Colab GPU).
- OCR pipeline: pasport rasmidan ism, familiya, raqamlarni ajratib olish.
- Image similarity search: 1000 ta rasmni pretrained CNN bilan embed qiling, query rasmga eng yaqin 10 tasini toping.
- Real-time webcam YOLO: webcam → bounding box + label.
NLP — Real loyihalar
- News classifier: BBC News dataset (5 kategoriya), TF-IDF + LR vs BERT solishtirish.
- O'zbek matn dataset: Telegram'dan 5000+ post yig'ing, classifier yarating.
- Multilingual sentiment: 3 til (en/ru/uz) uchun bitta model.
- Custom BPE tokenizer: o'zbek korpus uchun BPE o'rgating.
- Zero-shot classifier: 10 ta yangiliklarni "labels" bermay 5 ta kategoriyaga ajrating.
🔴 Hard (Production)
1. CV — Object Counter Service
Talab:
- FastAPI + YOLOv8 custom trained model
- Endpoint: rasm/video upload → count by class
- Celery + Redis (async processing)
- WebSocket real-time updates
- Docker + docker-compose
- Streamlit yoki React frontend
**Misol use case:**parking lotda mashinalar soni, do'konda odamlar oqimi
2. OCR — ID Card Reader
Talab:
- ID kart turini detect qilish (YOLO)
- Perspective correction (OpenCV)
- Field-by-field OCR (PaddleOCR)
- Validation + parsing (regex)
- PostgreSQL'da saqlash
- REST API + admin panel
3. NLP — Multilingual Customer Support Classifier
Talab:
- 3 tilda (en/ru/uz) keladigan support ticket'larni 10 kategoriyaga ajratish
- mBERT yoki XLM-R fine-tune
- FastAPI + caching
- Prediction monitoring (concept drift detection)
- Telegram bot integration
4. CV+NLP — Visual Question Answering
Talab:
- BLIP yoki similar VLM (Vision-Language Model)
- Rasm + savol → javob
- Streamlit demo
- Mobile app integration
Mini-loyihalar
Mini-loyiha 1: O'zbek Plate Number Recognition
- O'zbek raqam belgilari datasetini yig'ish (telefondan 100+ rasm)
- YOLO bilan plate detection
- OCR bilan raqamni o'qish
- FastAPI servisi
Mini-loyiha 2: Receipt Scanner
- Magazin chekining rasmini OCR
- Mahsulotlar va narxlarni ajratish
- Total summa va kategoriya bo'yicha guruhlash
- Telegram bot
Mini-loyiha 3: Sport Highlights Generator
- Futbol o'yini video
- Object detection (player, ball)
- Event detection (goal, foul)
- Avtomatik highlights montage (FFmpeg)
Mini-loyiha 4: Smart Document Search
- 100+ PDF hujjatni indexlash
- Sentence embeddings + FAISS
- Natural language search
- Streamlit UI
Quiz
CV
- Object detection va classification farqi?
- IoU va mAP nima?
- YOLO va Faster R-CNN tezligi va aniqligi farqi?
- Anchor box nima va anchor-free detector qanday ishlaydi?
- NMS (Non-Maximum Suppression) qachon ishlatiladi?
- Tesseract va modern OCR (EasyOCR/PaddleOCR) farqi?
- SAM (Segment Anything) ning special tomoni?
NLP
- Stemming va Lemmatization farqi?
- TF-IDF formulasi va intuitsiyasi?
- Word2Vec'ning Skip-gram va CBOW farqi?
- BPE va WordPiece tokenization farqi?
- BERT, GPT, T5 farqi (arxitektura)?
- Attention mechanism Q, K, V nima?
- Zero-shot classification qanday ishlaydi?
Production
- Pretrained model'ni qanday qilib production'ga olib chiqasiz?
- GPU inference uchun batching nima uchun foydali?
- Model versioning strategiyalari?
- CV servis uchun Docker image hajmini qanday kamaytirasiz?
- NLP servis uchun caching strategiyalari?
✅ Oy 4 oxiri checklist
- OpenCV bilan klassik image processing
- YOLOv8 inference va fine-tuning (Colab/Kaggle)
- OCR (kamida bitta kutubxona: Tesseract/EasyOCR/PaddleOCR)
- NLP klassik: TF-IDF + LR baseline
- spaCy bilan NER, POS
- HuggingFace Transformers (pipeline + Auto*)
- Sentence embeddings (RAG'ga tayyor)
- FastAPI'da CV yoki NLP servis
- Capstone loyiha GitHub'da
- LinkedIn'ga post
Tabriklayman! Oy 5 — LLM, RAG va AI Agentlar ga tayyormiz — endi siz LLM era'ga kirasiz!