Technical Skills & Experience:
- 3–5+ years of experience in AI/ML engineering.
- Strong programming skills in Python with frameworks such as PyTorch, TensorFlow, HuggingFace Transformers, LangChain, etc.
- Proven experience in training, fine-tuning, and deploying LLMs (preferably Qwen, GPT, LLaMA, Mistral, etc.).
Hands-on experience with:
- OCR models: Surya-OCR, VietOCR.
- NER, LILT, YOLOv5, Table Extraction.
- Experience with deploying AI pipelines on Cloud (AWS, GCP, Azure) or on-premise environments.
- Proficiency in CI/CD for AI/ML (GitLab CI, GitHub Actions, etc.).
- Familiarity with Vector Databases (Pinecone, Weaviate, Milvus) and RAG applications.
Preferred Qualifications:
- Experience in optimizing model inference speed and reducing operational costs.
- Knowledge of containerization and orchestration (Docker, Kubernetes).
- Prior work in AI projects involving document processing, OCR, and visual data analysis.