Back
Senior ML Engineer
Arbisoft
Lahore
PKR undefined - PKR undefined
In person
Full-time
6-10 Years
5 days ago
Apply
Job Description
Arbisoft

Arbisoft is looking for an experienced ML Engineer to design and deploy cutting-edge AI solutions, including LLMs, RAG pipelines, and agentic workflows. The ideal candidate brings deep expertise in Python, transformers, and scalable cloud-based ML systems.

Key Responsibilities:
  • Design, implement, and evaluate ML/DL models using PyTorch, TensorFlow, or similar frameworks.
  • Build and optimize LLM-based systems, including prompt-tuning, fine-tuning, and adapter-based training (e.g., LoRA, QLoRA).
  • Develop robust and scalable RAG pipelines. Demonstrate in-depth knowledge of embeddings and work with vector databases like FAISS, Pinecone, Weaviate, etc.
  • Construct and maintain agentic AI workflows involving multi-step reasoning, tool calling, memory components, and planning logic.
  • Work with proprietary APIs, as well as open-source libraries and models.
  • Develop modular and clean Python code, adhering to software engineering best practices (OOP, reusable components, testing).
  • Implement scalable solutions in cloud environments (like AWS), leveraging GPU/TPU resources effectively.
  • Design inference pipelines that are robust and optimized for latency and throughput.
  • Collaborate with research and product teams to translate ideas into production-grade ML features.
Required Skills:
  • 5+ years of experience in machine learning and deep learning, including building models from scratch.
  • Proven track record of shipping ML solutions that scale in production.
  • Strong proficiency in Python and deep understanding of software design principles.
  • Proven experience with transformer-based architectures, LLMs, and embedding models.
  • Hands-on experience with RAG systems, deep understanding of agent-based systems. Familiarity with LangChain, LlamaIndex, or similar frameworks.
  • Experience with cloud platforms (AWS/GCP/Azure) and understanding of scalability, resource optimization, and model deployment.
  • Familiarity with performance profiling, efficient model serving, and hardware-aware design (e.g., GPU utilization, quantization).
  • Ability to read, debug, and contribute to complex ML/DL codebases.
Good to Have:
  • Experience with MLOps, orchestration tools (e.g., Airflow, AWS Step Functions), containerization (Docker, Kubernetes).
  • Exposure to optimization toolkits (ONNX, TensorRT) and serving frameworks (Triton, TorchServe).
  • Experience with experiment tracking (e.g., Weights & Biases, Comet).
  • Understanding of alignment techniques like RLHF or curriculum learning.