Back to Jobs
ai-mlRemote (Nepal)full-time
✨ Senior LLM Researcher
OVERVIEW
We are seeking a Senior LLM Researcher with 7+ years of experience to lead cutting-edge research and development of Large Language Models for innovative Israeli deep tech startups. The ideal candidate will have a strong research background in LLMs, deep expertise in model architecture, fine-tuning, evaluation, and optimization techniques. A proven track record of working with state-of-the-art LLMs (LLaMA, Mistral, GPT-series, etc.) and publishing or implementing novel techniques is highly desired. Excellent communication skills, the ability to translate business objectives into research directions, and a strong aptitude for remote work are essential.
RESPONSIBILITIES
- 01Conduct advanced research on Large Language Models, including architecture design, pre-training, fine-tuning, alignment, and optimization.
- 02Develop novel techniques to improve model performance, efficiency, reasoning capabilities, and reduce hallucinations.
- 03Design and implement experiments for model evaluation, benchmarking, and ablation studies.
- 04Work on domain-specific adaptation of LLMs for deep tech applications.
- 05Collaborate with MLOps, Data, and Engineering teams to productionize research outcomes into scalable systems.
- 06Translate complex business and product requirements into actionable research problems and technical solutions.
- 07Stay up-to-date with the latest advancements in LLM research and reproduce/publish state-of-the-art results.
- 08Analyze and optimize training and inference pipelines for cost and performance.
- 09Mentor junior researchers and contribute to building a strong AI research culture within the team.
- 010Communicate effectively with remote teams, present findings, and document research clearly.
REQUIREMENTS
- Education: Master’s or PhD degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field (PhD strongly preferred).
- Experience: 7+ years of research or applied research experience focused on Large Language Models and Generative AI.
- Deep expertise in LLM techniques including prompting, RAG, fine-tuning (LoRA, QLoRA), RLHF, DPO, and model compression.
- Strong proficiency in PyTorch and experience with major LLM frameworks and libraries (Transformers, DeepSpeed, vLLM, Hugging Face, etc.).
- Proven track record of working with open-source LLMs (LLaMA, Mistral, Gemma, etc.) and large-scale training or inference.
- Publication record in top AI/ML conferences (NeurIPS, ICML, ICLR, ACL, EMNLP) is a strong plus.
- Proven ability to break down business objectives into research directions, excellent English communication, and self-motivation for remote work.
- Familiarity with cloud platforms (AWS preferred) for large-scale model training and experimentation.
- Bonus: Experience with multimodal LLMs, agentic systems, or long-context modeling.