ai-mlRemote (Nepal)full-time

✨ Senior LLM Researcher

OVERVIEW

We are seeking a Senior LLM Researcher with 7+ years of experience to lead cutting-edge research and development of Large Language Models for innovative Israeli deep tech startups. The ideal candidate will have a strong research background in LLMs, deep expertise in model architecture, fine-tuning, evaluation, and optimization techniques. A proven track record of working with state-of-the-art LLMs (LLaMA, Mistral, GPT-series, etc.) and publishing or implementing novel techniques is highly desired. Excellent communication skills, the ability to translate business objectives into research directions, and a strong aptitude for remote work are essential.

Important Note: The role is to train and explore large vision action models.

RESPONSIBILITIES

01Conduct advanced research on Large Language Models, including architecture design, pre-training, fine-tuning, alignment, and optimization.
02Develop novel techniques to improve model performance, efficiency, reasoning capabilities, and reduce hallucinations.
03Design and implement experiments for model evaluation, benchmarking, and ablation studies.
04Work on domain-specific adaptation of LLMs for deep tech applications.
05Collaborate with MLOps, Data, and Engineering teams to productionize research outcomes into scalable systems.
06Translate complex business and product requirements into actionable research problems and technical solutions.
07Stay up-to-date with the latest advancements in LLM research and reproduce/publish state-of-the-art results.
08Analyze and optimize training and inference pipelines for cost and performance.
09Mentor junior researchers and contribute to building a strong AI research culture within the team.
010Communicate effectively with remote teams, present findings, and document research clearly.

REQUIREMENTS

Education: Master’s or PhD degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field (PhD strongly preferred).
Experience: 7+ years of research or applied research experience focused on Large Language Models and Generative AI.
Deep expertise in LLM techniques including prompting, RAG, fine-tuning (LoRA, QLoRA), RLHF, DPO, and model compression.
Strong proficiency in PyTorch and experience with major LLM frameworks and libraries (Transformers, DeepSpeed, vLLM, Hugging Face, etc.).
Proven track record of working with open-source LLMs (LLaMA, Mistral, Gemma, etc.) and large-scale training or inference.
Publication record in top AI/ML conferences (NeurIPS, ICML, ICLR, ACL, EMNLP) is a strong plus.
Proven ability to break down business objectives into research directions, excellent English communication, and self-motivation for remote work.
Familiarity with cloud platforms (AWS preferred) for large-scale model training and experimentation.
Bonus: Experience with multimodal LLMs, agentic systems, or long-context modeling.

✨ Senior LLM Researcher

OVERVIEW

RESPONSIBILITIES

REQUIREMENTS

APPLY FOR THIS ROLE