Back to Jobs
ai-mlRemote (Nepal)full-time

✨ Senior LLM Researcher

OVERVIEW

We are seeking a Senior LLM Researcher with 7+ years of experience to lead cutting-edge research and development of Large Language Models for innovative Israeli deep tech startups. The ideal candidate will have a strong research background in LLMs, deep expertise in model architecture, fine-tuning, evaluation, and optimization techniques. A proven track record of working with state-of-the-art LLMs (LLaMA, Mistral, GPT-series, etc.) and publishing or implementing novel techniques is highly desired. Excellent communication skills, the ability to translate business objectives into research directions, and a strong aptitude for remote work are essential.

RESPONSIBILITIES

  • 01Conduct advanced research on Large Language Models, including architecture design, pre-training, fine-tuning, alignment, and optimization.
  • 02Develop novel techniques to improve model performance, efficiency, reasoning capabilities, and reduce hallucinations.
  • 03Design and implement experiments for model evaluation, benchmarking, and ablation studies.
  • 04Work on domain-specific adaptation of LLMs for deep tech applications.
  • 05Collaborate with MLOps, Data, and Engineering teams to productionize research outcomes into scalable systems.
  • 06Translate complex business and product requirements into actionable research problems and technical solutions.
  • 07Stay up-to-date with the latest advancements in LLM research and reproduce/publish state-of-the-art results.
  • 08Analyze and optimize training and inference pipelines for cost and performance.
  • 09Mentor junior researchers and contribute to building a strong AI research culture within the team.
  • 010Communicate effectively with remote teams, present findings, and document research clearly.

REQUIREMENTS

  • Education: Master’s or PhD degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field (PhD strongly preferred).
  • Experience: 7+ years of research or applied research experience focused on Large Language Models and Generative AI.
  • Deep expertise in LLM techniques including prompting, RAG, fine-tuning (LoRA, QLoRA), RLHF, DPO, and model compression.
  • Strong proficiency in PyTorch and experience with major LLM frameworks and libraries (Transformers, DeepSpeed, vLLM, Hugging Face, etc.).
  • Proven track record of working with open-source LLMs (LLaMA, Mistral, Gemma, etc.) and large-scale training or inference.
  • Publication record in top AI/ML conferences (NeurIPS, ICML, ICLR, ACL, EMNLP) is a strong plus.
  • Proven ability to break down business objectives into research directions, excellent English communication, and self-motivation for remote work.
  • Familiarity with cloud platforms (AWS preferred) for large-scale model training and experimentation.
  • Bonus: Experience with multimodal LLMs, agentic systems, or long-context modeling.

APPLY FOR THIS ROLE

DROP FILE OR CLICK TO UPLOADPDF, DOC, DOCX (MAX 10MB)