← Back to Job Listings
Databricks logo

Sr. ML Engineer - Gen AI & LLM

Databricks
Full Time2d

Job Location

Job Summary

The Machine Learning (ML) Practice team at Databricks is seeking a skilled professional to develop Large Language Model (LLM)-based solutions for customers. The ideal candidate will have experience building Generative AI applications, working with tools like HuggingFace and Langchain, and deploying LLMs on cloud platforms. This role involves collaborating with cross-functional teams, presenting at conferences, and providing technical mentorship to the ML community. The team values collaboration, life-long learning, and driving business value through ML. With a focus on customer-facing work, this position offers opportunities for growth and impact in the field of AI.

CSQ326R35

Mission

The Machine Learning (ML) Practice team is a highly specialized customer-facing ML team at Databricks facing an increasing demand for Large Language Model (LLM)-based solutions. We deliver professional services engagements to help our customers build, scale, and optimize ML pipelines, as well as put those pipelines into production. We work cross-functionally to shape long-term strategic priorities and initiatives alongside engineering, product, and developer relations, as well as support internal subject matter expert (SME) teams. We view our team as an ensemble: we look for individuals with strong, unique specializations to improve the overall strength of the team. This team is the right fit for you if you love working with customers, teammates, and fueling your curiosity for the latest trends in LLMs, MLOps, and ML more broadly. This role can be remote.

The impact you will have:

  • Develop LLM solutions on customer data such as RAG architectures on enterprise knowledge repos, querying structured data with natural language, and content generation

  • Build, scale, and optimize customer data science workloads and apply best in class MLOps to productionize these workloads across a variety of domains

  • Advise data teams on various data science such as architecture, tooling, and best practices

  • Present at conferences such as Data+AI Summit

  • Provide technical mentorship to the larger ML SME community in Databricks

  • Collaborate cross-functionally with the product and engineering teams to define priorities and influence the product roadmap 

  • What we look for:

    • Experience building Generative AI applications, including RAG, agents, text2sql, fine-tuning, and deploying LLMs, with tools such as HuggingFace, Langchain, and OpenAI

    • 5+ years of hands-on industry data science experience, leveraging typical machine learning and data science tools including pandas, scikit-learn, and TensorFlow/PyTorch

    • Experience building production-grade machine learning deployments on AWS, Azure, or GCP

    • Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) or equivalent practical experience

    • Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike

    • Passion for collaboration, life-long learning, and driving business value through ML

    • [Preferred] Experience working with Databricks & Apache Spark to process large-scale distributed datasets