Description:
Role: LLM Engineer
Location: San Jose, CA (2 days onsite)
Duration: 12+ Months
Only W2

Model Development & Optimization: Design, train, fine-tune, and evaluate large language models (LLMs) for performance, efficiency, and alignment with product or research goals.
Systems Integration & Deployment: Implement scalable inference pipelines, optimize serving infrastructure (e.g., quantization, caching, distillation), and integrate models into applications or APIs.
Research & Cross-Functional Collabor
Nov 13, 2025; from: dice.com