Description: Job Title: LLM Engineer. Duration: 1.5+ months (hybrid, 2 days onsite) ... scalable inference pipelines, optimize serving infrastructure (e.g., quantization, caching, distillation), and integrate ...
27 days ago