Machine Learning - Model Serving Job at Alexander Chapman, Bay County, FL

cFRodTF6cjViK0RmMFN5eUFRRVJvRXNveUE9PQ==
  • Alexander Chapman
  • Bay County, FL

Job Description

We are working with a company building intuitive, voice-first AI systems that blend natural interaction with powerful model performance. Founded by leaders from Meta, Oculus, and Google, they’re creating a new class of consumer devices powered by speech, vision, and LLMs.

The Role

You’ll help optimize and scale the inference stack, working across model serving, performance tuning, and deployment to support real-time, multimodal AI.

What You’ll Do

  • Improve serving systems for LLMs, speech, and vision models.
  • Optimize throughput, latency, and cost using advanced techniques like batching, caching, and kernel tuning.
  • Extend frameworks like VLLM or SGLang to push the limits of performance.
  • Collaborate with training teams to deploy faster, lighter models.
  • Experiment with compilers and hardware backends to boost efficiency.

What We’re Looking For

  • Strong experience with PyTorch or similar ML frameworks.
  • Deep knowledge of model serving and systems performance.
  • Skilled in low-level debugging, bottleneck analysis, and server optimization.
  • Familiar with VLLM, Ray, or deploying inference workloads at scale.
  • Comfortable owning complex infrastructure projects end to end.
  • Background in computer science or related field from a top-tier university (e.g. Stanford, MIT, Ivy League).
  • Experience at a top tech company (e.g. FAANG) or a successful, high-growth startup.

They’re looking for curious, impact-driven engineers ready to push what’s possible with real-time AI.

Job Tags

Similar Jobs

The Curare Group

Visa Sponsorship Available Near Nashville Tennessee Job at The Curare Group

 ...three certified nurse practitioners.This opportunity offers an income guarantee, RVU bonus incentives, CME expenses, and more. J1 and H1B visa assistance is available. Practice details include: ~ Hospital Employee ~ Income Guarantee ~ WRVU... 

Sony Pictures Entertainment, Inc

Executive Director of Product Management - Marketing & Activation (Culver City) Job at Sony Pictures Entertainment, Inc

 ...Sony Pictures Television, the worlds largest independent studio, is seeking an Executive Director of Product Management Marketing & Activation to join our Insights, Strategy & Analytics organization. Our studio produces award-winning original content for both linear... 

Apptad Inc

Splunk Engineer Job at Apptad Inc

 ...Position: Splunk Engineer Location: Baltimore, MD (Onsite) (Candidates must be local to District of Columbia, Maryland, Virginia) Duration: Long term Contract Exp: 10 Years Qualifications: A minimum of 10 years of experience as a Splunk... 

Scadea Software Solutions

Entry-Level Brand Ambassador Job at Scadea Software Solutions

 ...solutions to enterprise customers in various industries like banking, insurance, education, telecom, and healthcare. Our focus is on...  ...Role Description This is a full-time hybrid role for an Entry-Level Brand Ambassador at Scadea Software Solutions in Dallas, TX.... 

Adecco

Travel CT Technologist - $2,470 per week Job at Adecco

Adecco is seeking a travel CT Technologist for a travel job in Lawrenceville, Georgia. Job Description & Requirements ~ Specialty: CT Technologist ~ Discipline: Allied Health Professional ~ Duration: 13 weeks ~40 hours per week ~ Shift: 8 hours, days ~...