AI Inference Engineer Job at Signify Technology, Santa Rosa, CA

WStSdlNvWTNrWG5tdkpSNWhBOVBBUkg3Wnc9PQ==
  • Signify Technology
  • Santa Rosa, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

CHRISTUS Health

Orthopedic Joint Surgery Physician - Texas - Locum or Permanent Job at CHRISTUS Health

Join Our Team as an Orthopedic Joint Surgery PhysicianWelcome to CHRISTUS Trinity Mother Frances Health System! We are looking for a passionate and skilled Board Certified/Board Eligible Orthopedic Joint Surgeon to join our exceptional team in Northeast Texas. If you... 

Pursuit Collection

Baker (KFWL/Fox Island) Job at Pursuit Collection

What perks can you expect?: ~ Join an inclusive, global team and make life-long connections. ~ Enjoy free access to Pursuit attractions and 50% off for friends. ~ Get discounts on hotel stays, dining, and retail. ~ Access subsidized mental health and wellness...

Diversified Maintenance Systems, LLC

General Cleaner - KOHL'S MI Job at Diversified Maintenance Systems, LLC

Job Description Job Description General Cleaner Come work for Diversified Maintenance, a leading company in the Facilities Services Industry since 1973. At Diversified Maintenance we believe that details matter, as do each of our employees and customers. Through...

BEM Systems, Inc.

Environmental Permitting Specialist/Ecologist (Madison) Job at BEM Systems, Inc.

 ...and Sections 9 and 10 of the Rivers and Harbors Appropriation Act. Experience with the New York State Department of Environmental Conservation (NYSDEC), United States Coast Guard (USCG), NJ Pinelands Commission (Pinelands), Delaware and Raritan Canal Commission (DRCC),... 

Barringer Construction

Construction MEP Superintendent Job at Barringer Construction

 ...the following skills, knowledge and experience in commercial construction. Reasonable accommodations may be made to enable individuals with...  ...only a port-a-jon available for a restroom. This position requires early morning, weekend and night shift work as needed....