Scale is looking for an AI Infrastructure Engineer to join our Machine Learning Infrastructure team to build out our Training Platform. You will partner closely with Machine Learning researchers to understand their requirements and apply your own domain expertise and our compute resources to accelerate experimentation throughput.
The ideal candidate is someone who has strong fundamentals in machine learning, backend system design, and has prior ML Infrastructure experience. You should also be comfortable with infrastructure and large scale system design, as well as diagnosing both model performance and system failures.
You will:
- Build highly available, observable, performant, and cost-effective APIs for model training.
- Participate in our team’s on call process to ensure the availability of our services.
- Own projects end-to-end, from requirements, scoping, design, to implementation, in a highly collaborative and cross-functional environment.
- Exercise good taste in building systems and tools and know when to make build vs. buy tradeoffs, with an eye for cost efficiency.
Ideally you'd have:
- 4+ years of experience building machine learning training pipelines or inf...
Scale AI
A company that aims to make the transition from traditional software to AI faster across every industry.
Other jobs at Scale AI
Notifications about similar jobs
Get notifications to your inbox about new jobs that are similar to this one.
No spam. No ads. Unsubscribe anytime.
Similar jobs