Staff Machine Learning Infrastructure Engineer

Job offer posted directly by companies on Experteer, or researched by Experteer.
Apply on company website


Intuit AI is paving the way in Machine Learning and Artificial Intelligence for all facets of the Intuit product suite, allowing Intuit to power prosperity for everyone. Intuit AI Engineering is a dynamic and innovative group whose mission is to enhance overall engineering capabilities for the purpose of improving efficiency, productivity, and impact of AI practitioners. In this team, you’ll be tasked with owning and delivering cutting edge platforms, frameworks, and services that power AI at Intuit. 

Machine Learning Infrastructure Engineers work at the intersection of software and systems engineering to design and build large scale consumer experiences which are reliable, operable, secure, highly available, disaster-ready, and performant. Join a world-class operations team and utilize your programming and operations talents to apply the latest patterns in continuous delivery, cloud operations, containerization, and serverless technology and help us build the next generation AI/ML platform and delight millions of customers.

In this role, you’ll be part of a vibrant team of data scientists, product developers, and machine learning engineers.  You’ll be expected to help architect, code, optimize, and deploy machine learning models at scale using the latest industry tools and techniques.  You will be responsible for defining and maintaining the day to day as well as the future operations and reliability for a variety of use-cases. You will also help automate, deliver, monitor, and improve machine learning solutions while ensuring data and models are secure.  


  • Design and build systems which improve machine learning scalability, usability, and performance.
  • Work cross-functionally with product managers, data scientists, and engineers to understand, implement, and deploy machine learning pipelines.
  • Work with data scientists and MLE to build pipelines to train and deploy models.
  • Effectively communicate results to peers and leaders.
  • Explore the state-of-the-art technologies and apply them to deliver customer benefits.
  • Embed into development projects and assist teams in delivering new features, services, and consumer experiences through production with speed and quality. You honor operational rigor and think first about customer experiences
  • Consider security, reliability, operability, and performance when designing and implementing technical solutions
  • Full stack system engineering. Utilize your deep knowledge of networking, systems, and software engineering coupled with your understanding of service-oriented architectures, mastery of CI/CD and AWS to steer operational approaches and solutions
  • Work on significant assignments which are broad in scope and complexity, span organizational boundaries, and cover a wide range of technology
  • Lead Failure Mode Effect Analysis (FMEA) working sessions and automate chaos engineering within the Intuit AI ecosystem
  • Expertly build and operate services within AWS. Assist project teams with designing and setting up highly available cloud environments
  • Think pragmatically when introducing new technology. Balance the demand for realistic, short-term goals to meet project and seasonal deadlines, with the need for longer-term, potentially more disruptive innovation


  • BS, MS, in Computer Science or a related field, or equivalent practical experience.
  • 3+ years of cloud experience, working collaboratively with development teams, assisting with build and production support, writing deployment automation, setting up CI/CD pipelines, etc.
  • Knowledge of data query and data processing tools (i.e. SQL)
  • 2+ years of solid Configuration Management experience: (For example Chef, Puppet, Salt, Ansible, Cfengine, etc
  • Knowledge of agile project methodologies. Experience with writing user stories in Jira, grooming project backlogs, planning sprints, and working with Scrum or Kanban
  • Solid understanding of networking including TCP/IP stack, basic switching/routing concepts, and load-balancing
  • Experience with CI/CD tools (Jenkins/Jenkins2, Spinnaker, CodePipeline, TeamCity, Bamboo, etc)
  • Experience with big data stores and NoSQL databases (EMR (spark, hive, hadoop) – preferred, Vertica – preferred, Cassandra, Redshift, Kinesis, or AWS Elasticsearch)
  • Experience with software container technology (e.g., Kubernetes, Docker)

Additional Qualifications

  • Team player possessing strong analytical, problem-solving and communication skills
  • Demonstrated ability to work with global teams across time zones
  • Ability to work effectively in a fast-paced, complex technical environment
  • High adaptability and flexibility
  • Experience driving for results across cross-functional teams while maintaining effective working relationships
  • Self-starter attitude and the ability to make decisions independently

The Experteer Benchmark is an Experteer service that shows you the estimated market value of a specific position. It is not given by the recruiters, and it is an estimate based solely on market data.

Experteer's estimation is based on both the analysis of internal data, as well as external information. The Experteer Benchmark is the average market value at the respective level. This value may be up to 15% higher or lower in each company (greater variation may be possible in individual cases).

The Recruiter Benchmark serves as a reference. The actual salary to be obtained depends on factors such as professional experience and expertise.