job responsibilities –
• build software products with r&d teams that are openly collaborative, are non-hierarchical, respect contributions, and work with agility.
• provide vision & leadership for the technology roadmap of our products.
• plan and execute pocs as necessary.
• build and maintain the next-generation ml platforms and infrastructure
• create & maintain ci/cd pipelines for machine learning models on aws. define, deploy, and manage processes and tools for continuous integration and delivery (ci/cd), test-driven development, and release management for machine learning and deep learning (ml/dl) models and data pipelines.
• work closely with the dev team to create software deployment strategies and solutions, and be accountable for designing, building, and optimizing automation systems with quality and speed
• be accountable for the architecture and technical leadership of the complete devops infrastructure
skill requirement – mandatory
• understanding of machine learning pipelines
• experience with productionizing deep learning applications
• experience with training, inference, and deployment of deep learning models using devops principles
• familiarity with commonly used frameworks like tensorflow, torch, sklearn, etc.
• experience with containerization
• experience with version control tools and platforms such as git and bitbucket
• good understanding of nlp models such as gpt and bert
• expertise in linux installation, configuration, and file system management.
• experience with performance tuning, backup, and restore.
• experience with source configuration management tools such as git and svn.
• experience configuring and managing apache web server and mysql server.
• hands-on experience with amazon web services.
• hands-on experience with different operating systems.
• hands-on experience with virtualization software such as virtualbox, vmware, vagrant, and docker
• designing and deploying multiple applications using a broad range of aws services (including ec2, route 53, vpn, iam, s3, rds), focusing on high availability, fault tolerance and