About PharmEasy
We are seeking a motivated and detail-oriented Associate Site Reliability/Infrastructure/DevOps Engineer to join our dynamic team. The ideal candidate will work across AWS, Azure, and GCP cloud platforms, ensuring the reliability, scalability, and performance of our systems. This role involves collaborating with development and operations teams to implement DevOps practices, automate processes, manage Kubernetes environments, and maintain the infrastructure that supports our applications and services.
Key Responsibilities
• Cloud Infrastructure Management:
◦ Assist in deploying, managing, and monitoring applications on AWS, Azure, and GCP.
◦ Support the maintenance and optimization of cloud resources to ensure cost-effectiveness and performance.
• Kubernetes Management:
◦ Deploy, manage, upgrade, and maintain Kubernetes clusters across multiple cloud platforms.
◦ Implement best practices for Kubernetes orchestration, scaling, and security.
◦ Troubleshoot and resolve issues within Kubernetes environments to ensure high availability and performance.
• DevOps Practices:
◦ Collaborate with development teams to implement CI/CD pipelines using tools such as Jenkins, ArgoCD, Ansible, or Azure DevOps.
◦ Automate deployment processes and infrastructure provisioning using Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Crossplane.
• Monitoring and Incident Management:
◦ Monitor system performance and reliability using tools like Prometheus, Grafana, Datadog, or similar.
◦ Participate in incident response, troubleshooting, and root cause analysis to resolve outages and performance issues.
• System Reliability and Performance:
◦ Assist in designing and implementing strategies to enhance system reliability, scalability, and availability.
◦ Contribute to capacity planning and performance tuning of applications and infrastructure.
• Collaboration and Documentation:
◦ Work closely with cross-functional teams to ensure seamless integration and deployment of applications.
◦ Maintain comprehensive documentation for infrastructure, processes, and procedures.
• Security and Compliance:
◦ Support the implementation of security best practices in cloud and Kubernetes environments.
◦ Ensure compliance with relevant industry standards and organizational policies.
Qualifications
• Education:
◦ Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field.
◦ Relevant certifications (e.g., AWS Certified Solutions Architect, Microsoft Certified: Azure Fundamentals, Google Associate Cloud Engineer) are a plus.
• Experience:
◦ 1-3 years of experience in a similar role, preferably in site reliability engineering, DevOps, or cloud engineering.
◦ Hands-on experience with at least one major cloud platform (AWS, Azure, or GCP).
Skills and Competencies
• Technical Skills:
◦ Proficiency with cloud services across AWS, Azure, and GCP.
◦ Strong experience with Kubernetes, including deployment, management, and troubleshooting of clusters.
◦ Experience with scripting languages such as Python, Bash, or PowerShell.
◦ Familiarity with containerization technologies like Docker and orchestration tools like Kubernetes.
◦ Understanding of networking concepts, including VPCs, subnets, load balancers, and DNS.
• DevOps Tools:
◦ Experience with CI/CD tools (e.g., Jenkins, GitLab CI, CircleCI).
◦ Knowledge of Infrastructure as Code (IaC) tools (e.g., Terraform, Ansible, CloudFormation).
• Monitoring and Logging:
◦ Familiarity with monitoring tools (e.g., Prometheus, VictoriaMetrics, Grafana, any APM) and logging solutions (e.g., ELK Stack, Splunk).
• Problem-Solving:
◦ Strong analytical and troubleshooting skills to identify and resolve system issues efficiently.
• Communication:
◦ Excellent verbal and written communication skills to collaborate effectively with team members and stakeholders.
• Adaptability:
◦ Ability to learn new technologies quickly and adapt to evolving project requirements.
Preferred Qualifications
• Advanced Certifications:
◦ Additional cloud certifications or specialized training in SRE practices.
• Experience with Automation:
◦ Proven experience in automating routine tasks and processes to improve efficiency.
• Understanding of Agile Methodologies:
◦ Familiarity with Agile or Scrum frameworks and experience participating in iterative development cycles.
• Security Best Practices:
◦ Knowledge of cloud and Kubernetes security principles and experience implementing security measures in cloud and Kubernetes environments.