January 16, 2021

Site Reliability Engineer (Technical Support SRE Team)

  • Handle product/SaaS service operations, including alert handling, service implementation, and solution delivery for SaaS, in support of the following: a. Cloud Migration; b. DevOps Support; c. Cloud Native Improvement
  • Collaborate closely with global R&D teams as shared owners of SaaS. This includes understanding the service and providing recommendations on service performance, reliability, security, and scalability. Key expectations include, but are not limited to: a. Maintaining SaaS SLOs (MTTR, MTBF); b. Eliminating toil; c. Supporting 24/7 operation, including service operation and customer support
  • Lead or participate in continuous improvement and value creation projects. Key involvement includes: a. Solutions delivery and integration; b. Process creation and optimization; c. Cross-team investigation

Desired Experience and Skills

  • Experience in architecting and implementing infrastructure designs in public cloud environments (AWS and/or Azure)
  • Ability to design highly available, fault-tolerant infrastructure
  • Ability to provide cloud advisory and advanced technical solutions
  • AWS Certified Solutions Architect - Associate certification or equivalent skills
  • Experience in solutions integration, tools development, and programming
  • Proficiency in one or more of the following programming languages: Python, PHP, Java, Bash, or PowerShell
  • API Manipulation
  • Source Code Management
  • Experience in implementing auto-healing concepts in AWS and Azure
  • Experience in network configuration for on-premises and public cloud environments
  • Hands-on experience with Linux and Windows servers
  • Self-motivated team player who is eager to learn new things
  • Project management experience
  • Hands-on experience with container technologies (e.g., Kubernetes, Docker)
  • Knowledge of web frameworks (e.g., Yii, CodeIgniter, Flask)
  • Knowledge of developing and maintaining CI/CD pipelines for a growing DevOps environment
  • Experience in automating infrastructure deployment or management (CloudFormation, Ansible, Chef, Jenkins, and/or Puppet)
  • Product/SaaS service knowledge, including ApexOne, DSaaS, and WFBSS