Site Reliability Engineer (SRE)

Job Highlights

  • 3+ years of experience as an SRE or in a similar role
  • Experience with Azure, Jenkins, Prometheus, and Grafana
  • Strong scripting and analytical skills

Job Description

At Factorytalk, technology adoption is key to our business, and successful customer implementations are the main catalyst for achieving it. We are seeking an experienced Site Reliability Engineer (SRE) with a strong background in Azure to join our team. As an SRE, you will be responsible for the reliability and performance of our systems and for building and maintaining scalable infrastructure. You will work closely with development teams to implement best practices for continuous integration and delivery, and you will design and implement the monitoring, logging, and alerting systems that keep our services reliable.

If you have a background in Azure, are passionate about cloud and SRE, and are looking for a challenging and rewarding role, we would love to hear from you. We offer competitive salary packages, comprehensive benefits, and a dynamic work environment.

What You’ll Do

  • Manage and maintain our cloud infrastructure (primarily Azure) for our production environments, ensuring it is reliable, scalable, and secure
  • Work closely with the DevOps and development teams to improve services through rigorous testing and release procedures
  • Design and implement monitoring, logging, and alerting systems to ensure the reliability of our systems
  • Develop automation and tooling to improve efficiency and reduce manual effort in site releases and maintenance
  • Identify areas for improvement and work with cross-functional teams to implement changes
  • Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding
  • Provide cost estimation and optimization for our sites
  • Participate in system design consulting, platform management, and capacity planning
  • Balance feature development speed and reliability with well-defined service-level objectives

What You Have

  • 3+ years of experience as a Site Reliability Engineer or in a similar role
  • Experience with Azure, including infrastructure as code, virtual machines, storage, networking, and security
  • Strong scripting skills (Python, Bash, PowerShell, etc.)
  • Experience with continuous integration and delivery tools (e.g., Jenkins, Azure DevOps)
  • Experience with monitoring and logging tools (e.g., Azure Monitor, Prometheus, Grafana, ELK Stack)
  • Strong problem-solving and analytical skills
  • Excellent communication and collaboration skills
  • Bachelor’s degree in Computer Science, Information Technology, or a related field

What We Offer

  • Competitive Salary and Annual Increment
  • 5-Day Work Week and 12 Vacation Days
  • Fixed Bonus
  • Transportation Allowance
  • Provident Fund 3 – 15%
  • Medical Insurance and Annual Medical Check-up
  • Educational Assistance and Financial Assistance
  • Annual Outing, New Year Party, and Team Building Activities
  • Birthday Gift/Celebration
  • Anniversary Benefit
  • Flexible work from home/office

Apply for this position

or Call for Enquiries
+66 2630 4525



