Site Reliability Engineer (Linux, AWS)
165 Thái Hà, Dong Da, Ha Noi
Job information
Description
We are looking for a Site Reliability Engineer to design, implement, and monitor our infrastructure, and to work with other teams and team members on automation strategies and deployment processes. You will become an integral part of a CloudOps team, treating every problem on the platform as your own and solving it accordingly.
As a Site Reliability Engineer you will:
Serve as a level 3 support resource for the systems you are responsible for
Troubleshoot and resolve end-user issues independently and efficiently
Build a knowledge base around common production support issues
Troubleshoot and fix the system when it breaks
Reduce the impact of errors and automate repetitive tasks
Maintain services once they are live by measuring and monitoring availability, latency and overall system health
Author and maintain documentation for related processes, procedures and system events
Identify areas of improvement within our systems and perform enhancements
Share the responsibility of being on-call
Engage in the entire lifecycle of services, from inception through operation and continuous integration
Lead incident triage, analysis, and resolution
Drive root cause analysis and the completion of corrective actions to help eliminate service disruptions and thereby improve the organization's day-to-day operations
Requirements
Expert-level troubleshooting skills across different levels of the stack
Scripting and software development in one or more programming languages (PowerShell / Bash / Python)
Good understanding of cloud architecture on both Windows- and Linux-based systems
At least 1 year of hands-on experience with cloud infrastructure such as Azure or AWS
Deep expertise in monitoring distributed systems and application architectures
Experience using and maintaining CI/CD and orchestration tools at scale (Azure Automation, Octopus Deploy, Salt, Puppet, Chef, etc.)
Diagnosing and troubleshooting user-facing service outages
Exposure to system- and application-level telemetry for large distributed cloud architectures
Diagnosing and resolving problems in high-throughput web applications and network services
We would be very excited if you have experience with:
Elasticsearch
Understanding of ITIL terminology for incident and problem management
Git
Kubernetes
Azure DevOps
Azure Active Directory
Other information
As a member of Episerver, you will be offered:
One free “Hacking day” per month for self-study and research on any IT-related subject;
5 working days per week with flexible working hours and no overtime;
Annual luxury kick-off vacation;
International, professional, creative working environment and talented teams;
Onsite opportunities in Europe and the US;
Cultural, sports, and arts clubs and activities sponsored and/or supported by the company (e.g., football, gym, swimming, guitar, English…);
Powerful workstation: Core i7-9700, 16-32 GB RAM, 2 x QHD 2560x1440 monitors (2K resolution);
100% of official salary during the probation period, 13th-month salary, annual salary raise;
Up to 3 extra paid leave days per year;
Social, health, and unemployment insurance based on 100% of gross salary and fully paid by the company;
Extra bonus of $60 per special occasion (Birthday, Labor Day, National Day, Solar New Year, Lunar New Year);
Lunch allowance of $30 per month;
Baby allowance of $12 per month for a child under 3 years old;
AON Premium Healthcare Insurance package for employees and their children up to 18 years old;
A daily variety of food, drinks, and seasonal fresh fruit;
And many other benefits. Join us to discover them!
We are good listeners
Cutting-edge company
Great working environment
Workplace
- 165 Thái Hà, Dong Da, Ha Noi