
FENG GROUP
br{display:none;}.css-58vpdc ul > li{margin-left:0;}.css-58vpdc li{padding:0;}]]>
Our leading client from growing Computer Hardware company is looking for a Senior Reliability Engineer. Reliability Engineers troubleshoot, debug, evaluate and resolve alarms, perform systems management, perform software deployments and migrations, and automate routine operational tasks with the result being a more stable, reliable, and available environment.
Tasks
- Must be familiar with standard cloud architecture patterns and capabilities (AWS / GCP / Azure);
- Handle high-pressure situations in a calm and professional manner;
- Lead resolution of the effort of complex service problems from the network layer to the application at scale;
- Work hand-in-hand with software developers to facilitate the adoption of “Paved Road” solutions and DevSecOps toolchain;
- Review technical specifications to provide guidance and help development teams drive operational excellence;
- Support large-scale services across multiple environments;
- Diagnose and repair issues by editing code in python, modifying the configuration of supporting infrastructure such as MongoDB, Postgres, Redis, RabbitMQ, or programmatically modifying cloud-hosted resources;
- Create, edit, and maintain ad hoc scripts to resolve issues quickly with minimal user impact;
- Contribute to the development of new tools and automation that ensures the service can be optimized and tuned with minimal human intervention;
Requirements
- Bachelor’s Degree in CS, MIS, or equivalent experience;
- Relevant experience in Linux/Unix systems fundamentals, monitoring, cloud services, networking, storage, database, and application knowledge;
- Solid communications skills;
- Demonstrable knowledge of one or more languages: Python, Bash;
- Strong troubleshooting and problem-solving skills;
- Strong ability to work with customers and business partners such as Customer Service and Product Management to turn business requirements into technical implementation;
- Scripting and automation experience;
- Solid ability to work independently or as part of a team to deliver features on agreed-upon timelines;
- Ability to manage a team;
- Good analytical and problem-solving skills;
- Interested? Join us now!