Job Title: Site Reliability-Operations Engineer
Job Description
The ideal candidate for the Site Reliability-Operations Engineer role has strong experience maintaining uptime for production websites, Linux and Windows system administration skills, and familiarity with front-line web application support. The Site Reliability-Operations Engineer will play a key role in maintaining and expanding our hosted platforms. Responsibilities include problem diagnostics, security functions, and spinning up, managing, and monitoring cloud servers.
Primary Responsibilities
- Maintain uptime for Linux and Windows production systems
- Perform problem diagnostics and security functions; spin up, manage, and monitor cloud servers
- Configure and monitor servers and applications to ensure reliability, high availability, and scalability
- Troubleshoot quickly at various levels of the technology stack while maintaining composure in high-stress situations
- Maintain and support backups, data redundancy, and failover protocols for systems
- Interact with management, upper-level support, vendors, and customers, and communicate issues in a clear, straightforward manner
- Maintain an on-call rotation and address problems remotely in a timely and efficient manner
- Work well in a team setting and welcome input from team members
- Initiate and take ownership of work without requiring supervision
Requirements
- Minimum 3 years of experience with Linux and Windows production systems administration in a 24x7 environment
- Experience configuring load balancers, network routing (NAT, TCP/IP), firewalls, intrusion detection, and networking services (DNS, DHCP, VPN)
- Experience with scalable systems, monitoring tools and load balancers
- Excellent understanding of TCP/IP networking
- Linux and Windows scripting experience
- Excellent communication skills, both written and verbal
- 4-year university degree