Site Reliability Engineer

Location Southampton
Discipline: Infrastructure & Support
Job type: Permanent
Salary: £55k per annum

​A Leading Global Cruise Company based in Southampton are looking for a Site Reliability Engineer to join their team on a permanent basis.

About the company

The company has been the premier innovator in the cruise line industry for over 54 years. They were the first to eliminate set dining times and provide guests with the freedom to cruise on their schedule to over 450 incredible destinations around the world. Their award-winning fleet provides an unparalleled onboard experience and will continue to expand. In the coming years, their commitment to innovation will continue to raise the bar for the cruise industry.

About the role

The Site Reliability Engineer (SRE) is responsible for ensuring performance, reliability, and robustness goals for high-traffic, mission-critical e-commerce web and mobile applications. This position will also maintain and monitor on-ship and on-shore server and network infrastructure.

  • Install, configure and maintain Windows and Linux-based web and virtual servers.

  • Create, configure, and administer staging and production databases including data transfer, backup and recovery plans.

  • Administer and maintain the source code control environment.

  • Configure production web environments for the website.

  • Deploy launch ready websites to production.

  • Monitor and troubleshoot production installation of client websites.

  • Perform load testing as needed for high volume deployment, DNS and domain registrar administration.

  • Handle asset tracking, planning and documentation.

  • Lead the establishment and maintenance of network and computer use standards and guidelines .

  • Research and present hardware and software tools used in the support and maintenance of the website.

  • Develop, validate, and maintain an IT plan for system maintenance, disaster recovery and expansion.

  • Mentor team members in areas of web and database server administration.

  • Make recommendations for areas of improvement in internal processes, along with potential solutions.

  • Assist in remediating web application infrastructure security issues.

  • Monitor key metrics that reflect application reliability, performance, and robustness.

  • Understand key principles of operating high-scale distributed computing environments that support our on-shore and on-ship guests.

  • Perform other job relate functions as assigned.

Nature & Scope:

  • Incumbent reports directly to Manager Web Applications Architect under general supervision.

  • This role is focused both on the support of internal systems and production activities that are often guest facing with a commitment to client success through a balance of quality, prioritization, and efficiency.

  • An experienced SRE has deep knowledge and understanding of network systems and IT infrastructure for support, maintenance and planning purposes.

  • The successful SRE is team oriented, leading in their area of expertise and supporting peers with enthusiasm and mutual respect.

  • Curiosity, ability to quickly troubleshoot, and enthusiasm for peak operational excellence are hallmarks of the successful SRE.

  • Breadth of experience is extremely valuable.

Education:

  • Bachelor’s Degree in Computer Science or related field of study or any equivalent relevant experience in a related field.

Experience:

  • Minimum 4 years of IT support and planning experience.

  • Demonstrated experience with server support. Experience with Javascript, Scala, Akka, .NET, PHP, Ruby and other Web programming languages desired.

  • Experience with quality assurance activities and with agile development practices preferred.

  • Mission-ciritical production support experience is required.

Knowledge & Skills:

  • Deep knowledge and understanding of network systems and IT infrastructure for support, maintenance and planning purposes.

  • Thorough understanding of network architecture and problem solving.

  • Thorough understanding of Windows and Linux based computers systems.

  • Expert knowledge of server installation,configuration, and support.

  • Understands the benefits of the various design methodologies and development patterns.

  • Familiarity with the Java, Scala, Akka, PHP, JavaScript, Apache, Drupal, and Ruby.

  • Solid working knowledge of Docker, Kubernetes, VMware or related virtualization technology, LDAP, J2EE application servers, GIT, DNS, firewalls and load balancers.

  • Excellent verbal and written communication skills with ability to work well in a team based environment.

  • Ability to create enthusiasm among team members while embracing positive results-oriented change.