Site Reliability Engineer
Vacancy expired!
100% REMOTEThis Jobot Job is hosted by: Merwan ZattamAre you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume.Salary: $110,000 - $125,000 per yearA bit about us:We operate a unique collective of tech-forward companies serving the residential, commercial, and vacation rental industries. Our brands' strategic partnerships deliver transformative software solutions and services across our brands. We believe that property and vacation rental managers should have the opportunity to choose the platforms that best support their business goals and that they should be able to benefit from strategic partnerships across our ecosystem.Why join us?100% RemoteGreat BenefitsWork with cutting edge technologyFast growing organization with growth opportunityJob DetailsJob Description As a Site Reliability Engineer (SRE) within the Cloud Engineering team, you will play an essential role in the reliability of our infrastructure services. SRE works in such a way that you think through all the scenarios in which our system would be vulnerable and then help drive requirements for our software engineers to address those issues. SRE don't mind rolling up their sleeves and building the necessary tools, frameworks, prototypes, and tests to help our team solve these reliability issues.The position ensures the timely delivery of high-quality IT services and projects supporting the company's systems and operations. This position will also engage in resolving complex performance issues across enterprise platforms.Functions and Responsibilities
- Manage development and production environments by monitoring reliability and availability
- Utilize automation to improve productivity, workflow, and technology deployment
- Build and support optimized network infrastructure
- Perform capacity planning and ensure teams anticipate and prepare for growth
- Responsible for maintaining tools/systems/platforms for cloud services
- Collaborate with development teams to ensure that platforms are designed with "operability" in mind.
- Assist in the roll-out and deployment of new product features and installations to new cloud infrastructure
- Provide primary operational support and engineering for multiple software applications
- Support incident escalation and troubleshooting
- Proven work experience as a Site Reliability Engineer or similar role
- Strong experience in Linux & Windows Server systems.
- Experience managing MS SQL/Postgres SQL applications.
- Experience with Azure and Amazon Cloud Services
- Experience with scripting language
- DevOps philosophies
- 2+ years with site reliability engineering
- 2+ years of experience with IaaS environments such as MS Azure, AWS, GCP
- Advanced knowledge of TCP/IP networking, architecture, and core technologies (such as DNS, HTTP/S, Routing, VPN)
- 3+ years Linux administration
- 2+ years automation process implementation
- Experience with Infrastructure as Code (IaC)
- Bachelor's degree in MIS, technical discipline, or equivalent experience