Scalyr | San Mateo | Site Reliability Engineer | Full-Time, Onsite
We are looking for a Site Reliability Engineer who can help keep our uptime promise to our customers by making sure we meet our SLOs, and who can help our engineering teams ship software to our customers fast and with quality. In this role, you will have an amazing opportunity to drive outcomes that improve the reliability, stability and cost efficiency of Scalyr. We are looking to add an SRE with extensive prior operations experience on a SaaS product who can drive deployment re-architecture with a focus on self-service and automation. Someone who has driven continuous deployment, run incident post-mortems, provided feedback on engineering architecture decisions and automated repetitive operational tasks would be a great fit. You will join a like-minded team of awesome SREs who help run our operations smoothly at scale. We value good written communication skills, data-driven decisions and a keen eye for continuous improvement. You’ll help simplify, have a passion for new ideas and know how to execute iteratively towards the final goal. We value candor and collaboration.
SRE and DevOps are two titles used loosely in the industry. An SRE at Scalyr defines and provisions the common set of tools the engineering teams use, and facilitates dev <-> ops collaboration by consulting on and driving best practices. SRE at Scalyr is also responsible for uptime and for providing feedback to engineering on architecture. We dogfood Scalyr for our operations, so an SRE also acts as a product owner providing product feedback.
Scalyr | Site Reliability Engineers | Full-Time | San Mateo, CA
Scalyr’s mission is to provide a different approach to unified observability and log management, one built for modern application development and deployment practices. Founded by Steve Newman, who was also the founder and lead engineer of Writely (aka Google Docs), and led by tech industry veteran and CEO Christine Heckart, Scalyr offers an integrated and extensible suite of monitoring, management, visualization and analysis tools that aggregate and search all the signals needed for real-time observability, including logs, metrics and traces. We are the only observability and log management provider that does not index data, scales horizontally, is blazing fast and is ultra-affordable. The opportunity in front of us is huge and we are still in the very early days. This is going to be one of those companies where people will look back and say “I wish I’d been there when…” Well, this is your chance to be part of “when”.
We are growing fast, thanks to the unique purpose-built database technology created by Steve. Our solution operates at petabyte scale, boasts blazing-fast search and makes data available for search within ~2 seconds of ingestion. “Existing log management tools were often slow and clunky, so we were facing a challenge, but the good kind: an opportunity to deliver a new user experience through solid engineering.” With Scalyr, we keep users like you “in the zone” as they handle incidents or debug cloud applications.
You will have the opportunity to gain excellent on-the-job experience working for a fast-moving software division building full-stack microservices. You will develop performant, scalable applications that are translated into 23 languages and used in 192 countries. You will scale messages-per-second processing throughput and handle large volumes of time-based data. Applications run in Amazon Web Services, and you will leverage Docker, AngularJS, WebPack, Bootstrap, TypeScript, NodeJS, C# .NET Core and Apache Kafka. Come join us if you like solving hard scaling problems that involve billions of rows.
Scalyr’s mission is to build the best tool for engineers to understand their operational systems. Our founder, Steve Newman, cofounded Writely (aka Google Docs). Frustrated that visibility tools, even Google’s in-house tools, weren’t keeping up, Steve started Scalyr to create a better solution. It’s lightning fast and feature-rich, and customers love it.
Any interest? Please reach out to me at jenny@scalyr.com, or apply directly at https://www.scalyr.com/careers/site-reliability-engineer/