OklahomaCityRecruiter Since 2001
the smart solution for Oklahoma City jobs

PwC Digital Products - Site Reliability Engineer (SRE - DBaaS)

Company: PwC
Location: Oklahoma City
Posted on: June 9, 2021

Job Description:

A career in Products and Technology would provide you the opportunity to be part of an organization that is building a leading tech experience that solves big challenges for our firm and our clients. Our products and tech-driven solutions are how we move faster, cut through complexity and fuel growth. We start with the problem and solve it with experience and tech know-how. Our skilled technologists, data scientists, product managers and business strategists are passionate about using technology to accelerate change. Our external facing team is responsible for moving products into the operations phase after being successfully built and released to clients. The Operations team provides white-glove support to our client's clients and runs, operates and maintains the product with the highest level of standards.

To really stand out and make us fit for the future in a constantly changing world, each and every one of us at PwC needs to be a purpose-led and values-driven leader at every level. To help us achieve this we have the PwC Professional; our global leadership development framework. It gives us a single set of expectations across our lines, geographies and career paths, and provides transparency on the skills we need as individuals to be successful and progress in our careers, now and in the future.

As a Manager, you'll work as part of a team of problem solvers, helping to solve complex business issues from strategy to execution. PwC Professional skills and responsibilities for this management level include but are not limited to:

  • Develop new skills outside of comfort zone.
  • Act to resolve issues which prevent the team working effectively.
  • Coach others, recognise their strengths, and encourage them to take ownership of their personal development.
  • Analyse complex ideas or proposals and build a range of meaningful recommendations.
  • Use multiple sources of information including broader stakeholder views to develop solutions and recommendations.
  • Address sub-standard work or work that does not meet firm's/client's expectations.
  • Use data and insights to inform conclusions and support decision-making.
  • Develop a point of view on key global trends, and how they impact clients.
  • Manage a variety of viewpoints to build consensus and create positive outcomes for all parties.
  • Simplify complex messages, highlighting and summarising key points.
  • Uphold the firm's code of ethics and business conduct.

Job Requirements and Preferences:

Basic Qualifications:

Minimum Degree Required:

Bachelor Degree

Additional Educational Requirements:

In lieu of a Bachelor Degree, 12 years of professional experience involving technology-focused process improvements, transformations, and/or system implementations.

Minimum Years of Experience:

5 year(s) years professional experience with various flavors of Linux and/or Windows, supporting and troubleshooting full stack applications, and cloud computing technology and its concepts (Azure, AWS, GCP).

Preferred Qualifications:

Degree Preferred:

Master Degree

Preferred Knowledge/Skills:

Demonstrates extensive abilities and/or a proven record of success in the following areas:

  • Providing SRE support for multiple distributed software applications (client-facing - internal & external);
  • Managing and continually improving platform infrastructure and applications with high reliability, resiliency, performance & quality, and faster time-to-market taking a holistic view of system health into account;
  • Gathering and analyzing metrics from both systems and applications for performance tuning and fault finding;
  • Partnering with development teams to improve services through rigorous testing and release procedures meeting security, compliance & performance requirements;
  • Participating in systems design, platform management, and capacity planning. Ensure that platforms are designed with "operability " in mind;
  • Pursuing the discovery of system faults throughout the application lifecycle - before & after release;
  • Defining, Implementing and being accountable for Velocity & Reliability (SLIs, SLOs, Error Budgets);
  • Creating & supporting sustainable systems and services through automation (to drive the problems away not just mere automation) and uplifts for infrastructure, testing, failover solutions, failure mitigation, etc.;
  • Writing, updating, and using documentation, including runbooks/playbooks; and,
  • Using Chaos Engineering to test the robustness of the systems and applications.

Demonstrates extensive abilities and/or a proven record of success in the following areas:

  • Having experience in one or more of the following: Go, Python, Ruby, Java, Perl, Shell, or Powershell;
  • Having experience with CI/CD tool chain- Git, Jenkins, Azure DevOps. Veracode, SonarQube, JFrog Artifactory;
  • Having experience with IaC with Terraform, ARM templates, and/or AWS CloudFormation templates;
  • Having experience with configuration management tools like Ansible, Puppet and/or Chef;
  • Having experience with DBaaS/Managed Cloud database technologies such as CosmosDB, DynamoDB, Managed SQL (RDS, SQL Database), In-memory (Cache for Redis, ElastiCache);
  • Having experience with application performance monitoring tools (AppDynamics, Azure application insights, Dynatrace, or Datadog) and log management tools (Azure Monitor's log analytics, Elastic Stack, and/or Splunk) defining, creating and configuring metrics for dashboards and alerts;
  • Having experience with distributed storage technologies like Azure (Blob, Files, Tables), S3, NFS, HDFS;
  • Having experience with Web server technologies- HTTP, Nginx, Apache, Tomcat;
  • Having experience in Kafka, Azure Event hubs or similar message queue technologies;
  • Having experience with Service mesh platforms such as Istio, Hashicorp Consul;
  • Having experience with Secrets Lifecycle management (Azure Keyvault, Hashicorp Vault);
  • Having experience on minimal or near zero downtime deployments as Blue-Green, Canary, rolling upgrades, etc.;
  • Defining and implementing HA, DR and rollback strategies along with the product and build teams;
  • Possessing proficiency in Networking concepts (HTTP/S, TCP/IP, DNS, Virtual Networks (VNet, VPC), Subnets, Routing, Firewalls, and Network Security, triaging packet loss etc) and knowledge on RESTful APIs;
  • Having experience with 24x7x365 monitoring, incident response/oncall support;
  • Having experience in troubleshooting that spans systems, network, and code;
  • Having experience determining & negotiating Error budgets, SLIs, SLOs, and SLAs with product owners;
  • Demonstrating systematic problem-solving approach, coupled with proven communication skills;
  • Demonstrating the ability to work independently and as a member of a greater team, including cross-team activities; and,
  • Having experience working in Agile Scrum, Kanban methodologies in SDLC.

Keywords: PwC, Oklahoma City , PwC Digital Products - Site Reliability Engineer (SRE - DBaaS), Other , Oklahoma City, Oklahoma

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category

Log In or Create An Account

Get the latest Oklahoma jobs by following @recnetOK on Twitter!

Oklahoma City RSS job feeds