Hi, this is *Abdul Raheem* from *ExaTech Inc*. I have an immediate opening, described below. Please review it and let me know if you are interested.

*Job Title: SRE Lead Engineer*
Location: Austin [Hybrid]
Duration: 12+ months
Interview: Video

*Job Description:*
We are currently seeking a highly skilled, hands-on SRE Lead Engineer with solid experience to help lead transformational initiatives within IT operations, encompassing development as well. In this role you will help design and implement cutting-edge SRE solutions, driving the transformation of IT operations organizations toward an engineering-centric approach.

*Responsibilities:*
- Participate in the design and architecture of reliable, scalable, high-performance systems and services, with a focus on operational excellence, availability, and performance.
- Primary skill set: expertise in observability as a service and telemetry data collection using Dynatrace APM, SolarWinds, open-source tools (Prometheus and Grafana), log aggregation (Kibana or Splunk), and AIOps tools.
- Configure application performance monitoring (APM), infrastructure monitoring, synthetic monitoring, real user monitoring (RUM), and log monitoring.
- Integrate Dynatrace with CI/CD pipelines, alerting tools, ITSM systems, and incident automation frameworks.
- Tune alert thresholds, baselines, and AI-driven anomaly detection to reduce noise and improve actionable insights.
- Maintain a deep understanding of login authentication mechanisms using Ping, ForgeRock, and SiteMinder technologies (session management and cookie management).
- Build correlation mechanisms and dashboards that give end-to-end visibility of requests from external to internal applications.
- Evangelize SRE evolution within IT operations and promote a culture of engineering excellence and best practices.
- Define best practices and principles for SRE, including incident management, monitoring, alerting, and automation.
- Collaborate with development teams on resiliency to ensure that services and applications are designed with operational reliability in mind.
- Implement monitoring systems to assess the performance of applications and infrastructure, and proactively identify areas for optimization.
- Understand the incident and problem management process, run post-mortems, and drive improvements to prevent future incidents.
- Analyze resource utilization patterns and forecast future capacity needs to ensure optimal performance and cost-efficiency.
- Ensure that SRE practices align with security and compliance requirements, and implement measures to protect systems and data.
- Pursue operational excellence with a focus on automation, developing tools to streamline operational tasks and increase efficiency.
- Provide guidance and mentorship to SRE teams, fostering skill development and building a strong and capable SRE practice.
- Develop close relationships with other operational teams to integrate SRE practices and drive operational improvements across the enterprise.
- Stay up to date on industry trends, new technologies, and best practices in SRE, and apply relevant advancements to the organization.
- Build strong working relationships across different levels, with a client-focused mindset.

*Qualifications:*
- Around 10-12 years of hands-on SRE experience with cloud technologies, development, SRE toolsets, and automation.
- Experience owning the design, configuration, deployment, and optimization of Dynatrace for enterprise-wide observability.
- Experience defining monitoring standards, best practices, and governance to ensure consistency and scalability.
- Strong skills in APM, distributed tracing, synthetic and real user monitoring, log monitoring, and Davis AI configuration.
- Experience deploying and tuning OneAgent, building end-to-end PurePath tracing, and leveraging Smartscape topology for proactive performance monitoring and root-cause analysis.
- Experience integrating Dynatrace with incident management, automation, and cloud platforms (AWS, Azure, GCP).
- Strong problem-solving skills and the ability to work in cross-functional, fast-paced environments.
- Experience collaborating with application and infrastructure teams to troubleshoot performance issues and implement permanent fixes.
- Experience with correlation mechanisms and dashboards that give end-to-end visibility of requests from external to internal applications.
- Strong hands-on experience with a cloud technology (AWS): Control Tower, project setup, account creation, RDS, SSO.
- Solid understanding of and hands-on experience with Docker/Kubernetes.
- Good experience with Linux commands, GitLab CI/CD setup, and Terraform (state management, etc.).
- Monitoring and alerting setup experience with Splunk, Prometheus, Grafana, Kibana, ELK, etc.
- Good understanding of an observability framework that uses programmatic SLI/SLO blueprints to standardize the collection of golden signals.
- Automation experience (data refresh, releases, DB snapshots) using Ansible or a scripting language.
- Experience with the following languages (Groovy DSL, Java, Python, YAML) and with microservices architecture.
- Good understanding of and hands-on experience with MQ and Kafka.
- Experience with databases (Oracle, MySQL).

*Good to have:*
- Any of the relevant professional certifications: Certified Site Reliability Engineer (CSRE), Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer Professional, Google Cloud Professional DevOps Engineer.

Thanks and Regards,
*Abdul Raheem, Sr. Talent Acquisition Lead*
*Email: [email protected]*
Skype & Hangout: [email protected]
*4555 Lake Forest Drive, Suite 650 | Cincinnati, OH 45242*
An E-Verified Company
USA-Canada-INDIA

--
You received this message because you are subscribed to "rtc-linux".
Membership options at http://groups.google.com/group/rtc-linux .
Please read http://groups.google.com/group/rtc-linux/web/checklist before submitting a driver.
