Jobs / Int***

Senior Observability Engineer

Int*** · PA, United States
Visa sponsorship details are locked. Unlock company name and apply link with .
PA, United StatesExp: 5-7 yrsOnsite
Remuneration
Not specified
Location
PA, United States
Visa sponsorship
Sponsors visa

Job summary

Seeking a Senior Observability Engineer to administer and maintain observability tools like Splunk, AppDynamics, and Zenoss, ensuring optimal performance and reliability of IT systems.

Qualifications

  • Minimum of 5–7 years in Observability/Monitoring/Site reliability engineering
  • Proven experience in implementing, managing and maintaining observability tools
  • Proficiency in Splunk and AppDynamics
  • Proficiency in Zenoss
  • Strong in MELT, Metrics, Events, Logs and Traces
  • Hands-on troubleshooting and support
  • Experience with OpenTelemetry instrumentation patterns
  • Maintain platform reliability, upgrades, patching, and security hardening
  • Exposure to Kubernetes observability
  • Strong knowledge of IT infrastructure, applications, and networking
  • Experience with scripting and automation tools
  • Familiarity with cloud environments
  • Excellent problem-solving and analytical skills
  • Strong communication and collaboration abilities
  • Ability to work independently and in a team-oriented environment
  • Experience with other monitoring and observability tools
  • Knowledge of DevOps practices and CI/CD pipelines
  • Hands-on Infrastructure-as-Code and Git-based workflows

Responsibilities

  • Administer and configure Splunk, AppDynamics, OTEL and Zenoss platforms
  • Perform regular updates, patches, and upgrades to observability tools
  • Continuously monitor the health and performance of the Splunk, APPD and Zenoss systems
  • Ensure data integrity and availability within the observability platforms
  • Provide support to internal users, assisting with troubleshooting and resolving issues
  • Develop and deliver training sessions for users
  • Create and manage dashboards, reports, and alerts
  • Work with stakeholders to define monitoring requirements
  • Manage onboarding and alert creation
  • Optimize system performance by tuning configurations
  • Maintain comprehensive documentation of configurations, processes, and procedures
  • Develop and enforce best practices for monitoring and observability
  • Collaborate with IT and DevOps teams
  • Participate in incident response efforts

Skills

AnsibleAppDynamicsAWSAzureBashGitGrafanaKubernetesOpenTelemetryPrometheusPythonSplunkTerraform

Degrees

Bachelor's degree in Computer ScienceInformation TechnologyRelated field

Relocation

No