Jobs / Okt***
Staff Site Reliability Engineer, Networking
Okt*** · San Francisco, CA, United States
Visa sponsorship details are locked. Unlock company name and apply link with .
San Francisco, CA, United StatesExp: 8+ yrs194,000-267,000 USD/yearlyHybrid
Remuneration
194,000-267,000 USD/yearly
Location
San Francisco, CA, United States
Visa sponsorship
Sponsors visa
Job summary
Okt*** is seeking a Staff Site Reliability Engineer to join the TCore team, which owns and operates all of Okt***'s networks. This role involves designing and implementing scalable, reliable network solutions, maintaining cloud infrastructure, and automating AWS infrastructure. The ideal candidate will have extensive experience in cloud networking and a passion for network responsiveness and performance.
Benefits
EquityBonusHealth insuranceDental insuranceVision insurance401(k)Flexible spending accountPaid leavePTOParental leave
Qualifications
- 8+ years experience in a Cloud Network Engineer role or related field
- In-depth understanding of TCP/IP networking stack (layer 2 through 7)
- Ability to implement a highly available VPC network, including inter-VPC connectivity
- Working knowledge of stateless and stateful firewalls
- Familiarity with DNS, web-application firewalls, and various cloud load balancing methods
- Deep knowledge of AWS/GCP network concepts such as Transit Gateway / Network Connectivity Center (NCC), Site-to-Site VPN / HA VPN, and Direct Connect / Cloud Interconnect
- Ability to troubleshoot network issues using AWS VPC flow logs, Cloudwatch metrics, GCP VPC Flow Logs / Cloud Logging, and standard packet captures
- Experience working with Terraform, Ansible, Chef, Puppet or similar automation tools
- Proficiency in Bash, Python, Golang, or similar scripting languages
- Experienced with Git
- Ability to collaborate effectively with multiple stakeholders
- Willingness to work on-call
- Experience working in a security-oriented cloud environment (extra credit)
- Working knowledge of Palo Alto next-gen virtual firewalls, including implementation, configuration of security policies, routing, and Global Protect (extra credit)
- Experience with GCP-specific advanced architecture like Shared VPC topologies, Cloud Router BGP configurations, and Network Connectivity Center (NCC) (extra credit)
- Ability to access federal environments and/or protected federal data
- Ability to submit documentation establishing U.S. Person status upon hire (e.g., U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee)
Responsibilities
- Design and implement scalable and reliable network solutions with various teams
- Maintain a highly available cloud infrastructure edge for the Okt*** identity platform
- Collect and analyze data to identify root causes for network-specific events
- Automate AWS infrastructure with Terraform and/or Chef
- Evolve the system by introducing changes to improve efficiency, scalability, and velocity
Skills
AnsibleAWSBashChefCloudWatchGCPGitGoOktaPuppetPythonSlackTerraform
Languages
BashPythonGolang
Work schedule
On-call
Relocation
No