Job Description
Site Reliability Engineer
Onsite- Bay Area, CA
Skills
Relevant Skills and Experience
What You’ll Do (Day-to-Day)
Own and manage our cloud infrastructure (GCP or AWS, on-prem).
Build, maintain, and optimize Kubernetes clusters (including GPU-backed clusters).
Implement and improve CI/CD pipelines (GitHub Actions).
Write and maintain Infrastructure as Code (Terraform).
Monitor system health and performance using Grafana and other observability tools.
Ensure high availability, reliability, and uptime across platforms.
Handle infrastructure maintenance, upgrades, and scaling.
Administer and improve our platform architecture and apply general security best practices across the stack.
Note: This is an internal-facing role — no customer interaction.
Must-Have:
4+ years in SRE, DevOps, or Infrastructure Engineering
Solid experience with GCP or AWS (hybrid/on-prem a plus)
Experience with Kubernetes cluster management (GPU experience a bonus)
Hands-on with Terraform and CI/CD (GitHub)
Experience with monitoring/observability (Grafana, etc.)
Strong understanding of high availability and infrastructure reliability
Familiarity with platform/cluster architecture and administration
Security mindset and ability to apply best practice
Nice-to-Have:
Startup experience (you enjoy building, not just maintaining)
Experience with scalable GPU infrastructure for AI/ML
...HQ. Position Summary: We are looking for a Junior Software Engineer to join our dynamic team supporting the development... ...security clearance. Preferred Qualifications Internship, co-op, or project experience. Experience with Generative AI applications. Experience...
...leading mental healthcare provider is seeking experienced Psychologists to join their clinical team in Manchester, NH. This role offers flexible work schedules, competitive compensation ranging from $145,000 to $155,000, and benefits including health, dental, vision, and a 4...
...engineering projects. As a subsidiary of Winn & Coales International, Denso provides solutions for industries including petrochemical, offshore, water, and oil and gas, with an emphasis on durability, long-term protection, and solutions for harsh and varied environments....
...If you're looking for a job ASAP and have general Test Technician or Computer or IT or Software experience then this is the position... ...process so you start work as soon as possible. Experience Level: Entry - Expert Level Skills: IT / Software /...
...Position DescriptionHuman and Legal RightsCommitteeVolunteer PositionThe NorthStar Services Humanand Legal Rights Committee (HLRC) usually meets monthly at 10:30 am via Go ToMeeting. This volunteer position is tobe a member of the HLRC. The purpose ofthe...