Senior Site Reliability Engineer
- Job ID
- 56219
Overview
At Ford, you’ll work on ideas that matter, alongside passionate people who want to make a global impact. Together, we’re shaping the next era of transportation—grounded in purpose, driven by progress. Make your move.
- Job Type: Full time
- Work Type: Hybrid
Senior Site Reliability Engineer
- Job ID
- 56219
Enterprise Technology plays a critical part in shaping the future of mobility. If you’re looking for the chance to leverage advanced technology to redefine the transportation landscape, enhance customer experience and improve people’s lives, this is the opportunity for you. Join us and challenge your IT expertise and analytical skills to help create vehicles that are as smart as you are.
In this position...
As a Senior Site Reliability Engineer, you will be instrumental in ensuring the reliability, performance, and scalability of the critical Ford Service Reservation Platform and its associated applications. This role demands a deep focus on SRE and platform engineering principles, advanced observability, robust automation, and proactive incident management.
Based in Dearborn, MI, this is a hybrid position with a required four-day onsite presence each week.
What you'll do...
SRE Leadership & Strategy:
- Lead the implementation and continuous evolution of Site Reliability Engineering (SRE) practices to ensure exceptional high availability, performance, and scalability for the Ford Service Reservation Platform and its applications.
- Define, implement, and rigorously maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets for key services, directly aligning reliability goals with critical business and customer outcomes.
- Generate regular SLO and error budget reports, collaborating closely with engineering teams to strategically prioritize reliability work, incident follow-ups, and targeted technical debt reduction efforts.
- Lead weekly status and reliability reviews, effectively communicating risks, performance trends, and improvement opportunities to key stakeholders in engineering and product.
- Champion data-driven decision-making, leveraging observability insights to significantly improve incident response, reduce Mean Time to Resolution (MTTR), and enhance the overall customer experience.
Observability & Monitoring:
- Own, evolve, and optimize comprehensive observability solutions, primarily utilizing Dynatrace for full-stack visibility, Real User Monitoring (RUM), synthetic monitoring, and infrastructure monitoring across critical user journeys of the Ford Service Reservation Platform.
- Design and implement robust Google Cloud Platform (GCP) observability patterns for logs, metrics, alerts, and dashboards specifically tailored for the Ford Service Reservation Platform and its associated applications.
- Leverage Dynatrace and GCP log analytics insights to proactively drive incident reduction, facilitate efficient root cause analysis, and foster continuous performance improvements across all Ford Service Reservation services.
Automation & Infrastructure as Code (IaC):
- Develop and deploy infrastructure as code using Terraform scripts for the provisioning and management of GCP resources, including networking, load balancing, and monitoring artifacts etc.
- Configure and maintain essential DevSecOps tools such as SonarQube, FOSSA, Cycode, and 42 Crunch to ensure code quality and security.
- Build reusable, scalable Terraform modules to automate the provisioning of GCP monitoring artifacts, including log-based metrics, alerting policies, uptime checks, and comprehensive dashboards.
- Develop and maintain robust CI/CD pipelines utilizing Tekton PAC and/or GitHub Actions for application code deployment, automated operational tasks (e.g., instance management, cache invalidation, and data backups), and infrastructure changes.
- Manage GitHub repositories for application code, automation scripts, and configuration management.
Incident & Problem Management:
- Establish and continually refine Incident Management and Problem Management processes, coordinating effectively with application teams for rapid resolution and thorough root cause analysis of issues.
- Identify systemic and application-specific issues through detailed analysis of observability data and collaborate proactively with development teams to prioritize feature requests and defect resolutions that enhance reliability.
Technical Skills & Competencies:
- Cloud & Infrastructure (GCP Focus)
- GCP Expertise: Deep understanding of Google Cloud Platform services, specifically networking (VPC, Firewalls), Load Balancing, GKE (Google Kubernetes Engine), and IAM.
- Infrastructure as Code (IaC): Advanced proficiency in Terraform for provisioning cloud resources and managing infrastructure state.
- Linux/Systems: Strong command of Linux internals and administration.
- Observability & Monitoring
- Dynatrace Mastery: Extensive experience using Dynatrace for full-stack monitoring, including Real User Monitoring (RUM), synthetic monitoring, and root cause analysis.
- Cloud Native Monitoring: Experience designing GCP-specific observability patterns using Log Analytics, Cloud Monitoring, and alerting policies.
- SRE Metrics: Ability to define and implement Service Level Indicators (SLIs) and Service Level Objectives (SLOs) that map to business value.
- Automation & CI/CD
- CI/CD Tooling: Hands-on experience building and maintaining pipelines using Tekton PAC (Pipelines as Code) and/or GitHub Actions.
- Scripting: Proficiency in at least one automation language, such as Python, Go, or Bash.
- Version Control: Advanced knowledge of GitHub for repository management and collaborative development.
- Security & Quality (DevSecOps)
- Security Tooling: Familiarity with integrating and managing security/quality tools such as SonarQube, FOSSA, Cycode, and 42 Crunch within the development lifecycle.
- Leadership & Process Skills
- Incident Management: Experience acting as an Incident Commander or leading "Post-Mortem" (Blameless Root Cause Analysis) sessions to prevent recurrence of systemic issues.
- Data-Driven Mindset: Ability to translate complex observability data into actionable insights for engineering and product stakeholders.
- Communication: Strong ability to lead weekly reliability reviews and communicate technical risks to non-technical stakeholders.
You'll have...
- Bachelor’s degree in Computer Science, Computer Engineering, Systems Engineering or equivalent combination of relevant education and experience.
- 7+ years of experience in Software Engineering, DevOps, or Systems Administration.
- 5+ years of dedicated experience in a Site Reliability Engineering (SRE) or Platform Engineering role.
- 2+ years of experience leading technical initiatives or mentoring junior engineers in an SRE context.
Even better, you may have...
- Master’s Degree in Computer Science, Computer Engineering, Systems Engineering or related field
- Certifications:
- Google Professional Cloud Architect or Google Professional Cloud DevOps Engineer.
- Dynatrace Professional Certification.
- Terraform Associate Certification.
- Platform Experience: Prior experience working on high-traffic reservation systems, e-commerce platforms, or automotive service applications.
You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!
For more information on salary and benefits, click here:
This position is a range of salary grades 7-8.
Visa sponsorship is available for this position.
Candidates for positions with Ford Motor Company must be legally authorized to work in the United States. Verification of employment eligibility will be required at the time of hire.
We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status or protected veteran status. In the United States, if you need a reasonable accommodation for the online application process due to a disability, please call 1-888-336-0660.
#LI- hybrid
#LI-LA1
Looking for jobs tailored to you?
Upload Your ResumeChange the world with your ideas.
Speak up. We’re listening. At Ford, we believe the right ideas, and the people behind them, can move an entire industry. Here, you'll work with teams who value your voice, push bold ideas forward, and leave a mark that lasts.
Support designed to go the distance.
From day one, Ford invests in you with medical benefits built to help you plan for what’s next. You’ll also get support for you and your family that meets you at every step, so you can move forward with confidence.
-
Health and Wellness
Comprehensive medical, dental, vision, mental health, and unique wellness perks keep you and your family supported every step of the way.
-
Financial and Retirement Programs
Build a strong financial future with robust retirement contributions, savings programs, and free personalized financial planning tools.
-
Flexible Vacation and Holidays
Enjoy your time away from work with generous vacation, holidays, and flexible family leave designed to help you balance life and work with ease.
-
Vehicle Discount Program
The best thing about building great products is driving them! The second-best thing: sharing your discount with family and friends so they can drive them, too.
-
Family Growth and Support
Grow your family confidently with fertility, surrogacy, and adoption assistance, paid parental leave, and a supportive new-parent ramp-up program.
-
Additional Programs
Enjoy unexpected extras like pet insurance, legal services, identity protection, and access to convenient health and wellness services.
Testimonial
Ford gives me the space to innovate, to lead, and to serve — all while staying true to who I am as an engineer, educator, and parent. It’s not just a career; it’s a community that drives change.
-
Built on one bold idea and the passion to define sustainable transportation for generations to come, Ford is a story about people with a vision that’s still being written.
What We Do -
Ford’s culture fuels the kind of momentum where ideas flow, progress is unstoppable, and our people keep redefining what it means to innovate.
Our People and Culture -
At Ford, your work matters, your life matters and we’re here to back the whole you—from growth to well-being—so you show up ready to realize your full potential.
Your Benefits
Jobs For You.
Explore roles tailored to your interests, based on your preferences and experience.
-
VHES Business Supervisor
- Dearborn, Michigan
-
Platform Hardware Technical Engineer
- Dearborn, Michigan
-
Category Implementation Buyer-Engine Castings
- Dearborn, Michigan
-
Commodity Calibration Engineer
- Dearborn, Michigan