Upshop is the market leader in Total Store Operation solutions for the Grocery and C-Store markets. We offer an AI-powered, SaaS platform connecting Fresh, Center, eCommerce, and DSD department operations to deliver a simplified, smarter, more connected store experience. Customers running Upshop realize significant improvements in sales, shrink, food safety and sustainability across the entire store. ๐ 2024 Exciting News! Upshop and Invafresh Unite to Transform Global Food Retail Technology ๐ We are thrilled to announce the launch of the first global retail operations platform built specifically for food retailers. This groundbreaking innovation is designed to continuously redefine best-in-class store operations and enhance the shopper experience. ๐น Global Reach with Best-in-Class Technology: Serving 35+ countries and 400+ retailers ๐น Addressing Major Food Challenges: Availability, affordability, and waste ๐น Empowering Associates: Enabling them to deliver the best shopping experience possible http://www.upshop.com
Careers at Upshop
About the Role
We are seeking a seasoned SRE / DevOps Manager to lead our reliability and operations engineering team. You will be responsible for ensuring the scalability, security, and performance of our infrastructure while fostering a culture of automation, ownership, and continuous improvement.
At Upshop, we believe that great businesses are built by great people. Our People function is at the heart of our companyโs growth, ensuring we attract, develop, and retain A Players who drive our mission forward.
Our Values:
- Extremely Accountable
- Customer Obsessed
- Always Innovating
- Demand Excellence
- Biased for Action
Key Responsibilities
Team Leadership
- Manage and mentor a team of SRE and DevOps engineers.
- Drive hiring, onboarding, and professional development.
- Set clear goals and performance metrics.
Reliability & Incident Management
- Own system uptime, performance, and reliability.
- Lead incident response and root cause analysis.
- Define and monitor SLAs, SLOs, and SLIs.
Infrastructure & Automation
- Oversee cloud infrastructure (Azure).
- Implement Infrastructure as Code (IaC) using tools like Terraform or other similar tools
- Drive automation of CI/CD pipelines and operational tasks.
- Build and manage a DevSecOps process to connect CI/CD pipelines with AzureDevOps, Gitlab etc.
Monitoring & Observability
- Implement and maintain monitoring, alerting, and logging systems.
- Use tools like Datadog or other similar tools like Prometheus, Grafana, ELK stack.
Security & Compliance
- Ensure infrastructure security and compliance with industry standards.
- Collaborate with InfoSec teams on audits and vulnerability management.
Cross-functional Collaboration
- Work closely with software engineering, product, and QA teams.
- Advocate for DevOps and SRE best practices across the organization.
Qualifications
- 10+ years of experience in DevOps, SRE, or infrastructure engineering.
- 2+ years in a leadership or managerial role.
- 3+ years of expertise with Cloud platform deployments
- 3+ years of experience working with MongoDB and cosmosdb
- Strong experience with cloud platforms (AWS, GCP, Azure).
- Proficiency in scripting languages (Power shell scripting, Python, Bash, Go).
- Hands-on experience with Kubernetes, Docker, CI/CD tools.
- Excellent communication and leadership skills.
Preferred Qualifications
- Experience with compliance frameworks (SOC 2, ISO 27001).
- Familiarity with Agile and DevOps methodologies.
- Certifications in cloud technologies or DevOps practices.
Benefits/Perks
- Hybrid Opportunity
- Competitive salary
- Employer-matched 401(k) plan
- Attractive paid time off policy
- Career growth and development opportunities