Prometheus jobs

Machine Learning Architect
Tekion
Pleasanton, United States (city)
$225k - $300k
Architect
Front-end
Tensorflow
Apache
Python
Data science
AWS
Redis
DevOps
Grafana
MongoDB
PyTorch
Kafka
Cloud
Big Data Engineer
ML Engineer
Kubernetes
Postgres
Back-end
AI
ElasticSearch
Java
Docker
Prometheus
Open Source
Azure
Posted 16 hours ago
Site Reliability Engineer - Core
Blockchain.com
London, United States (city)
Python
Site reliability engineer
Prometheus
GitHub
Cloud
Bash
Network
GCP
Linux
Crypto
Developer
Golang
AWS
Grafana
Blockchain
Datadog
Terraform
Kafka
Posted 23 hours ago
Site Reliability Engineer
Roadie
United States, Northern America (country)
Grafana
Terraform
S3 Bucket
ElasticSearch
Agile
Kubernetes
Ruby
Golang
Bash
DevOps
Site reliability engineer
Ruby on rails
Postgres
React
Python
Swift
Network
CircleCi
Objective-C
Prometheus
Kafka
Android
GCP
Redis
AWS
Docker
Git
Helm
Posted 3 days ago
Director of Engineering, Cloud & Reliability
Boulevard
Canada, Northern America (country)
$218k - $312k
Grafana
Terraform
Cloud
Datadog
DevOps
AWS
Architect
Prometheus
Kubernetes
Posted 3 days ago
Site Reliability Engineering Technical Leader, Network Assurance Data Platform
Cisco ThousandEyes
Bengaluru, India (city)
ML Engineer
Big Data Engineer
Cloud
Prometheus
Terraform
Kafka
Unix
AWS
AI
DevOps
Linux
Kubernetes
Grafana
Python
Network
Site reliability engineer
Developer
Posted 4 days ago
Systems Administrator 1 - DevOps/Git/Linux/Windows/Cloud/SaltStack/Puppet
Captivation Software
Maryland, United States (region)
$130k - $270k
Linux
Python
Prometheus
Ansible
Kubernetes
Bash
GCP
Cloud
Chef
Docker
AWS
Azure
Git
Posted 5 days ago
Software Integration Engineer 3 - Bash/Python/Kubernetes/Docker/Helm
Captivation Software
Maryland, United States (region)
$130k - $270k
Linux
Grafana
Bash
Python
Jira
DevOps
Docker
Prometheus
Git
Kubernetes
Helm
Ansible
Posted 5 days ago
Senior System Engineer - Public Cloud Scalability
Leaseweb
Amsterdam, Netherlands (city)
Linux
Golang
Cloud
Python
Chef
Nagios
Bash
Prometheus
MySql
Scrum
Kubernetes
DevOps
API
Shell
Posted 5 days ago
Staff Cloud Engineer
Aetion
Spain, Southern Europe (country)
Grafana
Terraform
S3 Bucket
Agile
Kubernetes
Big Data Engineer
SQL
DevOps
GitHub
Azure
Site reliability engineer
Linux
Python
Network
Architect
Prometheus
Ansible
GCP
Unix
Cloud
Developer
AWS
Java
Shell
Jenkins
Posted 5 days ago
Site Reliability Engineer
Hometap
Boston, United States (city)
Javascript
Kubernetes
Site reliability engineer
Cloud
S3 Bucket
Lambda
Grafana
Jenkins
Terraform
Python
AWS
GitHub
Docker
GitLab
DynamoDB
Prometheus
DevOps
Posted 7 days ago
Senior LLM Engineer
SmartAsset
United States, Northern America (country)
$175k - $250k
Django
Kubernetes
Cloud
Marketing
LLM
GPT
Terraform
Python
MongoDB
GCP
Azure
ML Engineer
AWS
AI
Docker
GitLab
Data science
Helm
Prompt Engineer
Prometheus
Sales
DevOps
FastAPI
Posted 8 days ago
DevOps Engineer
Vonage
Tel Aviv, Israel (city)
Kubernetes
API
Cloud
S3 Bucket
Lambda
Grafana
Datadog
Terraform
EC2
AWS
GitHub
Docker
Helm
Prometheus
DevOps
Posted 9 days ago
Sr Site Reliability Engineer (Heavy K8; GCP/Backstage - Hosting/plugins)
Vonage
Bengaluru, India (city)
Unix
Kubernetes
Site reliability engineer
API
Cloud
Grafana
Ansible
Ruby
Shell
Terraform
Python
Developer
GCP
AWS
Linux
Open Source
Helm
Prometheus
Posted 9 days ago
Software Engineer, Server
Niantic
Sunnyvale, United States (city)
$132k - $230k
ElasticSearch
Kubernetes
Cloud
Grafana
C
Terraform
Python
Software engineer
SQL
Azure
AWS
Open Source
Prometheus
Java
Redis
Posted 10 days ago
Staff Software Engineer, Infrastructure
Freenome
San Francisco, Argentina (city)
$188k - $288k
Site reliability engineer
ML Engineer
Software engineer
Azure
Python
Kubernetes
Cloud
Linux
Big Data Engineer
Terraform
AWS
GCP
Kafka
Prometheus
Posted 12 days ago
Software Engineer, Server
Niantic
Bellevue, Australia (city)
$132k - $230k
Redis
Grafana
Prometheus
Cloud
ElasticSearch
Software engineer
SQL
C
Azure
Kubernetes
Java
Python
Terraform
Open Source
AWS
Posted 12 days ago
Software Engineer, Server
Niantic
San Francisco, Argentina (city)
$132k - $230k
Redis
Grafana
Prometheus
Cloud
ElasticSearch
Software engineer
SQL
C
Azure
Kubernetes
Java
Python
Terraform
Open Source
AWS
Posted 12 days ago
Site Reliability Engineer (Bucharest, Romania) - Fulltime
Alchemy
Bucharest, Romania (city)
Docker
Python
Web3
Grafana
Rust
Prometheus
AWS
Ansible
AI
Developer
Kubernetes
Terraform
Java
Azure
GitLab
Jenkins
GCP
Site reliability engineer
Golang
Splunk
Blockchain
Cloud
Linux
Chef
Posted 12 days ago
Site Reliability Engineer
Alchemy
San Francisco, Argentina (city)
$135k - $240k
Web3
Grafana
Prometheus
AWS
AI
Developer
Kubernetes
Terraform
GCP
Helm
Site reliability engineer
Architect
Datadog
Blockchain
Cloud
Posted 12 days ago
DevOps Engineer
Aspire
Bengaluru, India (city)
Ansible
Kubernetes
Docker
GitLab
ElasticSearch
C
Linux
Prometheus
Chef
AWS
Azure
Bash
Git
Jenkins
Terraform
DevOps
Unix
Python
Java
Ruby
Cloud
Posted 13 days ago
Principal Cloud Engineer - Remote US
Seamless.AI
Columbus, United States (city)
Kubernetes
AI
Network
Solutions Architect
DevOps
Prometheus
AWS
EC2
Docker
Terraform
Search
Big Data Engineer
S3 Bucket
Lambda
Cloud
Sales
Posted 14 days ago
SRE I
OppFi
United States, Northern America (country)
$85k - $128k
Site reliability engineer
CircleCi
Prometheus
Bash
Chef
Terraform
Azure
Java
Lambda
Cloud
GitHub
Datadog
DevOps
AWS
Kubernetes
Ansible
Jenkins
Linux
Ruby
GCP
Splunk
Python
C
Posted 15 days ago
Engineering Manager, Machine Learning Operations
PitchBook Data
Seattle, United States (city)
$240k - $280k
Prometheus
ML Engineer
Redis
Data science
Apache
ElasticSearch
Open Source
PyTorch
Java
Tensorflow
Docker
AWS
Agile
Grafana
Kafka
AI
NLP
Cloud
LLM
Kubernetes
FastAPI
Engineering Manager
GCP
SQL
Python
Posted 16 days ago
Senior Python Engineer
Inizio
London, Canada (city)
Open Source
Grafana
Scrum
Prometheus
Cloud
Kubernetes
AWS
Databricks
Python
Back-end
Agile
Azure
Posted 18 days ago
DevOps Engineer
Torq
Tel Aviv, Israel (city)
$70M - $70M
AI
Cloud
Terraform
GCP
C
Grafana
Developer
Prometheus
Jenkins
Kubernetes
GitHub
GitLab
AWS
Site reliability engineer
Bash
Python
DevOps
Docker
Posted 18 days ago
Principal – Infrastructure Architect
Verisign
Reston, United States (city)
$180k - $244k
Jira
Prometheus
Cloud
Jenkins
Architect
GitHub
Kubernetes
AWS
Docker
Posted 18 days ago
Staff Site Reliability Engineer, DevOps
Pismo
Brazil, South America (country)
Grafana
Prometheus
Terraform
Kubernetes
AWS
Site reliability engineer
DevOps
Docker
GCP
Posted 19 days ago
Principal DevOps Architect
Rumble
Sarasota, United States (city)
Redis
Site reliability engineer
nginx
Video
Grafana
Kubernetes
Bash
Cloud
PHP
C
DevOps
Python
Architect
Linux
Docker
MariaDB
Open Source
Prometheus
Rust
MySql
Posted 19 days ago
DevOps Engineer
Kinetik
Bangladesh, Southern Asia (country)
$140k - $200k
Jenkins
GitLab
Grafana
Bash
Kubernetes
Cloud
Lambda
Terraform
S3 Bucket
DevOps
Python
Docker
AWS
Prometheus
Developer
Network
Posted 19 days ago
Site Reliability Engineer
Qumulo Careers
Vancouver, United States (city)
$80k - $120k
Ansible
Linux
Python
AWS
Grafana
Azure
GCP
Kubernetes
Cloud
Site reliability engineer
Terraform
Prometheus
Posted 20 days ago
Network Reliability Engineer
May Mobility
Ann Arbor, United States (city)
$95k - $120k
Site reliability engineer
GCP
Ansible
iOs
Bash
Cloud
Golang
Shell
Terraform
DevOps
Python
Linux
Open Source
Azure
AWS
Chef
Prometheus
Network
Posted 21 days ago
Senior Machine Learning Engineer
Censys
United States, Northern America (country)
$182k - $228k
Python
PyTorch
Cloud
Azure
Prometheus
Grafana
Reinforcement Learning
DevOps
ML Engineer
Kubernetes
GCP
Golang
AWS
Computer Vision
Docker
Open Source
Data science
Helm
Posted 21 days ago
Site Reliability Engineer
Fulfil Solutions
United States, Northern America (country)
$145k - $185k
GCP
Robotics
Ruby
Network
Docker
Prometheus
Kubernetes
Grafana
GitHub
Cloud
Python
Computer Vision
AWS
Azure
Site reliability engineer
Posted 21 days ago
Software Engineer, DevOps
Verantos
Pune, India (city)
EC2
Helm
Linux
Lambda
ML Engineer
S3 Bucket
MySql
Kubernetes
Azure
AWS
GCP
DynamoDB
Ansible
DevOps
BitBucket
Grafana
AI
Docker
GitLab
Cloud
Prometheus
Terraform
Software engineer
Jenkins
GitHub
Posted 22 days ago
Site Reliability Engineer (m/f/x)
commercetools
Berlín, El Salvador (city)
Cloud
Prometheus
Kubernetes
Grafana
Python
Site reliability engineer
Terraform
Developer
AWS
GCP
Bash
Posted 23 days ago
Data Engineer New
Tech Holding
Ahmedabad, India (city)
Linux
React
Docker
AWS
Javascript
Prometheus
Big Data Engineer
Terraform
Node.js
Cloud
Grafana
DevOps
Python
Lambda
S3 Bucket
SQL
Git
Posted 27 days ago
SRE and DevOps Engineer
Sustainable Talent
Santa Clara, Argentina (city)
Kubernetes
DevOps
AI
Prometheus
GitHub
SQL
MySql
Jenkins
Cloud
Splunk
Site reliability engineer
Ansible
GitLab
Grafana
Posted 28 days ago
Devops Engineer
Nexxen
Tel Aviv, Israel (city)
React
ElasticSearch
Kubernetes
DevOps
Prometheus
EC2
Datadog
Front-end
Angular
Terraform
Linux
AWS
Lambda
Cloud
S3 Bucket
Python
Node.js
Ansible
Bash
Chef
Grafana
GitLab
Posted 28 days ago
Senior Software Engineer, Developer Productivity
Lightning AI
San Francisco, Argentina (city)
$180k - $215k
Kubernetes
Architect
AI
Azure
Prometheus
GCP
GitHub
Golang
AWS
Docker
Jenkins
CircleCi
Cloud
Developer
PyTorch
GitLab
Grafana
Posted 29 days ago
Senior Data Engineer
HousingAnywhere Group
Rotterdam, Netherlands (city)
Data science
Helm
Postgres
Kubernetes
AI
Open Source
Prometheus
GCP
Terraform
Data Warehouse
Marketing
Docker
Jenkins
Apache
Cloud
Kafka
ML Engineer
Python
Big Data Engineer
Search
Sales
Grafana
Posted 1 month ago
DevOps Engineer III
PriceSpider
United States, Northern America (country)
$100k - $140k
S3 Bucket
DevOps
AWS
Terraform
Redis
Grafana
Bash
Cloud
GitLab
Prometheus
Docker
Azure
Git
ElasticSearch
Datadog
Site reliability engineer
GitHub
Python
Kubernetes
GCP
Posted 1 month ago
Senior Site Reliability Engineer
Roadie
San Francisco, Argentina (city)
$80k - $120k
DevOps
Ruby
Agile
AWS
Docker
Bash
Network
Terraform
Site reliability engineer
Ruby on rails
Python
GCP
Kubernetes
S3 Bucket
Grafana
CircleCi
Objective-C
Golang
Swift
ElasticSearch
Redis
Postgres
Android
Prometheus
Kafka
Helm
Git
React
Posted 1 month ago
Lead DevOps Engineer
Eclipse
United States, Northern America (country)
Blockchain
DevOps
Prometheus
Cloud
Kubernetes
Crypto
Terraform
Solana
AWS
GCP
Posted 1 month ago
Principal Software Engineer - Product / Frontend
Glide
New York, United States (region)
$80k - $100k
SQL
Kafka
GraphQL APIs
Video
Grafana
Datadog
Developer
Engineering Manager
Splunk
Tech lead
Site reliability engineer
Supabase
Apache
Prometheus
REST APIs
Cloud
API
Posted 1 month ago
Published: 2025-07-21  •  Pleasanton, United States (city)
Tensorflow
AI
Big Data Engineer
DevOps
AWS
ML Engineer
Docker
Data science
Kubernetes
PyTorch
Architect
Redis
Azure
Java
Python
Postgres
Back-end
Open Source
Front-end
Cloud
Apache
MongoDB
ElasticSearch
Kafka
Grafana
Prometheus
$225k - $300k
On-site
Full-time

About Tekion:

Positively disrupting an industry that has not seen any innovation in over 50 years, Tekion has challenged the paradigm with the first and fastest cloud-native automotive platform that includes the revolutionary Automotive Retail Cloud (ARC) for retailers, Automotive Enterprise Cloud (AEC) for manufacturers and other large automotive enterprises and Automotive Partner Cloud (APC) for technology and industry partners. Tekion connects the entire spectrum of the automotive retail ecosystem through one seamless platform. The transformative platform uses cutting-edge technology, big data, machine learning, and AI to seamlessly bring together OEMs, retailers/dealers and consumers. With its highly configurable integration and greater customer engagement capabilities, Tekion is enabling the best automotive retail experiences ever. Tekion employs close to 3,000 people across North America, Asia and Europe.

 

On Site: 4 days a week in the Pleasanton, CA office

  * Internally this role is called Associate Principal Machine Learning Engineer

The Machine Learning Architect will help shape the future of the automotive industry. At Tekion, we transform the end-to-end experience of customers in automotive. Are you motivated by solving tough problems, business-focused, and passionate about data-driven decisions? The successful candidate will be a self-starter who is comfortable with ambiguity, has an affinity for building from scratch, demonstrates strong attention to detail, and can work in a fast-paced, complex, and ever-changing environment.

You’ll be able to bring your unique skills and expertise to shape the direction of our Data Science and Machine Learning team. You will have an opportunity to work with state-of-the-art machine learning algorithms on large datasets.  

Key Responsibilities

  • Serve as a liaison between R&D and stakeholders to ensure technical alignment with business objectives.
  • Contribute to the development, scaling, and optimization of ML/AI solutions in alignment with company goals.
  • Support shaping the R&D roadmap by collaborating with stakeholders to understand business needs.
  • Develop APIs and microservices to enable seamless integration of ML models into production systems.
  • Collaborate on designing and building feature pipelines for ML models.
  • Ensure integration of ML models with front-end applications, databases, and back-end services.
  • Collaborate with the team to publish findings and advance field knowledge in key areas.
  • Work with machine learning engineers and cross-functional teams to support team growth and collaboration.
  • Assist in building and maintaining MLOps pipelines for data collection, model training, validation, and monitoring.
  • Implement version control, testing, and model governance practices as part of the ML lifecycle.
  • Identify and address basic bottlenecks in ML models and services, with guidance from senior staff.
  • Apply techniques like model optimization and distributed training under mentorship.
  • Track metrics and support post-deployment optimization efforts.
  • Collaborate with cloud architects and DevOps teams to design scalable ML infrastructure.
  • Assist in the deployment and management of compute and storage resources for ML workflows.
  • Partner with applied scientists and analysts to translate model requirements into production-ready solutions.
  • Contribute to establishing monitoring and alerting systems for deployed models.
  • Assist in creating and maintaining documentation for ML architecture and best practices.
  • Stay informed about ML technologies and suggest relevant enhancements.

Required Qualifications

  • Bachelor’s/Master’s in Computer Science or a related field.
  • 7–10 years of hands-on experience as a Machine Learning Engineer or in a similar role, with a strong portfolio of deployed ML models.
  • Strong problem-solving and analytical skills.
  • Proficient in Python for model development and data manipulation.
  • Familiarity with Java or Scala for production systems and microservices.
  • Experience with messaging queues like Kafka or SQS.
  • Familiarity with MLOps tools and frameworks (e.g., MLflow, Kubeflow, Airflow).
  • Working knowledge of cloud platforms (AWS, Google Cloud, Azure) and containerization (Docker, Kubernetes).
  • Basic understanding of machine learning frameworks (TensorFlow, PyTorch, Scikit-learn).
  • Familiarity with data stores like Elasticsearch, Postgres, MongoDB, or Redis.
  • Experience with data processing tools (e.g., Apache Spark, Kafka).
  • Familiarity with monitoring tools such as Grafana or Prometheus.

Preferred Qualifications

  • Some experience with large-scale production systems or distributed computing.
  • Understanding of data engineering practices and data warehousing solutions.
  • Participation in open-source projects or the ML community is a plus.
  • Strong interpersonal and communication skills for collaborating with cross-functional teams.
  • A proactive mindset, with the ability to learn and adapt to new challenges.

Perks and Benefits

  • Competitive compensation and generous stock options   
  • 100% employer-paid top-of-the-line medical, dental and vision coverage  
  • Great benefits including unlimited PTO, parental leave and free snacks and beverages  
  • The opportunity to work with some of the brightest minds from Silicon Valley’s most dominant and successful companies   
  • Be part of an early stage, hyper-growth start-up with the opportunity to grow and prosper    
  • Work on the latest and coolest technologies – everything is home-grown and built ground-up   
  • A dynamic work environment with a strong sense of community and collaboration   
  • The open and transparent culture that encourages innovation, rewards performance and discourages hierarchy   
  • Exciting opportunities for career growth and development   

Current Tekion Employees – Please apply via Greenhouse Internal Job Board

The salary range describes the minimum to maximum base salary range for this position across applicable US locations. The actual compensation offered may vary from the posted hiring range based on geographic location, work experience, education, licensure requirements and/or skill level and will be finalized at the time of offer.  In addition to the compensation listed, this position may be eligible for equity compensation, and a bonus or commission whereby total compensation may exceed base salary depending on individual or company performance. Your recruiter can share more about the specific salary range during the hiring process.

Base Salary Range

$225,000 - $300,000 USD

Tekion is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, victim of violence or having a family member who is a victim of violence, the intersectionality of two or more protected categories, or other applicable legally protected characteristics. 

For more information on our privacy practices, please refer to our Applicant Privacy Notice.
