Site Reliability Engineer

Warsaw / Remote work

Job openings     Site Reliability Engineer

Site Reliability Engineer

As a Site Reliability Engineer at Tensorflight, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems. You will work closely with our engineering and product teams to design, build, and maintain resilient infrastructure and services. Your primary focus will be on automating processes, optimizing system performance, and implementing strategies for continuous improvement.

Tensorflight is a Polish insurtech startup that is revolutionizing the insurance industry. We are developing an AI building inspector and give underwriters and insurers access to rich and accurate datasets for commercial property to create better insurance products. We automate inspections of commercial, residential and industrial buildings using aerial, satellite, and ground-level imagery. Our system combines derived data with geospatial information and public records to unlock valuable insight about buildings and properties such as replacement value, construction type, or roof characteristics.

Requirements


Experience and skills

  • 2+ years of experience on SRE position or similar

  • Experience working with public cloud providers, ideally Google Cloud Platform 

  • Proficiency with containerization and orchestration tooling such as Docker and Kubernetes

  • Knowledge of Terraform, or other IaaC tool

  • Experience with building CI/CD pipelines with GitLab or similar

  • Skillful with one of the programming languages, preferably Python or Golang

  • Basic knowledge of scripting languages, i.e. bash

  • Familiarity with observability and monitoring concepts and related tools, preferably few from: Grafana stack, Prometheus, Jaeger or OpenTelemetry

  • Familiarity with SRE principles and concepts

  • Very good communication skills


Bonus points for

  • Experience, or willingness to build and grow SRE/DevOps team in a startup culture

  • Experience in designing scalable and distributed systems

  • Background in Linux system administration

  • Practical experience with Service Mesh tooling like Istio

  • Experience with management of databases (PostgreSQL)

  • Experience with MLOps tools e.g. Nvidia Triton, or others

  • Familiarity with technical security best practices and guidelines, also in terms of certifications like ISO 27001

Responsibilities


  • Managing and extending existing cloud infrastructure running on Google Cloud Platform

  • Maintaining our cloud-hosted Kubernetes clusters

  • Monitoring of our system’s components across two regions (America and Europe), and extending our observability capabilities

  • Troubleshooting infrastructure-related issues and bugs

  • Participating in defining, maintaining and improving our SLOs

  • Enhancing scalability and performance of our system to reliably analyze thousands of locations every minute

  • Improving our CI/CD pipelines and build processes

  • Getting involved in blame-free incident management, resolutions and postmortems

  • Participating in system design consulting and reliability reviews with our engineering team

  • Spreading and sharing the DevOps related knowledge with the rest of the engineering team

Offer


Salary:  15,000 PLN - 21,000 PLN + VAT monthly

Contract: B2B contract

Work: hybrid


Benefits:

  • Paid holidays

  • Private health care and Multisport fitness card

  • Working in a Polish startup environment for USA-based  insurtech industry

  • Personal development - conferences, courses 

  • Hybrid model of work, we work 2 days per week in the office in the heart of Warsaw.

  • Flexible working hours; we start from 9 to 12 am.

  • Budget for lunch in the office

  • Integration events and trips


Our interview process:

1.  Discovery Interview (40 min) - online

2.  Technical Interview (120 min) - online


WHY TENSORFLIGHT


Tensorflight is a high-potential start-up backed by leading Silicon Valley, NY, Bermuda, and Australian investors from Fortune 500, including insurance industry leaders like QBE, and highly respected insurance partners such as Nephila and Hudson Structured. Our clients include top commercial carriers in the world like Zurich Insurance and a few more from S&P 500. 


We’re a group of creative and curious people who passionately move this project forward. We’ve got a friendly, informal, and open-minded startup atmosphere and we’re all united by a culture rooted in trust and authenticity. Our teams are led by talented and dynamic leaders whose care and integrity brought the whole organization to the path of success. 

Work type

Remote work
Onsite: Warsaw

Job position

Site Reliability Engineer

Employment type

Full-Time

15000 - 21000 PLN net / month

Contact person

Your potential manager

Jakub Waszkiewicz
Site Reliability Engineer

Share this Job

   

Warsaw

Recruitment powered by tomHRM