Job openings Site Reliability Engineer
As a Site Reliability Engineer at Tensorflight, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems. You will work closely with our engineering and product teams to design, build, and maintain resilient infrastructure and services. Your primary focus will be on automating processes, optimizing system performance, and implementing strategies for continuous improvement.
Tensorflight is a Polish insurtech startup that is revolutionizing the insurance industry. We are developing an AI building inspector and give underwriters and insurers access to rich and accurate datasets for commercial property to create better insurance products. We automate inspections of commercial, residential and industrial buildings using aerial, satellite, and ground-level imagery. Our system combines derived data with geospatial information and public records to unlock valuable insight about buildings and properties such as replacement value, construction type, or roof characteristics.
Experience and skills
2+ years of experience on SRE position or similar
Experience working with public cloud providers, ideally Google Cloud Platform
Proficiency with containerization and orchestration tooling such as Docker and Kubernetes
Knowledge of Terraform, or other IaaC tool
Experience with building CI/CD pipelines with GitLab or similar
Skillful with one of the programming languages, preferably Python or Golang
Basic knowledge of scripting languages, i.e. bash
Familiarity with observability and monitoring concepts and related tools, preferably few from: Grafana stack, Prometheus, Jaeger or OpenTelemetry
Familiarity with SRE principles and concepts
Very good communication skills
Bonus points for
Experience, or willingness to build and grow SRE/DevOps team in a startup culture
Experience in designing scalable and distributed systems
Background in Linux system administration
Practical experience with Service Mesh tooling like Istio
Experience with management of databases (PostgreSQL)
Experience with MLOps tools e.g. Nvidia Triton, or others
Familiarity with technical security best practices and guidelines, also in terms of certifications like ISO 27001
Managing and extending existing cloud infrastructure running on Google Cloud Platform
Maintaining our cloud-hosted Kubernetes clusters
Monitoring of our system’s components across two regions (America and Europe), and extending our observability capabilities
Troubleshooting infrastructure-related issues and bugs
Participating in defining, maintaining and improving our SLOs
Enhancing scalability and performance of our system to reliably analyze thousands of locations every minute
Improving our CI/CD pipelines and build processes
Getting involved in blame-free incident management, resolutions and postmortems
Participating in system design consulting and reliability reviews with our engineering team
Spreading and sharing the DevOps related knowledge with the rest of the engineering team
Salary: 15,000 PLN - 21,000 PLN + VAT monthly
Contract: B2B contract
Work: hybrid
Paid holidays
Private health care and Multisport fitness card
Working in a Polish startup environment for USA-based insurtech industry
Personal development - conferences, courses
Hybrid model of work, we work 2 days per week in the office in the heart of Warsaw.
Flexible working hours; we start from 9 to 12 am.
Budget for lunch in the office
Integration events and trips
1. Discovery Interview (40 min) - online
2. Technical Interview (120 min) - online
WHY TENSORFLIGHT
Tensorflight is a high-potential start-up backed by leading Silicon Valley, NY, Bermuda, and Australian investors from Fortune 500, including insurance industry leaders like QBE, and highly respected insurance partners such as Nephila and Hudson Structured. Our clients include top commercial carriers in the world like Zurich Insurance and a few more from S&P 500.
We’re a group of creative and curious people who passionately move this project forward. We’ve got a friendly, informal, and open-minded startup atmosphere and we’re all united by a culture rooted in trust and authenticity. Our teams are led by talented and dynamic leaders whose care and integrity brought the whole organization to the path of success.
Work type
Remote workJob position
Site Reliability EngineerEmployment type
Full-Time15000 - 21000 PLN net / month
Contact person
Your potential manager