Perception PointSr. DevOps Engineer
Dec. 2023Tel Aviv District, Israel● Part of a team managing large-scale AWS production infrastructure, focusing on on-the-spot troubleshooting and performance optimization (including 24/6 SRE weekly shifts)
● Convert all DevOps infrastructure running on EKS from Helm to Helmfile.
● Developed a custom Prometheus exporter using Python to enhance monitoring and observability.
● Manage and optimize the performance of a large AWS OpenSearch cluster.
● Large-scale, high-pressure clusters performance tuning (Linux kernel PSI metrics, k8s throttling metrics to ensure optimal efficiency and stability.
● Build staging environments on AWS from scratch (TGW, RabbitMQ, VPC endpoints/peering, EKS, IAM identity providers, S3 storage/web hosting, SSM/Secrets, RDS/DynamoDB, OpenSearch).
● Private/Public Hosted zones Route 53.
● Build and maintain GitHub Actions workflows.