Senior DevOps Engineer

paper-id

Software Engineering

West Jakarta, Kebonjeruk, West Jakarta City, Jakarta, Indonesia

Posted on May 11, 2026

About Job

The Senior DevOps Engineer at Paper.id plays a critical role in ensuring the smooth operation of our software systems. This position is responsible for bridging the gap between development and operations teams, implementing deployment and operational strategies to improve efficiency and reliability. As a Senior DevOps Engineer, you will be instrumental in driving our digital transformation, leveraging your technical expertise to automate processes, monitor performance, and optimize our infrastructure for scalability and availability.

This role requires a unique blend of technical and business acumen, with a focus on collaboration, problem-solving, and continuous improvement. If you are passionate about DevOps and have a keen eye for detail, we encourage you to apply for this exciting opportunity.

Skills & Qualifications

  • Experience with containerization tools such as Docker and Kubernetes

  • Strong understanding of cloud computing platforms, including AWS and Azure

  • Familiarity with automation tools like Ansible, Puppet, or Chef

  • Proficiency in scripting languages such as Python, Ruby, or PowerShell

  • Knowledge of continuous integration and continuous deployment (CI/CD) pipelines

  • Experience with monitoring and logging tools such as Prometheus, Grafana, and the ELK Stack

  • Ability to analyze complex technical issues and develop effective solutions

  • Proven hands-on experience operating high-throughput production systems (handling millions of requests/day, peak traffic events like end-of-month invoice cycles, payment processing spikes)

  • Performance tuning at scale: load balancing, autoscaling (HPA/VPA), caching layers (Redis/CDN), database connection pooling, and query optimization under heavy concurrent load

  • Capacity planning — forecasting traffic growth, right-sizing infrastructure, and managing cloud cost efficiency at scale

  • Production incident leadership: experience leading root cause analysis (RCA) and postmortems for high-severity outages on customer-facing systems

Responsibilities

  • Design, develop, and implement automation scripts to streamline deployment and operational processes, reducing manual errors and improving efficiency

  • Collaborate with development teams to ensure seamless integration of new features and applications, leveraging DevOps best practices to minimize downtime and maximize availability

  • Develop and maintain CI/CD pipelines to automate testing, building, and deployment of software components, ensuring consistency and reliability across the board

  • Implement monitoring and logging tools to track system performance, identify areas of improvement, and drive data-driven decision-making

  • Work closely with operations teams to develop and maintain infrastructure, ensuring scalability, reliability, and high availability

  • Develop and maintain technical documentation, providing clear guidance and support to colleagues and stakeholders

  • Analyze complex technical issues, develop effective solutions, and implement changes to improve system performance and reliability

  • Provide leadership and mentorship to junior team members, promoting knowledge sharing and skill development across the organization

  • Own the reliability of a multi-service platform serving high-volume B2B fintech traffic (invoicing, payments, tax submission). Define and uphold SLOs across critical services

  • Lead capacity planning and traffic management for peak-load scenarios (month-end, tax season, payment cycles), including load testing strategies and runbook ownership

  • Drive cost optimization without compromising reliability — right-sizing GKE workloads, optimizing data egress, tuning resource requests/limits