Please click on the Apply to verify the status of jobs posted more than 15 days ago, as they may have expired. Similar Jobs
Job Description
Company Description
Alkimi Exchange is a decentralised programmatic ad exchange restoring the value exchange between advertisers, publishers, and users. Our custom blockchain infrastructure on Sui delivers a fast, scalable, and transparent solution with 0% fraud and low transaction fees. We process 25,00030,000 queries per second (QPS) with sub-100ms latency requirements for real-time bidding auctions.
Learn more at www.alkimi.org.
About This Role
This is not a typical Senior DevOps position. Were hiring a senior engineer who is ready to grow into a DevOps Architect within 612 months and well actively build the conditions for that transition.
Youll join as a hands-on senior individual contributor with immediate ownership of our cloud and on-premise infrastructure, while simultaneously shadowing and co-designing the architectural and people decisions that will define Alkimis next stage of scale. By the time youre ready to step up, the title and responsibility will already be yours in practice.
If youve been the most capable person in your current team but havent had the platform to prove it at an architectural or leadership level, this is your opportunity.
What Youll Own From Day One
- Platform reliability Maintain 99% uptime and meet SLAs across all environments; own incident response, on-call rotation, and post-mortems
- Cost optimisation Drive a 2030% reduction in infrastructure spend through architectural improvements, rightsizing, and automation
- High-throughput infrastructure Design and operate deployment architecture for 25,00030,000 QPS systems with sub-100ms latency requirements
- Multi-cloud IaC Manage AWS, DigitalOcean, and GCP environments using Terraform, Terragrunt, and Ansible
- CI/CD & automation Build and maintain pipelines, monitoring systems, and automation across distributed microservices
- Data systems health Troubleshoot and tune Kafka, RabbitMQ, ClickHouse, Elasticsearch, and MySQL in production
- Application performance Diagnose and resolve Node.js, Python, and Java/Spring Boot application bottlenecks under load
- Security & compliance Implement and uphold best practices across OAuth, OIDC, SSO, disaster recovery, and data privacy
What Youll Be Building Toward (Architect Scope)
- Team architecture Input into hiring, mentoring, and structuring a growing infrastructure engineering team; transition into formal people leadership as the team scales
- Technical vision Co-own the infrastructure roadmap with the CTO; define standards, patterns, and guardrails that the whole engineering org follows
- Cross-functional influence Act as the DevOps voice in product and engineering planning cycles; translate business requirements into scalable infrastructure decisions
- Architectural governance Lead the design and review process for new systems, integrations, and migrations; own the DR and business continuity strategy
- Organisational maturity Drive improvements in engineering culture around reliability, observability, and operational excellence (SLOs, error budgets, runbooks)
Required Skills & Experience
7+ years in DevOps or Infrastructure Engineering roles, including 2+ years operating high-throughput systems (10,000+ QPS). You should be at or near the ceiling of your current role and ready to operate at the next level.
Infrastructure & Cloud
- Production-grade Infrastructure as Code with Terraform, Terragrunt, and Ansible
- Kubernetes and Docker at scale across complex microservices architectures
- Deep AWS expertise (VPC, EC2, ECS, Fargate, S3, Glacier, RDS, Route 53, CloudFront, Lambda, API Gateway, CloudWatch); DigitalOcean, Azure, or GCP experience also valued
- Advanced Linux system administration (RHEL, Ubuntu, Amazon Linux) and networking
Data & Messaging Systems
- Kafka consumer/producer optimisation, lag management, high-volume tuning
- RabbitMQ cluster management, message routing, Kubernetes failure debugging
- ClickHouse - production operations and query optimisation at billions-of-records scale (or similar columnar/time-series database experience)
- MySQL administration, replication, and backup/recovery
- Elasticsearch cluster health management and bulk indexing optimisation
Looking to get Placed? Try our Placement Guarantee Plan
Development & CI/CD
- CI/CD tooling: GitHub Actions, Jenkins, GitLab CI, ArgoCD (GitOps preferred)]
- Python (required), Shell scripting (required); Rust or Go strongly preferred
- JVM profiling, GC tuning, memory leak detection in Java/Spring Boot environments
- Strong understanding of microservices architectures and API design patterns
- Comfortable operating within agile and rapid-iteration engineering cultures
Observability & Incident Management
- Prometheus, Grafana, and the ELK stack (Elasticsearch, Logstash, Kibana, Filebeat)
- Systematic debugging under production load CPU, memory, network, latency
- RED and USE metrics methodology; SLO/SLA definition and error budget management
- Structured post-mortem culture and preventive engineering mindset
Leadership & Communication (Critical for Progression)
- Demonstrated ability to influence engineering decisions without formal authority
- Experience mentoring junior or mid-level engineers
- Ability to communicate infrastructure trade-offs clearly to non-technical stakeholders
- A track record of driving projects end-to-end, including stakeholder alignment
- Blockchain infrastructure experience/knowledge is added advantage (Optional)
Skills
BlockchainPythonDebuggingDevopsJavaKubernetesLinuxNode.jsShell ScriptingApiCloudIf an employer asks you to pay any kind of fee, please notify us immediately. Jobaaj does not charge any fee from the applicants and we do not allow other companies also to do so.
Important dates & deadlines?
Application Deadline
22 Jul 26, 02:41 PM IST
Similar Jobs
View All

