About GDIT
GDIT is committed to delivering clarity through our cloud solutions and to providing meaningful work.
Job Summary
Own the opportunity as a Senior Azure Engineer and help ensure the mission is never interrupted. Your work will be an important part of transforming our clients for the modern age and helping them face any obstacle.
Key Responsibilities
- Ensure operational stability, availability, performance, and scalability of cloud-hosted systems across production and development environments supporting multiple agile teams.
- Provide real-time monitoring, alerting, incident response, and health checks for infrastructure and applications across all cloud layers (OS, app, DB).
- Implement and maintain dashboards, visualizations, and reports for system health, event management, and cost optimization using native Azure tools.
- Manage cloud resource thresholds and automate capacity planning, forecasting, and resource optimization strategies.
- Perform incident and event management (SIEM) operations, and support issue diagnosis, resolution, and reporting including RCA documentation.
- Track, document, and report monthly issues, including system performance, stability, ticket volumes, and time-to-resolution metrics.
- Monitor resource utilization (CPU, memory, disk space) across all deployed VMs, containers, and PaaS components.
- Contribute to the implementation of the ServiceNow Management, Instrumentation and Discovery (MID) servers.
- Support deployment automation and ensure systems are resilient, repeatable, and scalable via Infrastructure as Code (IaC).
- Execute system health checks daily or at an agreed frequency, and maintain operational runbooks and SOPs.
Required Qualifications
- Bachelor’s degree in Computer Science, Software Engineering, or related field.
- 5 to 8+ years of experience in IT systems engineering, systems development, and systems coding and programming.
- Deep expertise with Azure services, including monitoring, logging, compute, storage, and networking.
- Proficiency in Infrastructure as Code (IaC) tools such as Terraform or Azure Bicep.
- Hands-on experience with monitoring and APM tools such as Azure Monitor.
- Solid understanding of incident response, change management, and ITIL-based operational support.
- Familiarity with CI/CD toolchains and automation platforms (GitHub Actions, GitLab).
- Strong scripting skills (Python, PowerShell, Bash) for automation and orchestration.
To apply for this job, please visit gdit.wd5.myworkdayjobs.com.