Jobgether
About Jobgether
Jobgether is a platform connecting job seekers with opportunities in various fields.
Job Summary
We are currently looking for a Linux Systems Administrator (Red Hat) AI ML Based in Malta. This role is a senior-level position focused on managing and optimizing large-scale Linux infrastructure in both production and development environments.
Key Responsibilities
- Manage, monitor, and maintain Ubuntu and Red Hat Linux servers across production and staging environments.
- Perform system upgrades, kernel updates, patch management, and performance tuning to ensure optimal reliability.
- Implement and enforce security policies, user access controls, and backup/recovery strategies.
- Troubleshoot hardware, OS, and network-related issues; ensure minimal service disruption.
- Maintain configuration management and deployment pipelines using Ansible, Puppet, or similar tools.
- Monitor system health, resource utilization, and AI/ML workloads to guarantee uptime and performance.
- Collaborate with DevOps, Cloud, and Software teams for environment provisioning and infrastructure scaling (AWS, Azure, or on-prem).
- Participate in capacity planning, disaster recovery, and incident response activities.
- Maintain detailed documentation, SOPs, and audit reports to support compliance and operational transparency.
Requirements
- 5+ years of hands-on experience in Linux system administration (Ubuntu & Red Hat).
- Strong expertise in Bash scripting and automation tools such as Ansible, Terraform, or Python basics.
- Experience with monitoring tools like Nagios, Zabbix, or Prometheus.
- Solid understanding of networking fundamentals, including DNS, DHCP, NFS, SSH, and firewalls.
- Knowledge of virtualization and containerization technologies (VMware, KVM, Docker, etc.).
- Troubleshooting skills for system logs, kernel issues, and service failures.
- Familiarity with version control systems (Git) and CI/CD pipeline environments.
- Exposure to cloud platforms (AWS, GCP, Azure) is advantageous.
- Red Hat Certified System Administrator (RHCSA) or Engineer (RHCE) preferred.
- Experience with high-availability clusters, load balancing, and RAID management is a plus.
- Excellent communication skills.
To apply for this job please visit jobs.lever.co.