Linux Systems Engineer responsible for managing Linux systems and enhancing cloud infrastructure. Collaborating across teams to ensure performance, security, and compliance standards are met.
Responsibilities
Monitor system and application health using tools such as Datadog, Prometheus, Grafana, or Azure Monitor.
Respond to alerts in real time, performing root cause analysis and executing immediate remediation when required.
Restart or recover Linux services, containers, and background processes safely and efficiently.
Analyze logs and system metrics to detect trends, prevent outages, and drive long-term stability improvements.
Participate in on-call rotation and defined incident response procedures.
Maintain, configure, and optimize Linux systems (e.g., Ubuntu, CentOS, or Red Hat) across production and development environments.
Manage, configure, and troubleshoot Apache web servers, including virtual hosts, SSL/TLS certificates, modules, and performance tuning.
Perform updates, patching, and configuration changes in line with change management and compliance standards (HIPAA, PCI, SOC 2).
Automate maintenance and monitoring routines using Bash, Python, or Ansible.
Support operational readiness for application releases and infrastructure changes.
Work closely with application, platform, and development teams to ensure seamless deployments and stable operations.
Collaborate regularly with the Managed Service Provider (MSP) to coordinate incident response, validate system health, and ensure SLA alignment.
Participate in joint troubleshooting sessions with the MSP to identify root causes and implement permanent resolutions.
Provide detailed system insights and maintain accurate communication channels between internal IT leadership and the MSP.
Partner with Engineering and Platform teams to improve alerting, logging, and observability.
Document all processes, incident reports, and runbooks in Confluence or equivalent repositories.
Ensure system configurations align with internal security policies and compliance standards.
Maintain logging and access controls consistent with HIPAA, SOC 2, and PCI DSS expectations.
Apply the principle of least privilege and use secure methods for credential and key management.
Requirements
3+ years of hands-on experience managing and troubleshooting Linux-based production environments (e.g., Ubuntu, CentOS, Red Hat), supporting at least 50+ servers or VMs in enterprise or high-availability settings.
Strong knowledge of Linux internals, including:
Process management (ps, top, htop, etc.)
Systemd service configuration and management
Journald log review and tuning
Performance tuning using tools like vmstat, iostat, sar, strace.
Direct experience configuring and managing Apache web servers, including:
Monitoring and alerting experience with at least one major tool (list in resume preferred):
Datadog, Prometheus, Nagios, Zabbix, Azure Monitor, or similar
Automation experience, including:
Bash scripting for recurring tasks (share sample scripts if applicable)
Python scripting or Ansible playbooks for config management, deployments, or maintenance
Experience with network and DNS troubleshooting (e.g., dig, nslookup, tcpdump, iptables, or netstat)
Understanding of load balancing concepts (e.g., HAProxy, Nginx, or cloud-native load balancers)
Demonstrated incident response or root cause analysis contributions (please highlight real examples in resume or cover letter)
Strong documentation habits: e.g., created or maintained runbooks, internal wikis, or system diagrams
Exposure to Azure (preferred) or other cloud platforms (AWS, GCP); ideally involved in VM provisioning, resource scaling, or hybrid infrastructure setup
Familiarity with containerized environments, including:
Senior Telecom Systems Engineer at BHG Financial optimizing communication platforms and supporting enterprise telecom systems. Collaborating with teams to enhance workflows and deliver seamless telecom solutions.
System Engineer focusing on AOI/EFEM in the semiconductor industry. Integrating equipment and developing optical inspection concepts at the Jena location.
UGV Systems Architect responsible for the system architecture of autonomous vehicles at Daimler Truck. Integrating UGV solutions into existing vehicle platforms and digital ecosystems.
Project Manager/ System Engineer developing mobile ground control stations at INFINTEQ GmbH. Responsible for technical design, integration, and collaboration across various teams for UAV systems.
Ausbildung als Fachinformatiker - Systemintegration bei ISGUS. Planung und Konfiguration von IT - Systemen sowie technische Unterstützung für Kunden in der Region Leonberg.
Ausbildung Fachinformatiker - Systemintegration bei ISGUS, einem führenden Systemhaus für Personal - und Zeitmanagement. Planung, Konfiguration und Betreuung von IT - Systemen bei Kunden.
Project Manager/System Engineer involved in technical planning for security systems. Join a growing team at Funkwerk Security Solutions to work on innovative projects.
IT Specialist responsible for designing, implementing, and maintaining IT systems for a leading German IT service provider. Focused on integration, security, and operational support.
System Engineer managing application operations and system architectures for Agrarmarkt Austria. Collaborating closely with software development and working in a team of eight.
System Engineer focusing on network technology with Aruba products and firewall solutions. Collaborating with clients and teams for infrastructure and network projects in Austria.