Edge Systems Engineer responsible for edge computing systems reliability and observability. Bridging hardware, software, and networking disciplines to deliver maintainable solutions.
Responsibilities
Deploy, configure, and validate edge computing systems across lab, field, and production environments.
Integrate and optimize system components spanning embedded hardware, networking, containerization, and cloud APIs.
Collaborate with software, infrastructure, and field teams to identify and resolve integration and runtime issues.
Ensure reliable device-to-cloud communication for telemetry, control, and analytics workloads.
Perform end-to-end triage across hardware, network, and application layers.
Use Linux CLI tools, container inspection, and telemetry analysis to isolate and correct complex system failures.
Reproduce field issues in controlled environments and contribute findings back into engineering processes.
Develop reusable diagnostic tools and test harnesses to validate system resilience.
Build and maintain monitoring, and recovery automation (e.g., Bash, Python, Go).
Contribute to orchestration frameworks such as Docker, K3s, or Kubernetes for edge deployments.
Enhance observability through metrics, dashboards, and alerting (Datadog, Grafana, Prometheus, etc.).
Identify opportunities for self-healing and reliability automation.
Author and maintain runbooks, standard operating procedures, and knowledge base articles.
Document troubleshooting procedures and design patterns to enable Tier 1 and Tier 2 support efficiency.
Participate in post-incident reviews and translate lessons learned into durable operational improvements.
Partner with software engineers, DevOps, and operations teams to drive incident resolution.
Act as a 24x7 escalation SME for complex edge or connectivity issues.
Leverage escalation learnings to define and drive system reliability and lifecycle management initiatives.
Requirements
Bachelor’s degree in Computer Engineering, Computer Science, Information Systems, or equivalent work experience
3-5 years minimum relevant experience
Strong proficiency with Linux systems and command-line diagnostics.
Experience with containerized environments (Docker, K3s, or Kubernetes).
Understanding of IoT or distributed systems architectures, including secure communication (TLS/mTLS).
Solid grasp of networking fundamentals: IP, routing, VPNs, DNS, and cellular/LTE connectivity.
Scripting ability in Bash, Python, or Go for automation and tooling.
Demonstrated ability to troubleshoot across hardware, network, and software boundaries.
Excellent written communication skills; comfortable producing procedural documentation.
Benefits
Adhere to all NOV HSE policies, utilize appropriate PPE, and actively participate in monthly safety meetings.
Cloud Systems Engineer managing cloud infrastructure projects for federal clients. Supporting stability and reliability of cloud - based systems and networks with a focus on innovative solutions.
Senior Systems Engineer supporting DIAs National Digital Exploitation and OSINT Center for a tech company serving public sector clients. Manage systems engineering tasks and lead a scrum software development team.
Principal Engineer defining and architecting distributed AI systems across heterogeneous compute platforms at Intel. Focusing on dynamic execution and optimization of large - scale AI computation graphs.
Lead Supply Management for Micron's semiconductor materials and delivery in Penang, Malaysia. Collaborate cross - functionally for global supply fulfillment and manage supplier relationships effectively.
System Engineer for network technology in telecommunications company offering tailored internet services. Responsible for firewall administration, network components, and strategic project management.
Principal Platform Systems Engineer leading test automation for maritime AI solutions. Focused on building infrastructure for cloud, edge compute, and embedded systems.
System Engineer focusing on Citrix and Azure Virtual Desktop at DATAGROUP in Kaunas. Develop technical solutions, implement them independently, and ensure smooth operations.
Principal Software Systems Engineer supporting Northrop Grumman's Sentinel program based in Colorado Springs or Huntsville. Involves collaboration in software development and best practices.
Systems Engineer specializing in automated warehouse systems for key warehouse automation projects at Dematic. Overseeing technical integration and collaborating with engineering teams to deliver innovative solutions.
Senior Business Systems Analyst focusing on Oracle Revenue Management Cloud Service. Ensure reliability and scalability of critical systems while collaborating with technical teams.