Observability Engineer designing and maintaining observability solutions for cloud infrastructure at PGTEK. Collaborating with teams to enhance system performance and resolve operational issues.
Responsibilities
design, implement, and maintain observability solutions with OpsRamp as the core platform
integrate OpsRamp with additional monitoring and observability tools (e.g., Prometheus, Datadog, Elastic Stack)
ensure data accuracy, quality, and integrity across observability systems
use OpsRamp data to troubleshoot performance issues, application errors, and operational incidents
collaborate with development and operations teams to identify root causes and implement fixes
participate in incident response activities to accelerate issue resolution
continuously assess and optimize the performance and effectiveness of the OpsRamp platform
stay current on OpsRamp features, enhancements, and industry best practices
proactively identify improvement opportunities and implement enhancements
Requirements
Strong understanding of cloud platforms: AWS, Azure, and/or GCP
Experience with container technologies: Docker and Kubernetes
Proficiency in scripting languages such as Python, Go, or Bash
Experience with SQL and NoSQL databases
Solid understanding of networking concepts and protocols (TCP/IP, HTTP)
OpsRamp experience strongly preferred
Experience with at least one additional observability tool (e.g., Datadog, New Relic, Prometheus, Grafana, Elastic Stack, Splunk)
Strong communication and collaboration skills
Excellent analytical and problem-solving abilities
Ability to work independently and within a team environment
Passion for continuous learning and improvement
U.S. Citizen with active Secret clearance
Required Certification: DoD 8570/8140 compliant security certification, such as Security+ or higher (e.g., CISSP, CASP)
Benefits
comprehensive PPO medical coverage with access to a Health Savings Account (HSA) option
a vision plan
dental insurance with the base dental plan option paid for by PGTEK
life insurance
short- and long-term disability
critical illness insurance with premiums covered
matching 401(k) plan
discount on pet insurance through ASPCA Pet Insurance
Employee Assistance Program available at no cost to all employees
generous PTO and holidays
Education Assistance Program available after 12 months of employment
Staff Embedded Controls Engineer developing centralized vehicle motion control systems at Ford. Joining an agile team focused on electric vehicle innovations and safety-critical embedded systems.
Brake Cooling CFD Engineer developing simulations for automotive brake components and systems. Collaborating on design and testing based on fluid dynamics expertise.
Embedded Controls Engineer joining Ford's team to develop algorithms and lead integration of body control systems. Collaborating across teams to enhance vehicle functionalities and safety standards.
Command Center Monitoring Engineer responsible for monitoring Poly devices, resolving technical issues, and ensuring customer satisfaction through proactive solutions.
Smart Manufacturing Engineer developing and maintaining Python test automation for satellite production systems. Supporting AIT engineers and ensuring critical tests are stable and repeatable.
Senior Process Control Engineer supporting plant operations in a unique rare earth supply chain company. Engaging in process improvement and capital projects with a focus on PLC/SCADA infrastructure.
Project Engineer managing structural design and client interactions. Coordinates with teams on project-related tasks, ensuring timelines and budgets are met.
EDA Tools Software Engineer at Intel focusing on innovative software tool development for hardware design processes. Collaborating with cross-functional teams to enhance technology solutions.
Senior Density Fill Development Engineer at Intel creating algorithms and tools for semiconductor design processes. Collaborating across teams to enhance manufacturing and product innovation.
Designs and implements IoT control solutions across multi-site environments for Keedian. Focuses on practical, cost-efficient solutions that ensure operational reliability.