Onsite Lead System Engineer – Site Reliability Engineering

Posted 4 hours ago

Apply now

About the role

  • Lead System Engineer maintaining Oracle EBS/ERP applications at AT&T. Providing production support and troubleshooting for supply chain processes and integrations.

Responsibilities

  • Provide day-to-day production support for Oracle EBS/ERP (Procurement, OM, Inventory); resolve incidents/requests within SLAs and ensure service stability.
  • Troubleshoot end-to-end issues across business workflows, configuration, master/transaction data, batch/concurrent programs, and interface processing using strong analytical and SQL skills.
  • Support supply chain execution flows including Procure-to-Pay, Order-to-Cash, receiving, picking/packing/shipping, inventory transactions (transfers, adjustments, reservations, cycle counts), and reconciliation of on-hand balances.
  • Perform data triage and correction (where approved) and partner with functional leads for process/config changes.
  • Monitor and support 3rd-party integrations (e.g., ASN, receipts, shipment confirmations, inventory updates, returns/RMAs).
  • Partner with integration/middleware teams and vendors to resolve cross-system defects, mapping issues, sequencing/latency problems, and data mismatches.
  • Analyze and remediate interface errors in queues/tables/logs; validate reprocessing/replay and prevent duplicates.
  • Lead incident response and minimize downtime.
  • Build and maintain monitoring, alerts, and dashboards for proactive issue detection.
  • Create run books and automate operational tasks to improve efficiency.
  • Collaborate with development teams to define and meet non-functional requirements (reliability, performance, scalability).
  • Conduct blameless postmortems and drive continuous improvements.
  • Support release management, capacity planning, and security best practices.
  • Provide 24x7 on-call support as needed.

Requirements

  • 7+ years in Development, Functional and maintenance experience for Oracle Applications (EBS/Fusion – AR, AP, FA, PO, INV, PA, OM, Planning, etc.) - with SRE mindset
  • Proficiency in SQL/PLSQL, and Oracle technologies, Java/J2EE, scripting (Python, Shell), and automation (AI based automation a plus).
  • Strong skills with observability tools (Dynatrace, AppDynamics, Splunk, ELK, Grafana).
  • Experience with containerization (Docker, Kubernetes) and cloud services (Azure).
  • Experience with Middleware / Integration technologies – Oracle SOA/OIC, Mulesoft, Kafka/JMS, EDI.
  • Excellent problem-solving and communication skills.
  • Bachelor’s degree in Computer Science, IT, or related field.

Benefits

  • Medical/Dental/Vision coverage
  • 401(k) plan
  • Tuition reimbursement program
  • Paid Time Off and Holidays (based on date of hire, at least 23 days of vacation each year and 9 company-designated holidays)
  • Paid Parental Leave
  • Paid Caregiver Leave
  • Additional sick leave beyond what state and local law require may be available but is unprotected
  • Adoption Reimbursement
  • Disability Benefits (short term and long term)
  • Life and Accidental Death Insurance
  • Supplemental benefit programs: critical illness/accident hospital indemnity/group legal
  • Employee Assistance Programs (EAP)
  • Extensive employee wellness programs
  • Employee discounts up to 50% off on eligible AT&T mobility plans and accessories, AT&T internet (and fiber where available) and AT&T phone.

Job title

Lead System Engineer – Site Reliability Engineering

Job type

Experience level

Senior

Salary

$158,200 - $237,400 per year

Degree requirement

Bachelor's Degree

Location requirements

Report this job

See something inaccurate? Let us know and we'll update the listing.

Report job