Design and maintain monitoring and alerting solutions for infrastructure, application performance, and user experience.
Implement automation tools and processes for routine tasks, scalable infrastructure, and seamless deployments.
Ensure reliability, availability, and performance of applications and services, minimizing downtime and optimizing response times.
Lead incident response, including identification, triage, resolution, and post-incident analysis.
Conduct capacity planning, performance tuning, and resource optimization in collaboration with development and operations.
Collaborate with security teams to implement best practices, perform vulnerability assessments, and ensure compliance.
Manage deployment pipelines, release processes, and configuration management for consistent, reliable deployments.
Identify and drive improvements in reliability, performance, and efficiency through data and root cause analysis.
Create and maintain documentation, runbooks, and knowledge base articles, promoting knowledge sharing.
Develop and test disaster recovery plans, backup strategies, and failover mechanisms.
Collaborate with development, QA, DevOps, and product teams to align on reliability goals and incident response.
Participate in on-call rotations, providing 24/7 support for critical incidents and coordinating resolution and follow-up.
Requirements
4+ years of hands-on experience in RPGLE, CLLE, Java, COBOL, or other programming languages.
3+ years working on large-scale, client-facing, enterprise production software.
Strong English communication and collaboration skills.
Proficiency in modern development architectures (web, API), cloud platforms (AWS, Azure, Google Cloud), and infrastructure as code (Terraform, Ansible).
Experience with monitoring and logging tools (Prometheus, Grafana, DataDog, New Relic, Splunk, SumoLogic, ELK Stack), including dashboards and alerts.
Skilled in incident management (response, triage, RCA, post-mortem) and troubleshooting complex technical issues.
Proficiency in scripting languages (Python, Bash) and automation tools.
Experience with CI/CD pipelines (Jenkins, GitLab CI/CD, Azure DevOps).
Familiarity with Application Performance Monitoring (APM) and Real User Monitoring (RUM) tools.
Commitment to continuous learning, adaptability, and operational excellence.
Experience building FinTech, payment, or banking systems, including API design and third-party integration.
Familiarity with Agile environments, especially with bi-monthly production releases.
Knowledge of FIS products/services and the broader Financial Services Industry.
Experience with development tools: V7.4, Eclipse, Visual Studio, Azure DevOps, MDCMS, Git, Microsoft Office (Visio, RDi, X Analysis, Hawkeye, CheckMarx).
Understanding of OS/400 and Windows 11 operating systems.
Sr. Software Engineer developing solutions to improve student outcomes and organizational effectiveness for educational organizations. Involves full application development lifecycle from coding to debugging and integration solutions.
Technical Lead developing applications using Rockwell FactoryTalk Optix for industrial automation solutions. Collaborating on software architecture and mentoring team members while focusing on MES integration.
Lead Salesforce Developer driving the design and development of Salesforce solutions for Sales Cloud and CPQ. Mentoring junior developers while collaborating with stakeholders on robust solutions.
Workday Financial Integration and administration lead handling integrations in NYC. Leading integration tasks and managing financial systems in a hybrid work environment.
EBS PM Sr Technical Lead managing full lifecycle Oracle ERP implementation projects. Requires extensive experience in Oracle ERP and project management.
Senior Fullstack Developer responsible for application development in cloud environments. Focus on C#, .NET Core, and frontend technologies like Angular in a team setting.
Technical Lead focusing on enterprise - level integrations with Oracle Fusion Cloud systems and offering technical support. Documentation and collaboration with teams are key aspects of the role.
Performance Engineer specializing in Oracle Database administration and performance tuning for enterprise applications. Collaborating with teams to troubleshoot and enhance database performance.
Full Stack Developer developing applications in a microservice architecture for a client in the security sector. Collaborating with an agile team to tackle complex analysis tasks and solutions.
Lead Software Engineer guiding technical decisions and operational excellence for FanDuel product verticals. Collaborating across teams while mentoring engineers in a hybrid work environment.