Platform Management / SRE Intern at Lincoln Financial supporting Workplace Solutions technology. Collaborating with engineers on cloud infrastructure and automation in a hybrid work environment.
Responsibilities
Assist with monitoring cloud infrastructure health by reviewing dashboards, alerts, and system metrics to identify potential issues
Create and maintain documentation for platform processes, runbooks, and best practices to support team knowledge sharing
Develop automation scripts using Python or Bash to reduce manual tasks and improve operational efficiency
Support incident response activities by gathering data, documenting timelines, and contributing to post-incident reviews
Participate in infrastructure-as-code initiatives by learning and using tools like CloudFormation, Terraform, or AWS CDK under mentorship
Collaborate with senior engineers to implement monitoring improvements using tools such as CloudWatch, Datadog, or Prometheus
Contribute to CI/CD pipeline improvements by testing, documenting, and suggesting enhancements to deployment processes
Analyze system performance data and trending metrics to identify optimization opportunities and present findings to the team
Shadow on-call engineers to learn incident management, troubleshooting methodologies, and communication best practices during critical events
Research emerging technologies and SRE best practices, preparing presentations or documentation to share learnings with the team
Requirements
Currently pursuing or completed a Bachelor's degree in Computer Science, Information Technology, Engineering, or related field
Coursework in operating systems, networking, databases, or distributed systems preferred
Basic proficiency in at least one programming or scripting language (Python, Bash, PowerShell, or similar)
Fundamental understanding of cloud platforms (AWS, Azure, or GCP)
Familiarity with Linux/Unix command line and basic system administration
Experience with Git version control and collaborative development workflows
Understanding of networking concepts (DNS, HTTP, TCP/IP)
Benefits
Potential to turn into a full-time position after graduation
Receive ongoing coaching and mentoring
Add value to the organization through meaningful work
Effective productivity/technology tools and training
Professional Development Workshops
Gain exposure to senior executives and build a valuable network of peers, managers, and leaders across the company
SRE responsible for ensuring reliability and performance of IT systems at a digital transformation company specializing in public sector efficiency. Collaborating on system health, incident response, and automation tasks.
DevOps Senior role at Beyond Soluções managing CI/CD for .NET and Kubernetes applications. Collaborating on cloud solutions while fostering a culture of innovation and quality.
Senior Software Engineer at PayPal managing cloud infrastructure and DevOps solutions. Delivering complete SDLC solutions and guiding engineering teams for scalable and reliable services.
Senior Site Reliability Engineer at Diligent leading reliability, automation, and observability across cloud infrastructure. Build tools for incident response and enhance performance in fast - paced environments.
Perception Deployment Engineer deploying deep learning models on embedded systems at Caterpillar. Collaborating with cross - functional teams for integration and optimization of perception modules in vehicles.
Principal Site Reliability Engineer at AT&T required to design scalable solutions for critical operations with minimal downtime. Collaborating with teams to monitor and improve system performance in cloud environments.
DevOps Engineer managing AI SaaS infrastructure at a high - growth European company. Supporting AI model deployment and ensuring platform security and compliance with multiple systems integration.
Engineering Manager leading teams for observability platforms at LexisNexis. Owns operational excellence across software delivery lifecycle in Raleigh, NC.
Reliability Engineer optimizing site facility infrastructure and utility systems at Roche. Conducting root cause analyses and developing maintenance plans to enhance reliability and efficiency.
DevOps SME designing, implementing, and operating multi - cloud platforms for The Missing Link. Collaborating with engineering, security, and operations teams while embedding DevOps best practices.