Senior Cloud Systems Engineer providing Databricks administration for government/financial services. Responsible for platform management, security, and automation in a hybrid setup.
Responsibilities
Administer Databricks accounts and workspaces across SDLC environments.
Standardize configuration, naming conventions, and operational practices.
Configure and maintain clusters, compute policies, SQL warehouses, runtime versions, libraries, jobs, repositories, and workspace settings.
Monitor platform health through operational dashboards, alerts, and monitoring tools.
Maintain operational documentation, runbooks, and platform procedures.
Implement and enforce least-privilege access controls across platform resources.
Manage identity integrations including SSO, SCIM provisioning, and role-based access control.
Administer service principals and group-based access permissions.
Enable audit logging and support security monitoring and compliance reviews.
Implement secure secrets management and connectivity patterns.
Administer Unity Catalog including metastores, catalogs, schemas, and tables.
Manage data ownership, permission grants, and governance policies.
Configure and maintain external locations and storage credentials.
Support data classification, tagging, and lineage integrations with governance teams.
Coordinate with cloud and network teams to establish secure connectivity patterns.
Implement storage access controls and secure object storage integrations.
Support cloud logging, monitoring, and security integration with enterprise platforms.
Automate platform configuration and administration using APIs, CLI tools, and Infrastructure-as-Code frameworks.
Implement CI/CD pipelines for deploying jobs, notebooks, and configurations across environments.
Implement Databricks Asset Bundles (DABs) for standardized deployment workflows.
Reduce configuration drift through automated deployment processes.
Implement cost controls such as cluster policies and auto-termination rules.
Analyze usage metrics and provide recommendations to improve cost efficiency.
Monitor and optimize SQL warehouse performance and cluster autoscaling.
Implement Delta Lake optimization strategies including OPTIMIZE, VACUUM, and Z-ordering.
Administer Delta Live Tables pipelines and support data engineering teams.
Monitor pipeline health and address job failures or performance issues.
Support integrations with business intelligence tools and metadata catalog systems.
Assist with troubleshooting data pipeline and query performance issues.
Maintain platform configuration documentation and governance standards.
Develop onboarding materials and self-service guides for platform users.
Support user onboarding and workspace access provisioning.
Provide guidance to platform users and development teams on best practices.
Conduct capacity planning and forecast resource usage based on platform growth.
Monitor concurrent workloads and resource allocation.
Recommend scaling strategies to support increased platform usage.
Ensure platform stability during peak usage periods.
Requirements
Bachelor’s Degree in Computer Science, Information Technology, Engineering, or a related field, or equivalent practical experience.
7+ years of experience in cloud infrastructure, data platform administration, or enterprise platform operations.
3+ years of hands-on experience administering Databricks environments.