The Director Enterprise Systems will lead site remediation engineering (SRE) focusing on the monitoring, maintenance, and continuous improvement of ATI’s applications and infrastructure within the IT portfolio. This role spans all ATI infrastructure and applications across Corporate, Clinical, Revenue, Operations, and supporting platforms, Microsoft Azure & O365. The Director will oversee incident management, conduct root cause analysis, and collaborate with IT Teams to enhance the performance and reliability of all systems. Additionally, the role requires proactive engagement with various ATI technical systems for discovery and planning, ensuring attention to detail and thoroughness in addressing architectural needs across systems impacted by ATI products. As a thought leader and technical expert, the Director Enterprise Systems will provide strategic direction while driving technical excellence.
• Lead, manage, and support a team of 4-6 members fostering a collaborative and high performance work environment
• Oversee production application/infrastructure monitoring efforts to improve operational stability and performance
• Own the monitoring of critical daily alerts and determine needs and process to remediate and optimize
• Provide weekly/monthly/quarterly updates of application health and performance for IT and executive review
• Collaborate with other technical leads across ATI to develop proactive monitoring standards and best practices
• Lead the coordination of incident responses leveraging proactive monitoring and reactive alerting to address issues swiftly
• Manage and resolve Level 3 types of application, operational, and infrastructure incidents
• Act as a key leader in incident resolution, developing and implementing plans to prevent future occurrences
• Drive efforts to improve application/infrastructure performance based on metrics from Application Performance Monitoring (APM), Infrastructure monitoring and other tools
• Develop and execute strategies to deliver ongoing improvements, proactive enhancements, and elimination of technical debt
Minimum Education
Preferred:
• Bachelor’s Degree in business, computer science, engineering, or related field.
• High School diploma acceptable with +12 years additional related experience
Minimum Experience
Required:
• 10+ years of IT experience working in a technical leadership role
• 10+ years of experience developing / supporting technical monitoring and Site Remediation Engineering (application & infrastructure)
• 10+ years of experience supporting incident management and complex IT incident problem solving
• 10+ years in a high demand, dynamic 24x7 application environment
• Demonstrated experience in leading large, highly visible projects that impact an entire organization
• Demonstrated experience working with technical solutions, L3 incident management, and incident methodology
• Experience working with all levels of an organization as a technical leader
• Experience building out architectural standards for an organization
Preferred:
• Experience in healthcare technology
• Experience with monitoring tools (Datadog, PRTG, etc.)
• Experience with incident management and tools (ServiceNow, PagerDuty, etc.)
• Experience in cloud environments (Azure, AWS, etc.)
• Experience with APM tools (Application Insights, New Relic, etc.)
Knowledge Skills and Abilities
• Knowledge of SRE best practices and principles
• Leadership skills in a fast-paced dynamic environment
• Extremely high-quality communication skills
• Architecture expertise including ability to act as technical lead of a project and/or team
• Ability to host meetings and act as the decision-maker on a technical level for important and highly visible projects
• Ability to meet with business and IT stakeholders to drive consensus, decisions and information gathering, along with product demonstration presentations
• Ability to strategize changes based on company goals
• Extreme attention to detail with a focus on creating a more reliable and performant applications & infrastructure
• Technical understanding in multiple languages and methodologies including .NET/C#, SQL, API layers
• Strong thinker for scalability/maintainability/availability planning
Software Powered by iCIMS
www.icims.com