Application Monitoring & Managed Services (Senior/Team Leader)
About this position
Responsibilities
• Monitoring System Expertise: Configure, manage, and optimize application performance monitoring tools (e.g., Dynatrace, Datadog) to ensure seamless operation.
• Collaboration: Work closely with cross-functional teams, including development, operations, and client stakeholders, to align monitoring practices with business goals.
• Compliance and Security: Ensure all monitoring tools and managed services comply with industry security standards and regulatory requirements.
• Documentation: Develop and maintain comprehensive documentation for monitoring configurations, incident resolutions, and operational processes.
• Innovation: Research and implement emerging technologies and best practices to enhance application monitoring capabilities.
• Mentorship: Provide guidance and technical support to junior engineers, fostering a culture of continuous learning and development.
• Client Interaction: Support client-facing activities, including performance reviews, reporting, and ensuring client satisfaction with managed services.
• Incident Response: Design and implement robust frameworks to effectively address and resolve cloud platform disruptions and security breaches, ensuring minimal operational impact and swift recovery.
Requirements
• Education: Bachelor’s degree in computer science, Information Technology, Software Engineering or a related field.
• Relevant certifications such as ITIL, or Dynatrace Professional Certification preferred.
• Experience: Minimum of 3 years of progressive experience in DevOps or Infrastructure and application monitoring roles.
• Technical Skills: Advanced knowledge of application performance monitoring tools such as Dynatrace, or Datadog.
• Strong understanding of application performance management (APM) methodologies.
• Familiarity with scripting and automation to enhance monitoring efficiency.
• Experience with incident management and root cause analysis.
• Knowledgeable in operating systems (Linux, Windows), server and infrastructure management across on-premises and cloud platforms (AWS, Azure), with a solid understanding of networking protocols (HTTP, TCP/IP, DNS).
• Knowledgeable in CI/CD pipelines and automation tools such as Jenkins and Ansible, with a solid foundation in DevOps processes.
• Knowledgeable in programming languages such as Python, Java, or JavaScript for integration and automation, with a strong foundation in database management.
• Skills and Competencies: Exceptional analytical and problem-solving skills.
• Strong technical expertise in application monitoring and performance management.
• Effective communication and collaboration abilities with stakeholders and team members.
• Ability to manage priorities and deliver results.