Qualifications
- 7+ years in an Information Technology related field with progressive experience in software development, production support, and technical leadership
- Demonstrated experience leading incident response and defect resolution in enterprise application environments
- Experience managing in an organization using Agile and Scrum principles, with the ability to organize and prioritize work in a fast-paced, interrupt-driven environment
- Working experience with development tools and frameworks: Vue.js, Node.js, Java/Spring, Software AG Webmethods, Rabbit MQ, and/or UI Path
- Working experience with DBMSs: Oracle, Mongo, and Postgres
- Working experience with testing frameworks and tools: Vitest, Jest, Playwright, Junit, and/or automated regression testing pipelines
- Working experience with containers (Cloud Foundry, Docker, Kubernetes) and CI/CD tools (GitHub, Jenkins, Codefresh)
- Experience with AI tooling such as Claude, GitHub Copilot, and Microsoft Copilot to enhance development workflows, code quality, and team productivity; demonstrated ability to drive AI adoption across a team and establish norms for effective, responsible use of AI in daily work
- Experience creating bug reports, incident runbooks, testing plans, technical diagrams, and collaborating with other technical and functional teams
- Strong communication skills (written, verbal, and presentation) with a proven ability to communicate incident status, impact, and resolution across technical and business stakeholders
- Result driven and self-motivated with the ability to remain calm under pressure; passionate about application stability and developing people to their full potential through coaching, mentoring, and servant leadership
Role Summary- Successful candidates will have a background in software development, production support, and people leadership. This role is responsible for managing a team of 4-8 direct reports focused on incident response, defect resolution, and test coverage across WWT's quoting, pricing, catalog, and rule engine application portfolio. The ideal candidate thrives under pressure, brings a systematic approach to root cause analysis and bug triage, and creates a high-performing team through active coaching, accountability, and servant leadership. They are passionate about application stability, quality assurance, and developing people to their full potential while fostering an atmosphere of mentoring, collaboration, and continuous improvement.
Key Responsibilities
- Lead and manage a team of 4-8 direct reports dedicated to incident response, bug fixing, and regression testing across quoting, pricing, catalog, and rule engine applications; create a high-performing team through active coaching, accountability, regular effective 1-on-1s, managing difficult conversations
- Own the incident management lifecycle- triage, prioritization, assignment, resolution, and post-incident review — ensuring production issues are resolved within SLA targets and root causes are identified and addressed
- Provide technical guidance and hands-on oversight for defect diagnosis, code fixes, and quality assurance across the application portfolio, ensuring fixes do not introduce regressions
- Build and maintain a robust testing strategy spanning unit, integration, functional, and end-to-end levels for quoting, pricing, catalog, and rule engine systems; drive measurable improvement in test coverage and defect escape rates
- Manage team sprints balancing incident priority, planned defect remediation, and test automation work; negotiate team roadmap and report results within the context of your Agile release train and overall organization
- Coordinate with product owners, architects, and peer development teams to communicate incident impact, recommend systemic fixes, and influence backlog prioritization to reduce recurring defects
- Promote quality assurance and best practices within your team, including leveraging AI tooling such as Claude, GitHub Copilot, and Microsoft Copilot to accelerate root cause analysis, improve test generation, and enhance code quality
- Drive AI adoption within your team's daily workflows- coach team members on effective use of AI assistants (e.g., Claude, GitHub Copilot, Microsoft Copilot) for debugging, log analysis, test case generation, incident documentation, and knowledge synthesis; establish team norms for responsible AI use
- Create and maintain incident runbooks, testing plans, defect trend reports, and technical diagrams; collaborate with other technical and functional teams to ensure alignment across the organization

