Senior Software Engineer - CTJ - Poly
![]() | |
![]() United States, Virginia, Reston | |
![]() | |
OverviewMicrosoft has an exciting opportunity for a Senior Software Engineer in the Azure Edge + Platform Silver Infrastructure Operations and Insights Team. The Operations and Insights team develops and maintains software platforms for specialized cloud datacenter technicians, partner engineering groups, and other stakeholders at Microsoft to execute workflows and operate Microsoft's datacenters and secure work environments as well as the big data platform for physical asset and virtual resource inventory foundational to security and compliance. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesBuild out software and infrastructure to improve the reliability, scalability, and efficiency of our services. Applies debugging tools and examines logs, telemetry, and other methods to verify assumptions through writing and developing code proactively before issues occur and reactively as issues occur for products. Conducts retrospective debugging of solutions to identify root causes of problems. Engagein improving service deployment and testing processes by designing, and implementing automated tools and solutions. Communicate and collaborate with engineers in the broader scope of the product to deliver solutions specifically for Azure Government, Top Secret, and Secret clouds. Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions. Alerts stakeholders as to status and initiates actions to restore system/product/service for simple problems and complex problems when appropriate. Responds within Service Level Agreement (SLA) timeframe. Drives efforts to reduce incident volume, looking globally at incidences and providing broad resolutions. Escalates issues to appropriate owners. Drives efforts to integrate instrumentation for gathering telemetry data on system behavior such as performance, reliability, availability, usage, and safety mechanisms. Drives sustaining feedback loops from telemetry resulting in subsequent designs. Creates outputs of telemetry such as notifications or dashboards. Responsible for the execution & adherence of Software Engineering Standards for Data Archiving and retention Participate in project team activities and contribute to documentation requirements consistent with methodology Support team efforts in adopting relevant new technologies, tools, methods and processes from Microsoft and industry. Stay educated on existing, emerging technologies and do POCs to evaluate technology fits for customer needs. Embody ourcultureandvalues |