Critical Environment Technician
Posted: 2 days ago
Job Description
As a Critical Environment Technician (CET) in Microsoft’s Cloud Operations & Innovation (CO+I) team, you will help to maintain the critical infrastructure that keeps our datacenters up and running. This could include performing basic monitoring and inspection, performing various types of maintenance, participating in equipment troubleshooting, enforcing all safety and security requirements, and developing methods of operating procedures (MOPs) and standard operating procedures (SOPs).
Duties and Work Process
Equipment and Systems Operations
- Works on tasks (e.g., making rounds, initial assessments on equipment performance) as assigned with varying levels of supervision, sometimes minimal. Supports senior colleagues as an assistant. Builds knowledge in operating equipment and systems within a set discipline (e.g., electrical, mechanical, controls). Operates all systems and equipment in a safe and professional manner.
- Performs monitoring and physical inspection of critical environment-related facility equipment (e.g., controls, heating, ventilation, and air conditioning [HVAC], mechanical systems), building, and grounds for unsafe or abnormal conditions. Understands critical system alarms for single discipline(s) of equipment and their meanings, and engages with appropriate escalation processes or procedures. Recognizes circumstances where execution would be considered safe to proceed. Performs various inspections and validations of equipment performance. Monitors the performance of maintenance and operations of equipment (e.g., electrical, mechanical, fire/life safety) within the data center from central monitoring locations (i.e., Facility Operations Centers). Escalates per applicable policies and standards. Utilizes telemetry, control systems, and other platforms to monitor site status and analyze past and current events and other processes, and can identify critical alarms. Advises less experienced colleagues on issues found while monitoring applicable CE systems.
- Utilizes the internal computerized maintenance management system (CMMS) to track all equipment assets and to complete work order requests for maintenance work. Tracks hours for performed tasks within applicable task management systems. Adds required data, documents and logs changes, and maintains procedures related to building management systems and reports. Properly records spare equipment and parts utilization within maintenance work orders.
- Safely responds to and stabilizes abnormal conditions within the required timeframes for various abnormal facility events within the data center with minimal supervision, utilizing emergency operating procedures (EOPs). Gathers necessary information and participates in the creation of incident timelines/data, root-cause analyses, and/or action items following an abnormal condition as required. Identifies appropriate parties to mitigate incidents as they occur. Assists in developing emergency operating procedures (EOPs), methods of procedure (MOPs), standard operating procedures (SOPs), and digital methods of operating procedures (DMOPs) in relation to incidents. Assists in providing emergency monitoring response to irregular or malfunctioning conditions.
Equipment and Systems Maintenance
- Performs various types of maintenance (e.g., planned, predictive, corrective) and repairs (e.g., basic replacements, changing of parts) for specific disciplines and equipment for which they have been trained, following methods of procedure (MOPs), standard operating procedures (SOPs), and digital methods of operating procedures (DMOPs) with minimal supervision, in consideration of Task Hazard Analysis (THA), Method Statement of Work (MSOW), or varying permit requirements. Communicates and/or escalates maintenance activities per established process and procedure. Prioritizes maintenance activities as required and/or appropriate. Documents tasks or issues during maintenance activities within appropriate systems per process and procedure as needed. Performs lower-level maintenance with minimal supervision, and may resolve high-level issues with support from more experienced colleagues. Follows recommended maintenance schedules. Maintains all systems and equipment in a safe and professional manner and understands levels of risk (LORs) associated with varying types of maintenance.
- Performs troubleshooting of equipment and systems independently within their trained discipline(s). Documents issues found in the troubleshooting process within appropriate systems per process and procedure as needed. Ensures equipment and system settings are consistent with established parameters and designs. Communicates and/or escalates troubleshooting issues or anomalies with assistance from more experienced colleagues. Has a hands-on understanding of how equipment works within the disciplines in which they have been trained.
- Provides necessary escort to third-party contractors, sub-contractors, vendors, and service providers on site based on the appropriate procedure levels of risk (LOR). Takes part in getting third-party work underway (e.g., making sure systems are properly energized/deenergized), ensuring the work is started and completed in a safe manner in accordance with standard practices, procedures, and Authority Having Jurisdiction (AHJ) regulations. Ensures work performed by suppliers/vendors is performed to scope, all documentation is performed correctly, and escalates as appropriate. Recognizes circumstances when to stop supplier/vendor work to address potential and/or identified concerns. Coordinates per appropriate LOR applicable to preventative and/or corrective maintenance. Identifies and recommends procedure corrections if/when errors are detected or when appropriate. Reviews and completes appropriate work orders to support approval of vendor supplier field service reports or invoices.
- Prepares and submits required reports (e.g., turnover, preventative maintenance [PM]) as assigned following preexisting scripts and templates. Develops methods of operating procedure (MOPs), standard operating procedures (SOPs), and/or digital methods of operating procedures (DMOPs) for appropriately leveled procedures to ensure safe and reliable execution. Documents completed work using approved tools and procedural templates for more experienced technician review. Completes and provides coaching to support less experienced technicians for mandatory, technical, and procedural training assignments.
- Develops an understanding of method statement of work (MSOW) documents. Coordinates activities and associated schedules with contractors. Performs inspections of equipment in a facility. Participates in testing and commissioning activities.
Necessary Skills and Qualifications
- Occasional climbing of ladders.
- Frequent climbing of stairs and/or ramps.
- Prolonged standing.
- Occasional lifting 50lbs. / 22.5kg.
- Occasional push or pull 50-75lbs. / 22.5-34kg. with assistive device.
- Normal visual acuity (near, far and peripheral with correction).
- Normal color vision for electrical work.
- Normal is defined via standard medical terms and applicable criteria.
- Embody our culture and values
Work Experience Requirements
- 1+ year(s) technical services experience (e.g., high-availability assembly/manufacturing/critical infrastructure environments such as data centers, military, nuclear plants, oil and gas refineries, hospitals, pharmaceutical, industrial construction)
- OR associate's degree or technical trade certification (e.g., military, trade school) or higher-equivalent education
- OR equivalent experience.