You are viewing a preview of this job. Log in or register to view more details about this job.

Data Center Maintenance and Operations

ob Responsibilities

  • Central monitoring of global data centers, IDC emergency response, and organization to ensure timely response.
  • 7x8 hours facility monitoring, screen watching, emergency handling via platform alarms and event organization, and reporting to superiors.
  • Assist in monitoring the alarm system of the FOC, automation monitoring, and optimization of operation management platforms, supervising the daily work quality of data center operations, and promoting continuous improvement in operational quality.
  • Responsible for daily checks on data center events, changes, maintenance, and other operational tasks, promptly identifying issues and supervising the implementation of improvement measures.
  • Regularly inspect and analyze core operational data, promptly identify common risks, and assist in improving work quality and daily operations.
  • Full process management of events, control of change processes, discovery and blocking of anomalies, global alarm management and problem resolution assist in intelligent construction of operational platforms.
  • Assist in managing global IDC's major protection, network sealing preparation, coordinating the division of labor and cooperation among internal teams, and establishing good collaborative relationships.
  • Assist the facility monitoring manager in completing other temporary tasks, and perform work duties diligently and responsibly.

Skill Requirements

  • Fluency in both English and Chinese communication, with experience in multi-team collaboration, meeting organization and discussions, and facility maintenance and repair management.
  • Have a technical background in IDC infrastructure, good awareness of infrastructure operations, familiar with the operating logic of core systems such as electricity and HVAC, and ability to quickly identify alarm information and its risks.
  • Possess good data analysis capabilities, able to analyze massive data on platforms and identify risks, assisting in management decisions from a data perspective.
  • Be serious and responsible at work, honest and reliable, have a strong sense of responsibility, and possess good team spirit and communication skills.

Key Assessment Dimensions

Emergency response timeliness and quality of the emergency process (including completeness of information summarization, transparency of transmission timeliness, and closure rate of event handling), 7x8 hours monitoring management quality (accuracy of alarm tagging, completeness of operation conditions, and completeness of handover, accurate data analysis).

岗位职责

全球数据中心集中监控,IDC 故障应急处理及响应组织,确保及时响应。
1、7*8小时设施监控盯屏,通过平台告警、事件组织故障应急处理、并向上级汇报

2、协助FOC监控告警体系、自动化监控、运营管理平台优化建设机房运维日常工作质量监督、推动运维工作质量持续提升

日常运营工作

1、负责数据中心事件、变更、维护等运维工作质量日常检查,及时发现问题并监督改进措施落地

2、定期检查分析核心工作运营数据,及时发现共性风险、协助属地提升工作质量日常运营工作

3、事件全流程管理、变更过程管控异常发现及阻断、全局告警管理及问题处置、协助运营平台智能化建设

4、协助管理全球IDC重保、封网筹备组织工作,协调内部团队之间的分工与合作,建立起良好的协作关系

5、协助设施监控主管完成其他临时性工作,尽职尽责的完成工作任务

能力要求

1、电气、暖通空调或相关领域的学位或同等学历。

2、中英文沟通,具有多团队协作、会议组织与讨论、设施维护与维修管理经验。

2、具有IDC基础设施技术背景,较好的基础设施运维意识,熟悉电力、暖通等核心系统的运行逻辑,能够快速识别告警信息及其风险。

3、具备较好的数据分析能力、能够对平台海量数据进行封分析并识别风险,从数据维度辅助管理决策。

4、对工作认真负责、诚实可靠、责任心强,并具有良好的团队合作精神以及沟通能力。

关键考核维度

应急处理响应时效及应急过程质量(包括信息汇总完整度、透传时效性、事件处理闭环率

等)7*8小时监控管理质量(告警标记准确性、运行情况完整度及交接完整性、数据分析准确楚

Job Type: Full-time

Benefits:

 

  • 401(k)
  • 401(k) matching
  • Dental insurance
  • Health insurance
  • Life insurance
  • Paid time off
  • Vision insurance

 

Schedule:

 

  • 8 hour shift

 

Language:

 

  • Mandarin (Required)

 

Ability to Commute:

 

  • Santa Clara, CA (Required)

 

Work Location: In person