You are viewing a preview of this job. Log in or register to view more details about this job.

SRE Engineer

Job responsibility:
Responsible for the reliable, stable and efficient operation of Baidu's large-scale distributed systems and all kinds of online services

-Participate in the design of online systems and various product architectures, lead the implementation of service reliability related automation systems, and meet strict quality and efficiency requirements

- Participate in the overall room construction of Baidu at home and abroad, and provide the best access and experience layout for product users

-Design and develop service operation and maintenance solutions, including website acceleration, continuous delivery, capacity management, elastic calculation, fault analysis, traffic distribution, performance optimization, etc

- Focus on the latest technology trends in the industry, and be responsible for the optimization, evolution and exploration and application of new access technologies for mass traffic access systems

- Focus on industry-related technology dynamics, align mixed technology directions (Docker, etc.), contribute and lead the technology trend of the industry

- Use AI technology to solve the operation and maintenance problems of ultra-large scale Internet applications."

"Deep understanding of the Linux operating system;
Have a good computer network and architecture foundation
-Proficient in at least one major programming language such as C/C++/Python/Go/Shell
- Good logical thinking and analytical skills, keen on solving problems and pursuing perfection
ยท Strong responsibility, initiative, team work and Ownership
- Graduates who are not limited in majors, but prefer computer science, communication, mathematics, etc
- Chinese can be used as the working language(written & oral)
The following conditions are preferred:
- Experience in large-scale distributed programming design and development
- Familiar with network protocols such as TCP/IP, HTTP/HTTPS, proficient in Socket network programming, and capable of tracing network faults."