Kafka Site Reliability Engineer (SRE)
Kforce
Kforce has a client that is seeking a Kafka Site Reliability Engineer (SRE) in Austin, TX.
Essential Functions:
* Kafka Site Reliability Engineer (SRE) will carry out SRE duties for the Kafka streaming platform
* Have a thorough understanding of the Kafka architecture, including the concepts of producers, consumers, topics, partitions, etc.
* Monitor the platforms and follow runbooks/SOPs to manage platform and application problems
* Learn the cluster maintenance processes and implement changes according to the documented installation and validation plans
* Demonstrate strong troubleshooting and debugging skills: pinpoint and resolve issues, and advise on how to prevent similar problems in the future
* As a Kafka SRE, you will conduct thorough root cause analysis of major production incidents, document the findings for future reference, and put proactive measures in place to improve system reliability
* Automate routine tasks using scripts or automation tools to reduce manual work, lower the chance of human error, and improve system reliability