Key responsibilities:
- Regularly evaluate the IT system in the area of responsibility, in terms of technical, audit and security requirements, work rules and procedures, availability and continuity.
- Develop, build, test, deploy, run and update permanently the operating procedures and work instructions for monitoring in case of services damage at the level of the entire business (Enterprise).
- Implements active monitoring identifies and tracks relevant performance indicators.
- Analyze the incidents and problems occurred in the IT Infrastructure, to identify the cause of the errors and find the solution.
- Ensure appropriate availability, reliability, and scalability of applications and services.
- Develop and update operational procedures applicable in case of planned and unplanned switching of the systems under responsibility as well as the intervention in case of needed system recoverability in the event of failure.
- Provide support in functionality and acceptance testing during the implementation of application and platforms.
- Ensure excellent performance of the organization’s applications and platforms.
- Regularly monitor and reconcile the performance of the applications to predefined performance thresholds and take appropriate actions where required.
Requirements:
- Experience in Java and HTTP (IIS, Apache);
- Technical knowledge of MSSQL, IIS, WebSphere Portal, WebSphere Application Server, WebSphere MQ, WebSphere Process Server / ESB, JBoss, Appian
- Strong background in Windows, UNIX and Linux
- Experience with data bases administration: Oracle, SQL, PL/SQL
- Experience in web services with APIS, SOAP, REST
- Groovy scripting
- Tools: GIT, Jira, Jenkins
- Experience with Docker, Kubernetes, OpenShift
- Knowledge of monitoring solutions: Grafana, Nagios, Dynatrace, Zabbix