What You'll Work On Design and develop a next-generation scalable observability platform for modern cloud-native and hybrid infrastructures that works in tandem with AI agents. Create intelligent AI agents to analyze logs, traces, and metrics in real time, delivering automated insights and remediation. * Build scalable and fault tolerant AI agent frameworks Engineer and optimize large-scale analytics pipelines to process high-velocity telemetry data. Build resilient distributed systems with high reliability, performance, and fault tolerance. Implement and fine-tune LLMs for natural language querying and automated troubleshooting. Partner with ML engineers to streamline AI model deployment and management. What We're Looking For Strong programming skills in Python and Golang (experience with Rust is a plus) Track record of building distributed systems and large-scale analytics pipelines Hands-on experience with cloud infrastructure (AWS, GCP, or Azure) and Kubernetes Deep understanding of observability technologies (Prometheus, OpenTelemetry, Grafana, Elastic, etc.) Knowledge of LLMs , AI agents , agent frameworks liks langchain, autogen is a plus Experience with stream processing and real-time data processing frameworks Proficiency in database technologies (SQL & NoSQL, Time-Series DBs) 5+ years** of relevant experience Bachelor's degree in Computer Science, Engineering, or related field (Master's/PhD is a plus)
You should have experience with Java and web application development using J2EE technologies, Java scripting, HTML, and MVC pattern. Solid experience with object-oriented development concepts is required. You should also have experience using Java web frameworks such as Spring Boot, Hibernate, JPA, and Angular, with experience in Typescript - Angular being preferred. Solid experience with MFE and Micro Services is a must, and understanding on Gravitee is a plus. Experience with PL/SQL, SQL, and Oracle is preferred. A good understanding of the Agile Scrum methodology and SDLC process is necessary. Knowledge of Service Oriented Architecture (SOA), Distributed Software Architecture, and Web Application Architecture is expected. Experience with Apache, Tomcat, Maven, JIRA, and GIT is required. Experience with RDBMS Oracle is preferred, with understanding of any cloud frameworks - Azure being preferred. UNIX/Linux command level experience and build automation knowledge is essential. Familiarity with web services (SOAP and REST) is also necessary for this role.,