Who we are:
LEXI is a Silicon Valley start-up in the Enterprise IoT space that has already raised several million dollars from a long list of experienced angel investors.
In addition, the company’s executive team includes many of the top IoT executives in the industry, such as multiple board members of the key wireless protocol standard setting bodies, product managers with over 70 IoT patents to their name and sales executives who have took their former company from $0 to $200M/annual revenue in 5 years (in IoT no less).
We are consistently winning major customers against our competitors, whose market caps start in the hundreds of millions of dollars and go into the billions. In short, you will be joining a winning team that is poised to grow the company into the unicorn ranks in just a few short years.
As with all our team members, you will be granted shares to participate in this huge growth trajectory.
Who we want:
Senior DevOps Engineer
Responsibilities:
- Deployment, automation, management, and maintenance of cloud-based production system.
- Have a deep understanding of containerization principles and be able to successfully deploy and scale applications in complex cloud environments.
- Ensuring availability, performance, security, and scalability of production systems.
- Management of creation, release, and configuration of production systems.
- Evaluation of new technology alternatives and vendor products.
- System troubleshooting and problem resolution across various application domains and platforms.
- Pre-production acceptance testing for quality assurance.
- Provision of critical system security by leveraging best practices and prolific cloud security solutions.
- Providing recommendations for architecture and process improvements.
- Definition and deployment of systems for metrics, logging, and monitoring on our platform.
- Designing, maintenance and management of tools for automation of different operational processes.
- Design and implement testing infrastructures and write code to validate a large scale application deployment, partner with software developers to create/code test tools and implement automation concurrent with product development.
- Collaborate with Software Development teams in maintaining consistency and testability. Worked with developers on all aspects of quality in software, database, migration and maintenance routines
- Collaborate with Software Quality Assurance, System Engineering and Architecture, Program Management, and Software Development teams
Required Skills:
- Over
5+ years of experience
in provisioning, operations, and management of cloud environments. Experience in creation of highly automated infrastructures. - Experience in code development in at least one high-level programming language.
- Experience with multi-tier architectures: load balancers, caching, web servers, application servers, databases, and networking.
- Expert level knowledge of cloud platforms (such as
AWS, Azure, or Google Cloud
). - Hands on experience in developing Continuous Integration / Continuous Delivery process and expert at implementing pipelines in Jenkins for respective deployment environments.
- Hands on experience with automation / CI-CD tools like
Jenkins
. - Hands on experience in setting up end to end CI/CD pipeline for Java based application using
Jenkins, Ansible, GIT, Maven and Jfrog
. - Experience in branching, staging and maintaining the version across the environments working on Software Configuration Management
(SCM) GIT and GitHub/Bitbucket
. - Hands on experience with microservices integration with the build and deployment pipeline.
- Expert knowledge of Software containerization platforms like Docker and container orchestration tools like
Kubernetes
. - Expert knowledge of Building and managing containerized applications, creating Docker images, optimizing resource utilization, and troubleshooting containerization issues.
- Expert knowledge of deploying and managing
Kubernetes clusters
, configuring and scaling applications using Kubernetes, managing Helm charts, and ensuring high availability and performance. - Hands on experience on various AWS services like EC2, Auto Scaling, Load balancing, S3, EBS, IAM, Route53, VPC, RDS, CloudWatch, Lambda, Event Bridge, AWS Config, AWS Cloud Trail etc.
- Hands on experience on various DevOps tools like GIT, Terraform, Jenkins, Ansible, Maven, Prometheus and Grafana
- Hands on experience in
Linux and scripting
and programming languages (such as Python, Bash, or PowerShell) - Setup static websites using
AWS S3, CloudFront and Route53.
- Hands on experience with Database Management, replication, master slave setups.
- Knowledge of
MQTT protocol, broker.
- Setup proactive monitoring for applications using CloudWatch, Prometheus and Grafana and Elastic Search - Kibana
- Hands-on experience of Continuous monitoring, logging, and analytics to identify performance bottlenecks, security vulnerabilities, and areas for improvement, implementing monitoring and logging solutions (e.g., Prometheus, ELK stack) to enable proactive issue detection and resolution.
- Setup Infrastructure for new services/application as per requirement using
Terraform
- Update/Modify current infrastructure using Terraform.
- Automate repetitive task/tool using automation playbook (
Python Boto3/Ansible/ Terraform
) - Experience with securing cloud infrastructure and meet compliance requirements.
- Disaster Recovery design and deployment across regions a plus.
- Work closely with our Cloud, Embedded Engineering and QA teams during high severity incidents (P1/P2).
- Expert level knowledge of operating system administration.
Preferred Skills:
- Strong networking knowledge.
- Knowledge of
IoT
a big plus. - Good oral and written communication skills in English
- Must be able to Interface with multiple cross functional teams.