Are you interested in building large-scale distributed infrastructure for the cloud? Oracle's Cloud Infrastructure team is building new File Storage Services that operate at high scale in a broadly distributed multi-tenant cloud environment. Our customers run their businesses on our cloud, and our mission is to provide them with best-in-class file storage capabilities in conjunction with other compute, storage, networking, database, and security offerings. We're looking for hands-on engineers with a passion for solving difficult problems in distributed systems, virtualized infrastructure, and highly available services. Joining Oracle will give you the opportunity to design and build innovative new systems from the ground up and operate services at scale. Engineers at every level can have significant technical and business impact while delivering critical enterprise-level features.
- As a Software Development Engineer, you will work as part of a highly collaborative team to build new regions at scale and develop features and tools for the File Storage Service while operating and growing the current service offering.
- You should understand distributed systems and have a strong knowledge of software architecture.
- You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn. Duties and tasks are varied and require independent judgment.
- You will perform software development tasks associated with developing, debugging, and designing software applications. We expect the candidate to participate in design activities, work with Product Management on service definition, and establish operational best practices across the organization.
- We also expect this role to have an impact on new AI/ML service offerings as well as on enhancements to existing storage services.
- BS or MS degree or equivalent experience relevant to functional area.
Basic Qualifications:
- 5+ years of experience developing new tools and features.
- Strong knowledge of Python, infrastructure as code (e.g., Terraform, Ansible, Chef), operating systems (Linux), and distributed systems fundamentals.
- Experience writing automation code and frameworks for reliable monitoring systems, alerts, etc., with code development in Python and Bash.
- Deep knowledge of Linux host-based networking, Linux performance/stability/troubleshooting, and Linux-based build and analysis tools.
- Deep knowledge of networking services (TCP/UDP, DNS, DHCP, HTTP, etc.).
- Familiarity with CI/CD processes and tools.
- Strong troubleshooting, debugging, and performance tuning skills.
- Strong understanding of operations, change management, project execution, and technical documentation.
- Experience diagnosing issues with platform services and live customer issues, and providing effective solutions with a focus on eliminating outages.
- Monitor the OCI File Storage Service and proactively debug operational issues.
- Performance and scalability testing, and performance tuning skills.
- 5+ years of experience in production software development with Agile methodologies.
- 5+ years of systems administration on 2000+ server environments.
- Bachelor's degree in Computer Science and Engineering or a related engineering field.
Preferred Qualifications:
- Knowledge of configuration management solutions such as Chef, Puppet, Terraform, and Ansible.
- Experience with devising, managing, and extending monitoring solutions for large scale environments.
- Experience in managing host, network, or storage virtualization technologies.
- Experience with deployment processes across a fast-growing number of regions.
- Experience in fleet automation and management solutions.
- Experience or knowledge of Cloud technologies.
Responsibilities:
- Design, develop, troubleshoot, and debug software programs for building new regions at scale, as well as applications, tools, and networks.
- As a member of the software engineering division, you will apply your knowledge of software architecture to perform software development tasks associated with developing, debugging, and designing software applications.
- Duties and tasks are varied and require independent judgment.
- Participate in building new OCI regions.
- Be the escalation point of contact for FSS Platform components.
- Provide guidance and support as an SME to mitigate and unblock operational incidents and deployments.
Career Level - IC2