This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, government, academia, research and manufacturing.
"DDN's A3I solutions are transforming the landscape of AI infrastructure." - IDC
"The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments" - Marc Hamilton, VP, Solutions Architecture & Engineering | NVIDIA
DDN is the global leader in AI and multi-cloud data management at scale. Our cutting-edge data intelligence platform is designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence.
Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management.
Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage.
Job Description
Job Summary:
We are looking for a Lead Software Engineer for our Pune-based GRIDScaler team.
Responsibilities
- Design, develop, and maintain software libraries and features in Golang, Python, and Rust to enhance the GRIDScaler product.
- Build and maintain CLI and REST API interfaces for internal and customer-facing tools.
- Develop and integrate telemetry, observability, and monitoring solutions to improve product reliability and customer support.
- Write and maintain unit tests, functional tests, and automation scripts to ensure robust deliverables.
- Collaborate with Engineering Managers and a geographically distributed team to understand requirements and translate them into scalable software solutions.
- Work with QA and DevOps teams to configure and maintain the Jenkins-based build and CI/CD pipelines.
- Assist in debugging and resolving customer escalations, including performance and stability issues.
- Contribute to product documentation and knowledge sharing within the engineering team.
Qualifications
- BS/MS in Computer Science, Computer Engineering, or equivalent degree/experience.
- 7+ years of experience working in Linux system environments, with a solid understanding of Linux internals.
- 5+ years of experience in Python and Golang development; Rust experience strongly preferred.
- 3+ years of experience with enterprise-class or HPC storage systems and/or distributed systems.
- Hands-on experience building telemetry and monitoring tools (e.g., Prometheus, Grafana, OpenTelemetry, custom monitoring agents).
- Strong experience developing and maintaining command-line tools and REST APIs.
- Familiarity with CI/CD systems such as Jenkins, GitLab CI, or similar.
- Knowledge of parallel file systems, particularly IBM Spectrum Scale (GPFS) or Lustre, is highly preferred.
- Experience with bash, Lua, and automation scripting is a plus.
- Prior exposure to Agile software development processes.
- Strong debugging, troubleshooting, and problem-solving skills in complex, distributed environments.
- Excellent time management, communication, and collaboration skills in a fast-paced, globally distributed team.