As a Data Engineer specializing in geospatial data, your primary responsibility is to design, build, and maintain the data infrastructure and systems that handle geospatial information effectively. You will work closely with cross-functional teams, including data scientists, geospatial analysts, and software engineers, to ensure that geospatial data is collected, processed, stored, and analyzed efficiently and accurately.

Key Responsibilities:

Data Pipeline Development: Design and implement robust data pipelines to acquire, ingest, clean, transform, and process geospatial data from sources such as satellites, aerial imagery, drones, and geolocation services.

Data Ingestion, Storage, and Extraction: Develop data models and schemas tailored to geospatial data structures, ensuring optimal performance and scalability for storage and retrieval operations.

Spatial Database Management: Manage geospatial databases, including traditional relational databases (e.g., PostgreSQL with the PostGIS extension) and NoSQL databases (e.g., MongoDB, Cassandra), to store and query spatial data efficiently.

Geospatial Analysis Tools Integration: Integrate geospatial analysis tools and libraries (e.g., GDAL, GeoPandas, Fiona) into data processing pipelines and analytics workflows to perform spatial analysis, visualization, and geoprocessing tasks.

Geospatial Data Visualization: Collaborate with data visualization specialists to create interactive maps, dashboards, and visualizations that effectively communicate geospatial insights and patterns to stakeholders (frontend-facing work).

Performance Optimization: Identify and address performance bottlenecks in data processing and storage systems, using techniques such as indexing, partitioning, and parallelization to optimize geospatial data workflows.

Data Quality Assurance: Implement data quality checks and validation procedures to ensure the accuracy, completeness, and consistency of geospatial data throughout its lifecycle.
Geospatial Data Governance: Establish data governance policies and standards specific to geospatial data, including metadata management, data privacy, and compliance with geospatial regulations and standards (e.g., INSPIRE, OGC).

Collaboration and Communication: Work with cross-functional teams to understand geospatial data requirements and provide technical expertise and support. Communicate findings, insights, and technical solutions effectively to both technical and non-technical stakeholders.

Requirements:

Must-have:

Bachelor's or Master's degree in Computer Science or a related field.

3-5 years of experience in the field, including deploying pipelines to production.

Strong programming skills in languages such as Python, Java, or Scala, with experience in geospatial libraries and frameworks (e.g., Rasterio, Shapely).

Experience with distributed computing frameworks (e.g., Apache Spark), workflow orchestrators (e.g., Apache Airflow), and cloud data platforms (e.g., AWS, Azure, Google Cloud Platform).

Familiarity with geospatial data formats and standards (e.g., GeoJSON, Shapefile, KML) and geospatial data visualization tools (e.g., Mapbox, Leaflet, Tableau).

Strong analytical and problem-solving skills, with the ability to work with large and complex geospatial datasets.

Good-to-have:

Proficiency in SQL and experience with geospatial extensions for relational databases (e.g., PostGIS).

Excellent communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.

Experience with geospatial libraries such as Rasterio, Xarray, GeoPandas, and GDAL.

Knowledge of distributed computing frameworks such as Dask.

Familiarity with STAC, GeoParquet, and other cloud-native geospatial tools.

Experience productionising data science code.
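To give candidates a concrete flavour of the day-to-day geoprocessing this role involves, here is a minimal, illustrative sketch using Shapely (one of the libraries listed above). The area of interest and point records are hypothetical, and a real pipeline would of course do this at scale with GeoPandas or PostGIS rather than a plain list comprehension:

```python
from shapely.geometry import Point, Polygon

# Hypothetical quality check: flag records whose coordinates fall outside
# a rectangular area of interest (AOI) before they enter the pipeline.
aoi = Polygon([(0, 0), (10, 0), (10, 10), (0, 10)])  # toy AOI, arbitrary units

records = [Point(2, 3), Point(15, 5), Point(7, 7)]   # toy point geometries
inside = [p for p in records if aoi.contains(p)]     # valid records
outside = [p for p in records if not aoi.contains(p)]  # flagged for review
```

The same containment predicate is what a PostGIS `ST_Contains` query or a GeoPandas spatial join expresses over whole tables.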
The role of a Data Engineer for Geospatial Data is crucial in enabling organizations to leverage the power of geospatial information for applications including urban planning, environmental monitoring, transportation, agriculture, and emergency response.

Benefits:

Medical health cover for you and your family, including unlimited online doctor consultations

Access to mental health experts for you and your family

Dedicated allowances for learning and skill development

Comprehensive leave policy with casual leaves, paid leaves, marriage leaves, and bereavement leaves

Twice-a-year appraisal

Job Type: Full-time
Work Location: In person
We are looking for a Machine Learning Operations Engineer to join our team to design, build, and integrate MLOps for large-scale, distributed machine learning systems, with a focus on cutting-edge tools, distributed GPU training, and faster research experimentation.

Roles & Responsibilities:

Architect, build, and integrate the end-to-end lifecycle of large-scale, distributed machine learning systems (MLOps) using cutting-edge tools and frameworks.

Develop tools and services for the explainability of ML solutions.

Implement distributed cloud GPU training approaches for deep learning models.

Build software and tools that improve the research team's rate of experimentation and extract insights from it.

Identify and evaluate new patterns and technologies to improve the performance, maintainability, and elegance of our machine learning systems.

Lead and execute technical projects to completion.

Communicate with peers to gather requirements and track progress.

Mentor fellow engineers in your areas of expertise, and contribute to a team culture that values effective collaboration, technical excellence, and innovation.

Collaborate with engineers across various functions to solve complex data problems at scale.

Qualifications:

5-8 years of professional experience implementing MLOps frameworks to scale up ML in production.

Master's degree or PhD in Computer Science or a Machine Learning / Deep Learning domain.

Must-have:

Hands-on experience with Kubernetes, Kubeflow, MLflow, SageMaker, and other ML experiment management tools, covering training, inference, and evaluation.

Experience in ML model serving (TorchServe, TensorFlow Serving, NVIDIA Triton Inference Server, etc.).

Proficiency with ML model training frameworks (PyTorch, PyTorch Lightning, TensorFlow, etc.).

Experience with GPU computing for data and model training parallelism.

Solid software engineering skills in developing production systems.

Strong expertise in Python.
Experience building end-to-end data systems as an ML Engineer, Platform Engineer, or equivalent.

Experience with cloud data processing technologies (AWS services such as S3, ECR, and Lambda, plus Spark, Dask, Elasticsearch, Presto, SQL, etc.).

Geospatial / remote sensing experience is a plus.

Competencies:

Excellent debugging and critical thinking skills.

Excellent analytical and problem-solving skills.

Ability to work in a fast-paced, team-based environment.

Benefits:

Medical health cover for you and your family, including unlimited online doctor consultations

Access to mental health experts for you and your family

Dedicated allowances for learning and skill development

Comprehensive leave policy with casual leaves, paid leaves, marriage leaves, and bereavement leaves

Twice-a-year appraisal

Job Type: Full-time
Work Location: In person
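As a caricature of the model-serving lifecycle named in the must-haves (TorchServe, TensorFlow Serving, and Triton each formalize some version of load → preprocess → predict), here is a toy, framework-free sketch. The `ToyHandler` class and its input-doubling "model" are entirely hypothetical stand-ins, not any real server's API:

```python
class ToyHandler:
    """Toy stand-in for a serving handler: lazy model load + batched predict."""

    def __init__(self):
        self.model = None  # loaded on first request, as many servers do

    def load(self):
        # A real handler would deserialize trained weights here; this stub
        # "model" just doubles each value in the batch.
        self.model = lambda batch: [2 * x for x in batch]

    def predict(self, batch):
        if self.model is None:
            self.load()  # lazy initialization on the first request
        return self.model(batch)

handler = ToyHandler()
result = handler.predict([1, 2, 3])  # lazy-loads, then runs inference
```

Real handlers add batching, timeouts, and health checks around this same skeleton; the interview conversation tends to be about those production concerns rather than the skeleton itself.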
We are looking for a Data Science Intern to join our Data Science team. As an intern, you will contribute to advancing machine learning for geospatial applications, with a focus on self-supervised and weakly supervised learning. The role combines research and hands-on development: you'll explore large-scale Earth observation datasets, design and train models across multiple data modalities, and benchmark them against state-of-the-art approaches from the research community. At the same time, you'll gain experience building robust, reproducible systems and collaborating with a fast-moving team to turn ideas into practical solutions that push the boundaries of geospatial AI.

Key responsibilities:

Conduct data discovery and exploratory analysis on open-source Earth observation datasets.

Collaborate with researchers and engineers to design experiments, share insights, and iterate on approaches while writing clean, maintainable code.

Build and train machine learning models across multiple data modalities (RGB, multispectral, radar, LiDAR, text, etc.).

Benchmark models against state-of-the-art baselines from peer-reviewed research.

Maintain experiment logs, ensure reproducibility, and contribute to shared code repositories.

Document methodologies and share results through technical reports, internal presentations, or research publications.

What we are looking for:

Pursuing (or recently completed) a B.Tech, M.Tech, MS (Research), or PhD in a technical field relevant to the role (e.g., CS, EE, EC, AI).

Proficiency in Python, with experience in ML/DL frameworks (PyTorch).

Prior hands-on experience with ML/DL projects (academic, research, or personal) in topics related to computer vision.

Skilled at translating research papers into working prototypes and practical implementations.
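One of the responsibilities above is keeping experiments logged and reproducible. A minimal illustration of the idea follows; the config keys, seed derivation, and "metric" are all hypothetical stand-ins for a real training run, where you would seed PyTorch and NumPy the same way:

```python
import hashlib
import json
import random

def run_experiment(config):
    """Toy reproducible 'experiment': the RNG seed is derived from the
    config itself, so the same config always yields the same logged result."""
    blob = json.dumps(config, sort_keys=True).encode()
    seed = int(hashlib.sha256(blob).hexdigest(), 16) % 2**32
    rng = random.Random(seed)
    score = rng.random()  # stand-in for a real validation metric
    return {"config": config, "seed": seed, "score": score}

# Re-running with an identical config reproduces the identical log entry.
log_a = run_experiment({"lr": 1e-3, "epochs": 5})
log_b = run_experiment({"lr": 1e-3, "epochs": 5})
```

Storing the config and seed alongside every result, as the returned record does, is the habit that makes experiment logs auditable later.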
Good to have:

Prior experience working with geospatial data and familiarity with geospatial processing libraries (GDAL, rasterio, geopandas, xarray, rioxarray)

Publications in remote sensing or ML/vision conferences

Internship Details:

Mode: Hybrid
Duration: 3 months
Stipend: ₹15,000 - ₹35,000/month (based on experience and skillset)
Openings: 3 positions
Full-Time Opportunity: High-performing interns will be considered for a full-time role upon successful completion of the internship.

Perks:

Letter of recommendation

Certificate

Free snacks and beverages

Informal dress code

5 days a week

If you are a motivated and talented individual with a passion for data science, this is the perfect opportunity to expand your skills and make a real impact. Apply now and be part of a team that is shaping the future of geospatial analytics.

Job Types: Full-time, Internship
Pay: ₹15,000.00 - ₹35,000.00 per month
Work Location: In person