SENIOR DATA /AI ENGINEER

7 years

0 Lacs

Posted:1 week ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Requirements:

  • 7+ years of related experience with a bachelor’s degree.
  • Proven experience designing and deploying applications using Generative AI and large language models (e.g., GPT4, Claude, open-weight large language models (LLMs)).
  • Understanding of retrieval-augmented generation, embeddings-based search, agent orchestration, or prompt chaining.
  • Familiarity with modern LLM/GenAI tools such as LangChain, LlamaIndex, HuggingFace Transformers, Semantic Kernel, or LangGraph.
  • Advanced knowledge of SQL and experience working with relational and NoSQL databases, query authoring (SQL), as well as working familiarity with a variety of databases (e.g., SQL Server).
  • Experience building and optimizing data pipelines on Azure Databricks.
  • In-depth knowledge of data engineering, machine learning, data warehousing, and Delta Lake on Databricks.
  • Strong knowledge of Spark and Python.
  • A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
  • Excellent skills in stakeholder management and communication, enabling effective communication across global teams.

Nice To Have:

  • Familiarity with Fivetran.
  • Familiarity with BI tools like Power BI, etc.
  • Understanding building and deploying ML and feature engineering pipelines to production using MLflow.
  • Experience with building a data pipeline from various business applications like Salesforce, NetSuite, etc.
  • Knowledge of message queuing, stream processing, and highly scalable data stores.
  • Experience working in a compliance-based environment, including building and deploying compliant software solutions throughout the software life cycle.
  • Familiarity with cloud-based AI/ML services and Generative AI tools.

Responsibilities:

  • Design and development of systems for the maintenance of the Azure Databricks, ETL processes, business intelligence, and data ingestion pipelines for AI/ML use cases.
  • Build, scale, and optimize GenAI and ML workloads across Databricks and other production environments, with strong attention to cost-efficiency, compliance, and robustness.
  • Build ML pipelines to train, serve, and monitor reinforcement learning or supervised learning models using Databricks and MLflow.
  • Create and support ETL pipelines and table schemas to facilitate the integration of new and existing data sources into the Lakehouse on Databricks.
  • Maintain data governance and data privacy standards.
  • Collaborate with data architects, data scientists, analysts, and other business consumers to quickly and thoroughly analyze business requirements to populate the data warehouse, optimized for reporting and analytics.
  • Perform root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
  • Maintain technical documentation and mentor junior data engineers on best practices in data engineering and Lakehouse architecture.
  • Drive innovation and contribute to the development of cutting-edge Generative AI and analytical capabilities for the Next-Gen research enablement platform.

We Offer:

  • US and EU projects based on advanced technologies.
  • Competitive compensation based on skills and experience.
  • Regular performance appraisals to support your growth.
  • 15 vacation days, 10 national holidays, 5 sick days.
  • Free tech webinars and meetups organized by Svitla.
  • Reimbursement for private medical insurance.
  • Personalized learning program tailored to your interests and skill development.
  • Bonuses for article writing, public talks, and other activities.
  • Fun corporate online\offline celebrations and activities.
  • Awesome team, friendly and supportive community!

Requirements:

  • 7+ years of related experience with a bachelor’s degree.
  • Proven experience designing and deploying applications using Generative AI and large language models (e.g., GPT4, Claude, open-weight large language models (LLMs)).
  • Understanding of retrieval-augmented generation, embeddings-based search, agent orchestration, or prompt chaining.
  • Familiarity with modern VV tools such as LangChain, LlamaIndex, HuggingFace Transformers, Semantic Kernel, or LangGraph.
  • Advanced knowledge of SQL and experience working with relational and NoSQL databases, query authoring (SQL), as well as working familiarity with a variety of databases (e.g., SQL Server).
  • Experience building and optimizing data pipelines on Azure Databricks.
  • In-depth knowledge of data engineering, machine learning, data warehousing, and Delta Lake on Databricks.
  • Strong knowledge of Spark and Python.
  • A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
  • Excellent skills in stakeholder management and communication, enabling effective communication across global teams.

Nice To Have:

  • Familiarity with Fivetran.
  • Familiarity with BI tools like Power BI, etc.
  • Understanding building and deploying ML and feature engineering pipelines to production using MLflow.
  • Experience with building a data pipeline from various business applications like Salesforce, NetSuite, etc.
  • Knowledge of message queuing, stream processing, and highly scalable data stores.
  • Experience working in a compliance-based environment, including building and deploying compliant software solutions throughout the software life cycle.
  • Familiarity with cloud-based AI/ML services and Generative AI tools.

Responsibilities:

  • Design and development of systems for the maintenance of the Azure Databricks, ETL processes, business intelligence, and data ingestion pipelines for AI/ML use cases.
  • Build, scale, and optimize GenAI and ML workloads across Databricks and other production environments, with strong attention to cost-efficiency, compliance, and robustness.
  • Build ML pipelines to train, serve, and monitor reinforcement learning or supervised learning models using Databricks and MLflow.
  • Create and support ETL pipelines and table schemas to facilitate the integration of new and existing data sources into the Lakehouse on Databricks.
  • Maintain data governance and data privacy standards.
  • Collaborate with data architects, data scientists, analysts, and other business consumers to quickly and thoroughly analyze business requirements to populate the data warehouse, optimized for reporting and analytics.
  • Perform root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
  • Maintain technical documentation and mentor junior data engineers on best practices in data engineering and Lakehouse architecture.
  • Drive innovation and contribute to the development of cutting-edge Generative AI and analytical capabilities for the Next-Gen research enablement platform.

We Offer:

We Offer:
  • US and EU projects based on advanced technologies.
  • Competitive compensation based on skills and experience.
  • Regular performance appraisals to support your growth.
  • 15 vacation days, 10 national holidays, 5 sick days.
  • Free tech webinars and meetups organized by Svitla.
  • Reimbursement for private medical insurance.
  • Personalized learning program tailored to your interests and skill development.
  • Bonuses for article writing, public talks, and other activities.
  • Fun corporate online\offline celebrations and activities.
  • Awesome team, friendly and supportive community!

Mock Interview

Practice Video Interview with JobPe AI

Start Job-Specific Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Skills

Practice coding challenges to boost your skills

Start Practicing Now

RecommendedJobs for You