Analyst Data/AI Engineering (ML, statistical modeling)

3 - 8 years

6 - 11 Lacs

Posted: 6 hours ago | Platform: Naukri

Work Mode

Work from Office

Job Type

Full Time

Job Description

Key Responsibilities
**Text Embeddings & NLP**
- Design and implement pipelines leveraging text embeddings for semantic search, classification, clustering, and document retrieval.
- Work with embedding techniques such as TF-IDF, Word2Vec, GloVe, FastText, and transformer-based models including BERT, Sentence-BERT, OpenAI, and Azure OpenAI embeddings.
- Apply dimensionality reduction methods (PCA, t-SNE, UMAP) to analyze and visualize embedding spaces.
- Use cosine similarity, Euclidean distance, and approximate nearest neighbor libraries such as FAISS and ScaNN for similarity search and clustering.
- Integrate embedding outputs into downstream applications such as intent detection, topic modeling, semantic deduplication, document ranking, and retrieval systems.
**Traditional Machine Learning & Statistical Modeling**
- Build and deploy predictive models with logistic/linear regression, random forests, gradient boosting techniques (XGBoost, LightGBM), SVM, Naive Bayes, k-means, and hierarchical clustering.
- Employ statistical inference and diagnostic techniques including hypothesis testing, confidence intervals, bootstrapping, Bayesian inference, multicollinearity diagnostics, and residual analysis, along with time series forecasting (ARIMA, SARIMA).
- Evaluate model performance using ROC/Precision-Recall curves, AUC, confusion matrices, F1-score, lift/gain charts, and KS statistics.
- Conduct feature selection and regularization via Lasso/Ridge regression and recursive feature elimination (RFE), and use SHAP values for interpretability.
**Experimentation & Causal Inference**
- Design and analyze A/B and multivariate tests and design-of-experiments (DOE) studies, and apply causal inference methods including propensity score matching, causal forests, and difference-in-differences.
- Translate experimental results into clear, actionable business insights that drive measurable outcomes.
**Data Engineering & Productionization**
- Develop scalable data pipelines using PySpark, SQL, and Azure Data Factory on platforms including Azure Data Lake, Databricks, MongoDB, and Cosmos DB.
- Deploy machine learning solutions with FastAPI, Docker containers, and Azure App Services endpoints, while monitoring model health and model drift with MLflow.
**Collaboration & Leadership**
- Partner effectively with engineering, product, and business teams to define problem statements and deliver impactful solutions.
- Lead technical discussions, perform code reviews, and mentor junior data scientists to foster technical growth.
- Communicate complex analytical insights clearly to both technical and non-technical stakeholders.
Required Skills and Qualifications
- Hands-on experience in machine learning, statistical modeling, and NLP applications.
- Deep expertise in text embeddings and their real-world applications.
- Proficiency in Python, PySpark, and SQL.
- Strong foundation in statistical inference, model diagnostics, and evaluation metrics.
- Experience working with Azure cloud ecosystem, Databricks, and production deployment of ML models.
- Proven ability to design, execute, and interpret experiments with statistical rigor.
Preferred (Good-to-Have) Skills
- Familiarity with transformer-based large language models (LLMs), LangChain, or OpenAI APIs.
- Experience with MLOps tools such as MLflow, and with GitHub Actions CI/CD pipelines deploying to Azure App Services.
- Exposure to graph analytics, retrieval-augmented generation (RAG) pipelines, or agent-based systems.
Day-to-Day Responsibilities
You will architect and implement advanced NLP and machine learning pipelines leveraging diverse text embeddings for semantic search, classification, and clustering tasks. Applying sound statistical modeling and causal inference techniques, you will lead experimentation efforts and build scalable data workflows using PySpark, SQL, and Azure services. Cross-functional collaboration will be a core part of your role as you translate analytical insights into strategic business outcomes.
Location:
IND:KA:Bengaluru / Innovator Building, Itpb, Whitefield Rd - Adm: Intl Tech Park, Innovator Bldg
Job ID: R-72059 | Date posted: 06/26/2025
