Pune
INR 10.0 - 20.0 Lacs P.A.
Remote
Full Time
Responsibilities: Data Pipeline Design and Implementation: Develop and implement data pipelines that ingest, transform, and load data into a format suitable for machine learning models. Data Quality and Management: Ensure data quality through data cleaning, validation, and transformation processes. Model Development and Deployment: Collaborate with data scientists to develop and deploy machine learning models, including building and testing them. Feature Engineering: Create and select relevant features from the data to improve model performance. Model Monitoring and Maintenance: Monitor model performance and identify areas for improvement. Scalability and Performance: Design and build data infrastructure that can handle large volumes of data and support real-time analytics. Collaboration: Work closely with data scientists, software engineers, and other stakeholders to achieve common goals. Data Security and Compliance: Ensure data security and compliance with relevant regulations. Documentation: Maintain comprehensive documentation for data pipelines and models. Key Skills: Programming Languages: Proficiency in Python Generative Ai: LLM Machine Learning Frameworks: Knowledge of frameworks . Data Warehousing: Experience with data warehousing solutions like Redshift or Panoply. Cloud Computing: Familiarity with cloud platforms like AWS, Azure, or GCP. Data Structures and Algorithms: Strong understanding of data structures and algorithms. Statistical Analysis: Ability to perform statistical analysis and interpret results. Communication and Collaboration: Excellent communication and collaboration skills.
Pune
INR 25.0 - 35.0 Lacs P.A.
Work from Office
Full Time
Role Description This is a full-time on-site role located in Pune for a MSSQL DBA at Pansoft Technologies LLC. The responsibilities include database administration, database design, troubleshooting, managing databases, and replication on a day-to-day basis. Qualifications Database Administration and Troubleshooting skills Experience in Database Design and Databases management Experience in Replication of databases Excellent problem-solving and communication skills Ability to work effectively in a team environment Relevant certifications in MSSQL or database administration Bachelor's degree in Computer Science, Information Technology, or related field Job Description 1. Cluster Installation and Setup Cluster Configuration: Install and configure database clustering software (e.g., SQL Server Always On Availability Groups, Oracle Real Application Clusters (RAC), MySQL Group Replication). Example: Setting up a SQL Server Always On Availability Group across multiple nodes to ensure high availability and automatic failover. Node Configuration: Configure and prepare each node in the cluster environment, ensuring each instance has the correct configurations, storage paths, and required access. 2. Cluster High Availability (HA) and Disaster Recovery (DR) Setup Setting up Failover Mechanisms: Configure automatic failover to another node in the cluster to ensure high availability in case of node failure. This could involve configuring failover clustering or database mirroring. Example: Setting up automatic failover between SQL Server instances in an Availability Group. Disaster Recovery Configurations: Implement disaster recovery solutions to ensure minimal downtime and data loss in the event of a catastrophic failure. This may involve off-site replication, synchronous/asynchronous replication, or a standby replica. Example: Configuring a log shipping or Always On secondary replica to a geographically remote site for disaster recovery. Backup Strategy for Clustered Databases: Ensure that backups are correctly configured for the entire cluster, including full, differential, and transaction log backups. Example: Implementing a backup strategy using SQL Server's Backup to URL for cloud-based backups or ensuring backups happen on the primary node and are replicated to secondary nodes. 3. Database Instance and Resource Management Resource Allocation and Load Balancing: Manage the allocation of resources (CPU, memory, I/O) across nodes to ensure optimal performance and prevent resource contention. Load balancing across clustered nodes is often required for distributed databases. Example: Configuring load balancing in SQL Server Always On to distribute query traffic across secondary replicas, ensuring optimal read performance. Clustered Database Performance Tuning: Monitor the performance of the database cluster, identify bottlenecks, and optimize performance. Example: Adjusting the SQL Server MaxDOP (Maximum Degree of Parallelism) setting or tweaking Oracle's Clusterware settings to ensure balanced performance. Scaling the Cluster: Add or remove nodes from the cluster as needed to scale out the database architecture to meet growing demands. Example: Adding an additional node to a MySQL Cluster to handle increased traffic. 4. Database Synchronization and Replication Data Synchronization: Ensure that data across all nodes in the cluster is consistent, particularly for synchronous replication (where changes on the primary node are replicated to secondary nodes in real-time). Example: Managing the replication lag in SQL Server Always On or ensuring Oracle RAC nodes are synchronized with the shared storage for data consistency. Clustered Database Replication Setup: Configure and maintain data replication across multiple nodes in the cluster to ensure availability and redundancy. This can be asynchronous or synchronous, depending on the use case. Example: Setting up SQL Server replication or Oracle Streams for replicating data between primary and secondary cluster nodes. Conflict Resolution: Handle any conflicts that may arise during replication, ensuring that data consistency is maintained across all nodes in the cluster. Example: Resolving conflicts between replicated data using Oracle GoldenGate or ensuring SQL Server replication conflicts are handled automatically. 5. Database Cluster Monitoring and Alerts Cluster Health Monitoring: Set up continuous monitoring of the cluster for performance metrics, system health, and failover status. This involves checking for disk space, CPU usage, memory consumption, and any cluster-related errors or warnings. Example: Using SQL Server Management Studio (SSMS) or Oracle Enterprise Manager to monitor cluster status and alert the DBA team on potential issues (e.g., replication delays, node failure). Alerts and Notifications: Implement alerting mechanisms to notify administrators of failures, performance degradation, or other critical issues within the cluster. Example: Setting up SQL Server Agent or Oracle Grid Control to send alerts if a cluster node fails or if the replication process is behind schedule. Role & responsibilities Preferred candidate profile
Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.
We have sent an OTP to your contact. Please enter it below to verify.