Job Summary:
You will also be responsible for integrating the front-end elements built by your co-workers into the application; therefore, a basic understanding of front-end technologies is necessary as well.
Key responsibilities:
- Design, implement, and refine machine learning algorithms and models for a variety of applications, including natural language processing, computer vision, and recommendation systems.
- Collaborate with data engineers to collect, preprocess, and analyze large datasets. Ensure data quality and integrity for effective model training and evaluation.
- Optimize model performance through hyperparameter tuning, feature engineering, and advanced techniques such as transfer learning and ensemble methods.
- Develop scalable and efficient pipelines for deploying machine learning models into production environments. Monitor model performance and implement updates as necessary.
- Work with cross-functional teams to understand requirements, provide technical insights, and integrate machine learning solutions into existing systems and workflows.
- Stay current with the latest advancements in machine learning and AI. Contribute to research initiatives and explore new technologies and methodologies to improve our offerings.
Preferred Qualifications:
- Bachelor's or Masters degree in Computer Science, Engineering, Mathematics, or a related field. Ph.D. is a plus.
- Proven experience in developing and deploying machine learning models. Familiarity with various machine learning frameworks and libraries (e.g., TensorFlow, PyTorch, Scikit-learn).
- Proficiency in programming languages such as Python & R
- Experience with cloud platforms (e.g., AWS, Google Cloud, Azure) and containerization tools (e.g., Docker, Kubernetes).
- Strong understanding of machine learning algorithms, statistical analysis, and data modeling.
- Excellent analytical and problem-solving skills. Ability to tackle complex challenges and devise effective solutions.
- Strong verbal and written communication skills. Ability to convey technical concepts to non-technical stakeholders.
- Proven ability to work collaboratively in a fast-paced, team-oriented environment.
- Experience with deep learning frameworks and techniques.
Key traits:
- Should have excellent communication skills.
- Should be self motivated and willing to work as part of a team.
- Should be able to collaborate and coordinate with onsite and offsite
- Be a problem solver and be proactive to solve the challenges that come his way.
About Us
Job Summary:
Responsibilities:
- Develop and maintain scalable web scraping scripts using Python, JavaScript, and related frameworks (e.g., Scrapy, Selenium, Puppeteer, Beautiful Soup, Cheerio.js).
- Implement advanced web scraping techniques, including API interception, sitemap parsing, and handling dynamic content.
- Design and build data pipelines for efficient data extraction, processing, and storage.
- Manage and optimize data extraction workflows, ensuring high speed and accuracy.
- Implement anti-crawling solutions, including IP rotation, proxy management, and CAPTCHA bypassing.
- Collaborate with cross-functional teams to gather requirements and deliver complex data solutions.
- Utilize backend technologies such as Flask, FastAPI, Django, Node.js, Spring Boot, and relational databases (PostgreSQL, MySQL) for data storage and API development.
- Work with cloud platforms like Azure and leverage services such as AzureML and ADLS GEN2.
Employ data processing techniques using libraries like NumPy and Pandas.
- Use tools like Postman, MITM, and DevTools for API testing and network traffic analysis.
- Apply machine learning and NLP techniques for data analysis and processing (e.g., sentiment analysis, content classification).
- Set up and manage server-side scraping environments.
- Monitor and troubleshoot scraping scripts to ensure optimal performance.
Required Skills:
- Strong proficiency in Python and/or JavaScript.
- Experience with web scraping libraries and frameworks (Scrapy, Selenium, Puppeteer, Beautiful Soup, Cheerio.js).
- Knowledge of backend technologies (Flask, FastAPI, Django, Node.js, Spring Boot).
- Experience with relational databases (PostgreSQL, MySQL).
- Understanding of HTTP/HTTPS protocols and API communication.
- Familiarity with cloud platforms (Azure).
- Ability to handle dynamic content and JavaScript-heavy websites.
- Experience with anti-crawling techniques (IP rotation, proxies, CAPTCHA bypassing).
- Data processing and analysis skills (NumPy, Pandas).
- Experience with API testing tools (Postman).
- Knowledge of machine learning and NLP concepts is a plus.
- Strong problem-solving and debugging skills.
- Excellent communication and collaboration skills.
Experience:
- 3+ years of professional experience in web scraping or related fields. Education:
- Bachelor's degree in Computer Science, Engineering, or a related field