Junior Web Scraping Engineer

1 - 5 years

0 Lacs

Posted:18 hours ago| Platform: Shine logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

Role Overview: As a Web Scraper, your main responsibility is to utilize your expertise in fetching data from various online sources. You will be tasked with developing efficient web scrapers and parsers for different websites, extracting both structured and unstructured data, and storing them in SQL/NoSQL databases. Additionally, you will collaborate closely with Project, Business, and Research teams to provide them with the scraped data for analysis. It is crucial to maintain the scraping projects that are delivered to production, develop frameworks to automate data flow from multiple sources, and work independently with minimal supervision. You are expected to have a deep understanding of web data sources and be able to determine the data to scrape, parse, and store. Key Responsibilities: - Develop highly reliable web scrapers and parsers for various websites - Extract structured and unstructured data and store them in SQL/NoSQL data stores - Collaborate with Project, Business, and Research teams to provide scraped data for analysis - Maintain scraping projects in production - Create frameworks for automating and ensuring a constant flow of data from multiple sources - Work independently with minimal supervision - Develop a profound understanding of web data sources and effectively scrape, parse, and store data - Utilize Python for web crawling and web scraping using libraries such as Requests, Beautifulsoup, URLlib, Selenium, and Playwright - Possess strong knowledge of basic Linux commands for system navigation, management, and troubleshooting - Expertise in proxy usage for secure and efficient network operations - Experience with captcha-solving techniques for automation and data extraction - Proficiency in data parsing and knowledge of Regular Expression, HTML, CSS, DOM, XPATH - Familiarity with JavaScript is a plus - Ability to access, manipulate, and transform data from various database and flat file sources - Essential skills in MongoDB and MySQL - Develop reusable code-based scraping products for broader use - Mandatory knowledge of GIT for version control and collaborative development workflows - Experience in handling cloud servers on platforms like AWS, GCP, and LEAPSWITCH for scalable and reliable infrastructure management - Ability to ask pertinent questions and deliver usable results to clients - Track record of tackling complex problems with innovative approaches Qualifications Required: - 1 to 3 years of experience as a Web Scraper - Proficient in Python and web scraping using Requests, Beautifulsoup, URLlib, Selenium, and Playwright - Strong understanding of basic Linux commands - Expertise in proxy usage and captcha-solving techniques - Knowledge of data parsing and Regular Expression, HTML, CSS, DOM, XPATH - Familiarity with JavaScript is advantageous - Skills in MongoDB and MySQL - Proficiency in GIT for version control - Experience with cloud servers on platforms like AWS, GCP, and LEAPSWITCH (Note: The additional details of the company were not included in the provided job description.) Role Overview: As a Web Scraper, your main responsibility is to utilize your expertise in fetching data from various online sources. You will be tasked with developing efficient web scrapers and parsers for different websites, extracting both structured and unstructured data, and storing them in SQL/NoSQL databases. Additionally, you will collaborate closely with Project, Business, and Research teams to provide them with the scraped data for analysis. It is crucial to maintain the scraping projects that are delivered to production, develop frameworks to automate data flow from multiple sources, and work independently with minimal supervision. You are expected to have a deep understanding of web data sources and be able to determine the data to scrape, parse, and store. Key Responsibilities: - Develop highly reliable web scrapers and parsers for various websites - Extract structured and unstructured data and store them in SQL/NoSQL data stores - Collaborate with Project, Business, and Research teams to provide scraped data for analysis - Maintain scraping projects in production - Create frameworks for automating and ensuring a constant flow of data from multiple sources - Work independently with minimal supervision - Develop a profound understanding of web data sources and effectively scrape, parse, and store data - Utilize Python for web crawling and web scraping using libraries such as Requests, Beautifulsoup, URLlib, Selenium, and Playwright - Possess strong knowledge of basic Linux commands for system navigation, management, and troubleshooting - Expertise in proxy usage for secure and efficient network operations - Experience with captcha-solving techniques for automation and data extraction - Proficiency in data parsing and knowledge of Regular Expression, HTML, CSS, DOM, XPATH - Familiarity with JavaScript is a plus - Ability to access, manipulate, and transform data from various database an

Mock Interview

Practice Video Interview with JobPe AI

Start Python Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
AdvaRisk logo
AdvaRisk

Financial Services

Mumbai Maharashtra

RecommendedJobs for You