International Data Collection Intern
Location: Remote / Hybrid (India-based, working with global data) Duration: 3–6 months Department: Data Operations & Research Reports To: Data Operations Manager About Kuinbee: Role Overview As an International Data Collection Intern, you will play a key role in sourcing, extracting, and structuring datasets from global sources. You will work with a combination of manual research and automated Python-based tools such as BeautifulSoup, Selenium, and Playwright to gather accurate and well-structured data from websites, APIs, and databases. Key Responsibilities Web Scraping & Automation – Use Python (BeautifulSoup, Selenium, Playwright, Requests) to extract data, structure, and clean it (CSV, JSON, XLSX). API & Data Sourcing – Integrate with APIs and gather datasets from government portals, research databases, industry reports, and open-data repositories. Data Processing & Quality – Apply Pandas/NumPy for cleaning, validation, and ensuring accuracy, completeness, legality, and compliance with privacy laws (GDPR, CCPA). Documentation & Compliance – Maintain replicable scripts, source lists, methodologies, and adhere to licensing, ethical, and legal standards. Collaboration & Communication – Work with analytics, compliance, and product teams; communicate effectively in English. Preferred Skills – Knowledge of Git/GitHub, cloud platforms (AWS/GCP/Azure), Excel/Google Sheets, multilingual abilities, and an academic background in Data Science/Computer Science. What You’ll Gain Real-world experience in large-scale, international data collection projects. Exposure to high-demand Python scraping and automation tools. Hands-on experience in sectors like finance, economics, energy, environment, and agriculture. Mentorship from experienced data engineers and analysts. Possibility of full-time placement based on performance.