
Closed
Posted
Paid on delivery
Project Overview:
I am looking to hire an experienced freelancer or team to perform large-scale data scraping, processing, and structuring of product data from multiple competitor websites. The objective is to create a complete, clean, and standardized dataset, along with fully processed images, that can be directly uploaded to my e-commerce platform with minimal manual effort.

This project includes:
Deep scraping (beyond visible listings)
Data cleaning and structuring
SKU generation
Image downloading, renaming, watermarking, and format conversion
Organized cloud storage and delivery

Scope of Work

1. Data Scraping (Comprehensive Coverage)
You must extract all models and all associated spare parts, not just what is visible on listing pages.

A. [login to view URL]
Target URLs:
[login to view URL]
[login to view URL]
Critical Requirement: Maxbhi brand pages show limited results (~60), but significantly more models exist via search.
You must extract:
ALL models and their images
ALL spare parts for each model
Brands to Cover: Nothing, Apple, Samsung, OnePlus, Oppo, Vivo, Realme, Pixel, Asus, Honor, Infinix, iQOO, Lenovo, LG, Xiaomi, Tecno

B. Cellspare
Extract:
ALL models
ALL spare parts
Model-level images
Brands to Cover: Nothing, Apple (including iPads and Apple Watches), Samsung, OnePlus, Oppo, Vivo, Realme, Pixel, Asus, Honor, Infinix, iQOO, Lenovo, LG, Xiaomi, Tecno, Poco

C. [login to view URL]
Source: [login to view URL]
Extract:
All Apple Watches
All iPads
All spare parts associated with these devices

2. Data Extraction Requirements
For each product, extract:
Product Name
Color (if available)
Model Name
Category (e.g., display, battery, etc.)
Description
Actual Price
Markup Price / MRP
Discount (if available)
Brand Name
Ensure clean, complete, and consistent data.

3. Data Structuring (CSV Output)
Deliver a well-structured CSV file where:
Each row = one product
Each column = one attribute
Mandatory Columns:
Product Name
Color
Model Name
Category
Description
Markup Price
Actual Price
Brand Name
img_1
img_2
img_3
img_4
img_5
img_6

4. SKU Generation
Generate a unique SKU for each product
Maintain a consistent, scalable format

5. Image Extraction and Processing

A. Image Collection
Download all product images (not just URLs)

B. Naming Convention (Strict)
Format: [login to view URL]
Example: [login to view URL]
Requirements:
Include Product Name + Color + SAHII + SKU + Image Number
Remove competitor references from names

C. Multiple Images Handling
Map images into CSV columns (img_1 to img_6)
Each column must contain the correct file name

D. Image Processing
Convert all images to WebP
Apply watermark “[login to view URL]” (provided PNG)
Maintain quality with compression

6. Storage & Organization
Upload all images to a Mega ([login to view URL]) folder
Maintain clean folder structure
Ensure exact mapping between CSV and images

7. Final Deliverables

A. CSV File
Clean, structured, complete dataset
One row per product
All required columns filled
Correct image mapping

B. Image Dataset
All images downloaded, renamed, watermarked, converted to WebP
Uploaded to Mega

C. Data Integrity
No duplicate entries
No missing mappings between products and images
As little competitor branding in output images as possible
Complete coverage of ALL spare parts for each model

Key Expectations
Exhaustive scraping (not surface-level)
Ability to bypass pagination and search limitations
High accuracy and attention to detail
Output must be ready for direct e-commerce upload
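For illustration only, the SKU, file naming, and CSV mapping requirements could be sketched roughly as follows. The posting's exact naming format is not visible here, so everything below is an assumption: the SKU layout (brand code, category code, short hash), the hyphen separator, the extra `SKU` column, and the helper names `slugify`, `code`, `make_sku`, `image_filename`, `build_row`, and `write_csv` are all hypothetical. Only the column names, the field order Product Name + Color + SAHII + SKU + Image Number, the WebP extension, and the competitor names come from the description above.

```python
import csv
import hashlib
import io
import re

# Mandatory columns from the posting, plus an assumed "SKU" column.
COLUMNS = ["SKU", "Product Name", "Color", "Model Name", "Category", "Description",
           "Markup Price", "Actual Price", "Brand Name"] + [f"img_{i}" for i in range(1, 7)]
COMPETITOR_TERMS = ("maxbhi", "cellspare")  # competitor names mentioned in the posting


def slugify(text):
    """Lowercase, drop competitor terms, join alphanumeric runs with hyphens."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return "-".join(w for w in words if w not in COMPETITOR_TERMS)


def code(text):
    """First three alphanumeric characters, upper-cased ('XXX' if there are none)."""
    return re.sub(r"[^A-Za-z0-9]", "", text).upper()[:3] or "XXX"


def make_sku(product):
    """Assumed format: BRAND-CATEGORY-<8 hex digits of a SHA-1 over identifying fields>."""
    key = "|".join(product.get(k, "").strip().lower()
                   for k in ("Brand Name", "Model Name", "Category", "Product Name", "Color"))
    digest = hashlib.sha1(key.encode("utf-8")).hexdigest()[:8].upper()
    return f"{code(product['Brand Name'])}-{code(product['Category'])}-{digest}"


def image_filename(product, sku, index):
    """Product Name + Color + SAHII + SKU + Image Number, joined by hyphens (assumed)."""
    parts = [slugify(product["Product Name"]), slugify(product.get("Color", "")),
             "SAHII", sku, str(index)]
    return "-".join(p for p in parts if p) + ".webp"


def build_row(product, image_count):
    """Build one CSV row; img_1..img_6 hold file names, empty when no image exists."""
    sku = make_sku(product)
    row = {col: product.get(col, "") for col in COLUMNS}
    row["SKU"] = sku
    for i in range(1, 7):
        row[f"img_{i}"] = image_filename(product, sku, i) if i <= image_count else ""
    return row


def write_csv(rows):
    """Serialize rows to CSV text, skipping any row whose SKU was already written."""
    seen = set()
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=COLUMNS)
    writer.writeheader()
    for row in rows:
        if row["SKU"] in seen:
            continue
        seen.add(row["SKU"])
        writer.writerow(row)
    return buf.getvalue()
```

Because the SKU is derived from the product's own fields, re-running the pipeline on the same product yields the same SKU and the same file names, which keeps the CSV-to-image mapping stable across runs. A product named "Maxbhi OLED Display" in black from Samsung, category Display, would get an SKU starting with `SAM-DIS-` and image names starting with `oled-display-black-SAHII-`, with the competitor name removed.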
Project ID: 40317449
11 proposals
Remote project
Active 25 days ago
11 freelancers are bidding an average of ₹9,386 INR for this project

I can build a scalable Python scraper to extract complete product data, generate SKUs, and deliver clean CSV with fully processed images ready for upload.
₹5,000 INR in 7 days
3.8

Hi, I can handle complete end-to-end product data scraping, processing, and structuring for your e-commerce upload. From extracting accurate data to cleaning, organizing, and formatting it for direct upload, I ensure a smooth, error-free workflow. Clean, structured, duplicate-free data with fast delivery — ready to start immediately. Best regards.
₹10,000 INR in 7 days
2.9

I have 21 years of experience and am a Principal Software Engineer at Microsoft. I have extensive experience with data storage, data integration, ETL, and data pipeline development, and I am well versed in SQL Server and efficient API development. I have experience building and designing distributed systems with tools like Apache Flink and Kafka. I have built multiple microservices-based applications with GraphQL APIs, an enhanced query tool, and event-driven architecture. I have understood your requirements and can deliver fully.
₹7,000 INR in 7 days
2.8

Hello there, as an expert in large-scale data scraping and e-commerce data pipelines, I have carefully reviewed your requirement for deep extraction, processing, and structuring of product data with full image handling. I understand the need for complete coverage, clean datasets, SKU logic, and ready-to-upload output. I will build automated scrapers using Python with proxy handling to bypass limits, followed by structured processing pipelines for CSV generation, SKU creation, and image workflows including renaming, WebP conversion, and watermarking with exact mapping. With 12+ years in AI, automation, and scalable systems, I deliver reliable, high-accuracy pipelines ready for production use. Happy to share similar work and approach. Thanks, Chirag
₹7,000 INR in 5 days
2.3

Hi, I can handle this as a full-scale scraping and product data structuring project using advanced programming tools for efficient and accurate extraction. I understand this requires deep scraping (not just visible listings), along with clean structuring, SKU generation, and full image processing. I use custom scripts and automation tools to ensure complete coverage across all models and spare parts, while maintaining high data accuracy.
I will deliver:
Clean, standardized CSV (upload-ready)
Unique SKU system
Processed images (renamed, WebP, watermarked)
Well-organized storage with correct mapping
My approach focuses on automation, scalability, and data integrity, ensuring no missing entries or mismatches. Ready to start immediately.
₹10,000 INR in 3 days
1.9

Hello, this project matches my experience very well. I have worked on large-scale scraping and catalog-building workflows involving deep product extraction, normalization, SKU generation, and image pipeline automation. In a previous project, I extracted and processed over 40,000 product records and images, then transformed the dataset into a structured, platform-ready format for e-commerce operations.
I can support the full pipeline:
exhaustive scraping across all required brands and models
associated spare parts extraction
structured CSV preparation with clean attribute mapping
scalable SKU generation
image download, rename, watermark, WebP conversion, and CSV mapping
final organized delivery for direct import
I pay close attention to completeness, deduplication, consistency, and exact image-to-product matching, which is critical for projects of this scale. Best regards, Oliver
₹6,500 INR in 9 days
0.0

New Delhi, India
Payment method verified
Member since Nov 9, 2025