
Suoritettu
Julkaistu
Maksettu toimituksen yhteydessä
I need an autonomous script-level AI that can visit a small set of websites I will name after awarding the project, extract rich product information from each page, and deliver everything in a single, well-structured Excel workbook. The agent must pull three kinds of data on every item it encounters: full product specs, a live price that can be compared across sites, and a snapshot of the most recent user reviews. To keep the file actionable, I’d like each row to represent one product and each data group in its own column (for example: “Site”, “Product name”, “Specs”, “Price”, “Review rating”, “Review text”, “Timestamp”). The workbook should be ready for pivot-table analysis the moment I open it. You are free to choose the underlying stack—Python with Scrapy, Selenium, Playwright, or a headless browser combo is fine—as long as it runs reliably on a standard Windows machine and can be scheduled to refresh on demand. Graceful handling of site-specific anti-bot measures and captchas is essential. Acceptance criteria • Agent fetches accurate product details, price, and at least the latest three user reviews per item. • Data for all target sites appears in one consolidated Excel file (.xlsx) with no broken rows or mismatched columns. • Clear setup instructions and commented source code are included so I can rerun or tweak it later. If you have previous experience building multi-site scrapers or AI crawlers that structure results in Excel, let’s talk through your approach and timeline.
Projektin tunnus (ID): 40238799
4 ehdotukset
Etäprojekti
Aktiivinen 20 päivää sitten
Aseta budjettisi ja aikataulu
Saa maksu työstäsi
Kuvaile ehdotustasi
Rekisteröinti ja töihin tarjoaminen on ilmaista

Hello, This reads as a data-collection agent rather than a one-off scraper, with the real value coming from consistency, structure, and repeatability. The key challenges are visiting multiple sites autonomously, normalizing very different product layouts into a single schema, and keeping the output immediately usable for analysis without manual cleanup. A clean way to approach this is to separate concerns: a crawling layer that handles navigation, throttling, and anti-bot behavior on a per-site basis, and an extraction layer that maps each page into a fixed product schema (site, product identity, specs block, live price, recent reviews, timestamp). That normalized data then feeds directly into a single Excel writer, ensuring every product occupies exactly one row and every column stays aligned for pivot tables and comparisons. To proceed smoothly, the main things to clarify are the target sites, how frequently the refresh should run, and whether review text should be stored raw or lightly summarized. Thanks, Looking forward to working through the details
$25 USD 7 päivässä
0,0
0,0
4 freelancerit tarjoavat keskimäärin $75 USD tätä projektia

꧁ ༺ ❤️ Hello ❤️ ༻ ꧂ I’m a Python automation specialist experienced in building multi-site intelligent scrapers using Playwright/Scrapy and delivering clean Excel-ready datasets. I can create a reliable agent that extracts product specs, live prices, and recent reviews, then consolidates everything into a pivot-ready workbook. ❤️Previous Experience I have experience with projects similar to your requirements. The challenges I faced in those projects were dynamic rendering, anti-bot defenses, and structured Excel output. - I used Playwright + stealth techniques to handle JS-heavy sites and reduce blocking. - I used custom parsing pipelines (Python/Pandas) to normalize specs, prices, and reviews across different layouts. - I used openpyxl automation to generate clean, analysis-ready .xlsx files with consistent schemas. Based on my previous experience and skills, I'm confident I can seamlessly implement your requirements. I'm a full-time developer available to work full-time in your time zone and can start working today. I have excellent communication skills and always strive for better results. I look forward to hearing from you. Thank you❤️
$200 USD 5 päivässä
0,0
0,0

Hi! I can build the autonomous script you need to extract product specs, live pricing, and reviews into a pivot-table-ready Excel workbook. Given your requirement for reliability on Windows and anti-bot handling, I recommend a hybrid approach using Python with Playwright or Selenium to ensure we bypass captchas gracefully. How I will deliver this: Structured Data: I will ensure each row represents a unique product with dedicated columns for Site, Price, Specs, and Reviews (including the latest 3 reviews as requested). Excel Optimization: The output will be a clean .xlsx file, formatted specifically for immediate pivot-table analysis. Anti-Bot Resilience: I will implement custom headers and rotation to handle site-specific measures. Future-Proof Code: You will receive well-commented source code and a simple setup guide so you can refresh the data on demand. My Tech Stack for this project: Engine: Python (Playwright/Selenium). Data Handling: Pandas for structuring and cleaning. Output: Openpyxl for advanced Excel formatting. I have extensive experience building multi-site scrapers and I can start working on your script immediately.
$45 USD 4 päivässä
0,0
0,0

I can build the autonomous scraper exactly as described: a script-level agent that visits the specified sites, extracts product specs, live prices, and recent reviews, then consolidates everything into a single structured Excel workbook ready for analysis. Relevant experience: I previously engineered a full data pipeline that scraped Twitter via an API to collect Saudi stock-market-related tweets, processed Arabic text, ran sentiment classification models, and then analyzed correlations with real market price movements. This involved robust data extraction, normalization, automation, and analytics-ready outputs—directly applicable to multi-site product intelligence scraping. Deliverables • Script that runs on Windows and can be scheduled (Task Scheduler) • Consolidated .xlsx with columns: Site, Product Name, Specs, Price, Review Rating, Review Text, Timestamp • At least the latest three reviews per product • Setup guide + well-commented code Estimated timeline • Initial working version: 5–7 days after site list is provided • Hardening and edge-case handling: +2–3 days
$30 USD 7 päivässä
0,0
0,0

HYDERABAD, India
Liittynyt syysk. 24, 2020
₹600-1000 INR
$50-79 USD
₹5000-5500 INR
$30-31 USD
₹2000-4000 INR
₹600-1500 INR
$750-1500 USD
$10-30 USD
$2-8 USD/ tunnissa
$30-250 USD
$25-40 USD/ tunnissa
$750-1500 USD
$2-8 USD/ tunnissa
₹750-1250 INR/ tunnissa
£10-20 GBP
$30-250 USD
$250-750 USD
£10-20 GBP
$30-250 USD
$250-750 USD
₹1500-12500 INR
₹5000-9000 INR
$10-50 USD
₹600-1500 INR
₹600-1500 INR