
Closed
Posted
Paid on delivery
I need all relevant textual content lifted directly from a defined set of web pages and delivered in a clean, structured format that can be dropped straight into a database. The sites are publicly accessible, but the information is spread across multiple pages, so the scraper will have to crawl links, respect [login to view URL], and handle pagination. Because my ultimate goal is to fill a database, please organise the scraped data into a tidy CSV or JSON with consistent field names. If you have a preferred schema or can recommend the best relational or NoSQL store for this volume, I’m open to suggestions—flexibility here is an advantage. Key requirements • Source: web pages only (no APIs or PDFs involved) • Content: text data exclusively; no images are needed • Approach: any reliable stack you’re comfortable with—Python + BeautifulSoup/Scrapy, Node + Cheerio/Puppeteer, or similar—as long as the code is well-commented and repeatable • Respect polite scraping practices (rate limiting, user-agent, retries) Deliverables 1. Scraping script(s) with clear setup instructions 2. The fully extracted dataset in CSV or JSON 3. Brief read-me outlining how to rerun the scraper and import the results into a database I will validate the output by spot-checking several pages for accuracy and by running the script myself to confirm it reproduces the same dataset.
Project ID: 40307140
Remote project
Active 6 days ago
Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs

Bella Vista, United States
Payment method verified
Member since Jan 10, 2019
$8-15 USD / hour
$8-15 USD / hour
$8-15 USD / hour
$8-15 USD / hour
$8-15 USD / hour
$30-250 USD
₹750-1250 INR / hour
$10-30 AUD
₹600-1500 INR
$30-250 USD
$250-750 USD
$30-250 USD
₹750-1250 INR / hour
₹600-1500 INR
$15-25 USD / hour
₹12500-37500 INR
€12-18 EUR / hour
₹1500-12500 INR
$30-250 USD
$2-8 USD / hour
$10-200 USD
$30-250 CAD
$8-20 USD / hour
$2-8 USD / hour
$250-750 USD