
Suljettu
Julkaistu
Maksettu toimituksen yhteydessä
I have a growing database of skincare and cosmetic products that now needs a deterministic way to attach the correct manufacturer to every SKU. The goal is an auditable pipeline—no probabilistic “best-guess” AI—built on rule-based logic, web search data and a transparent confidence-scoring model. Data you will pull from • External websites the pipeline can crawl or scrape for authoritative details • Any API integrations that reliably surface company or product metadata Verification rules must anchor to two primary sources: the official manufacturer websites themselves and relevant government databases. When information conflicts, the system should automatically weigh each source and assign a confidence score, storing the rationale so we can trace every decision later. What I expect you to deliver • Clean, well-documented ETL scripts (Python, SQL or comparable) that ingest, normalise and enrich my current tables • A modular rules engine where I can tweak source priority, matching logic and thresholds without touching core code • A confidence-scoring function that explains how each record was resolved, including the exact URLs or API records consulted • Logging and error-handling stitched in from the start so audits are effortless • Setup notes and a short read-through video or markdown walkthrough showing a full run on a sample dataset Acceptance criteria 1. Running the pipeline on the sample file enriches ≥95 % of rows with a manufacturer_id plus a confidence_score column. 2. Every enriched row links back to at least one stored reference (URL or API call) that can be opened to verify the data. 3. Re-running the same input produces identical output, confirming determinism. If you have previous experience with regulated data, large-scale scrapers or trust-based platforms, that will help us move quickly. Let me know which tools or frameworks you prefer and your estimated turnaround for the first iteration.
Projektin tunnus (ID): 40191127
97 ehdotukset
Etäprojekti
Aktiivinen 4 päivää sitten
Aseta budjettisi ja aikataulu
Saa maksu työstäsi
Kuvaile ehdotustasi
Rekisteröinti ja töihin tarjoaminen on ilmaista
97 freelancerit tarjoavat keskimäärin $160 USD tätä projektia

With over 7 years of solid industry experience as a Full Stack Developer and a staunch focus on Web Scraping, Web Automation alongside other relevant skills, I believe I am the perfect fit for your Verifiable Manufacturer Data Pipeline project. Having worked with top companies including Metlife GOSC, DXC Technologies and Elite Services, I not only ,possess immense experience working with large scale scrapers and regulated data but also understand the demand for deterministic and reliable systems which is key to your project's completion. I understand the task at hand; building an auditable pipeline devoid of 'probabilistic best-guess AI', integrating web search data and developing a transparent confidence scoring model, all driven by rule-based logic. This aligned well with my core competencies-demostrated ability to create clean, well-documented ETL scripts (Python, SQL or comparable), modular rules engines among others. Plus, my focus on ensuring determinism; as demonstrated by my ability to debug for any incongruencies is testament to my skill in delivering a final product that meets acceptance criterion 1 - Running the pipeline on the sample file enriches ≥95 % of rows with a manufacturer_id plus a confidence_score column.
$220 USD 4 päivässä
7,6
7,6

As a team of full-stack developers with extensive experience across diverse technologies, my team and I have the right skill set to deliver on your project's needs. Our prowess in Python, SQL, and web scraping, make us particularly suited for building clean and well-documented ETL scripts that can ingest, normalize, and enrich your existing data tables. In addition, our deep understanding of large-scale scrapers and regulated data, places us ahead when it comes to integrating the necessary APIs with impeccable accuracy. We'll ensure a robust pipeline that attaches the correct manufacturer to every SKU by sourcing from government databases and official manufacturer websites themselves as your requirements mandate. We guarantee a transparent confidence-scoring model that leaves no room for probabilistic guesswork ─ precisely what you seek. Our commitment doesn't just stop at delivering functional code; we recognize the importance of auditability in your solution. Hence, we'll build in meticulous logging and error-handling systems at every stage so future audits become effortless for you. By choosing us, you're not only buying our skills but our dedication to leaving consistently satisfied clients behind.
$140 USD 7 päivässä
6,8
6,8

As an experienced developer with a wide range of backend and frontend skills, I firmly believe I am the right fit for your project. With previous experience in building reliable ETL scripts, my command over Python and SQL will help me meticulously pull data from diverse sources and ensure high-quality integration into your existing tables. The fact that this data is regulated and requires a deterministic approach resonates with my penchant for detail-oriented work. Furthermore, in line with your project needs, I am well-versed in API integrations which will enable me to gather data from relevant websites and government databases accurately. Having developed scraping projects in the past, I have the technical expertise required to crawl or scrape authoritative details without compromising on the quality of the scraped data. One of my key strengths is the ability to develop modular code that not only allows easy tweaks but also guarantees data consistency, making it easier to trace every decision as you've indicated. My experience with logging and error-handling will ensure that our pipeline remains transparent and auditable at all times. Let's work together on this project to provide you with an efficacious and verifiable manufacturer data pipeline! Thanks....
$250 USD 7 päivässä
6,7
6,7

With over 12 years of experience in programming and technology, my team and I at CodeNomad specialize in delivering robust and scalable solutions, which is exactly what you need for your Verifiable Manufacturer Data Pipeline. As Top Rated Experts in Python and SQL - two languages profoundly relevant to this project - we can guarantee a clean, documented ETL process that will normalize and enrich your existing tables seamlessly. Our demonstrated expertise in developing large-scale scrapers and trust-based platforms predisposes us favorably to creating the determinism you're seeking for your database. Another advantage we bring to the table is our strong understanding of regulated data protocols. Given your emphasis on accurate information from official manufacturer websites and government databases, it's crucial to have a reliable professional with a history of maintaining strict compliance with such sources of truth. We always prioritize reference verification, ensuring that every enriched row is linked back to at least one stored record that guarantees the data's reliability. Thanks....
$250 USD 7 päivässä
6,3
6,3

Hi there, ★★★ Python / SQL / Web Scraping Expert ★★★ 9+ Years of Experience ★★★ To successfully complete this project, I will follow a structured approach to develop the verifiable manufacturer data pipeline. 1. Analyze the existing database and define the requirements for the ETL process (10 hours) 2. Research and integrate external data sources and APIs for retrieving manufacturer information (15 hours) 3. Develop the ETL scripts in Python, ensuring they are clean and well-documented (20 hours) 4. Build a modular rules engine for customizing source priority and matching logic (15 hours) 5. Implement the confidence-scoring function and logging mechanisms (15 hours) 6. Create setup documentation and a walkthrough video (5 hours) What I need from you: 1. Access to the current database and any existing documentation 2. A list of preferred external data sources or APIs you have in mind 3. Clarification on any specific regulations or standards to adhere to during development I look forward to connecting at your convenience to ensure the project's success. Best Regards, TechPlus Team
$800 USD 10 päivässä
6,3
6,3

I’m interested in the full-time FullStack Developer role at Abza First and align well with your tech stack and expectations. I’ve spent over 10 years building and maintaining web applications, working across frontend UI/UX and backend systems with a strong focus on performance and clean code. My strengths match what you’re looking for: • Frontend development with React (modern hooks, responsive UI, clean UX) • Backend development with Node.js (APIs, authentication, business logic) • Experience collaborating with designers, product owners, and QA teams • Writing scalable, maintainable code and debugging production issues • Building responsive interfaces that balance usability and performance While my recent work has been heavily focused on React + Node.js, I’ve also worked alongside .NET-based systems and can comfortably integrate or collaborate within mixed stacks. Professional background: • 10+ years overall software development experience • Former Boeing engineer—enterprise-grade quality mindset • Strong problem-solving skills and clear communication I’m open to a long-term, full-time engagement and can commit consistent hours while staying aligned with team goals and delivery timelines. I’d be happy to discuss availability, weekly commitment, and next steps in a quick call.
$210 USD 5 päivässä
6,0
6,0

https://www.freelancer.com/projects/data-scraping/Automated-Counterfeit-Detection/reviews Dear. Nice to meet you. I am very pleasure to submit my proposal on your scrapping and automation project. I have many experiences in these field using python. Recently, I developed Automated Counterfeit Detection and Reporting System on Amazon. You can check this in my portfolio. I am sure and I can start immediately. I will wait for your good news. Thank you.
$140 USD 2 päivässä
5,6
5,6

Hello client, I’ve carefully reviewed your job description and have strong experience in these Elasticsearch, Web Scraping, Python, Big Data Sales, ETL, SQL, API Integration and Data Mining. I can build a reliable web scraping solution tailored specifically to your needs. Whether using Node.js with Puppeteer/Cheerio or Python with Selenium/BeautifulSoup, I will extract, clean, and organize your data efficiently. I also handle anti-bot protections, pagination, and full automation as required. As you can see from my profile, my web scraping reviews are excellent, reflecting my commitment to quality work. I focus on writing clean, maintainable, and scalable code because I know the difference between 99% and 100%. If you hire me, I’ll do my best until you’re completely satisfied with the result. Let’s discuss your target website and preferred data format. Thanks, Denis
$120 USD 3 päivässä
5,4
5,4

⭐Hi, I’m ready to assist you right away!⭐ I believe I’d be a great fit for your project since I have extensive experience building reliable ETL pipelines focused on data accuracy and auditability. I can deliver well-documented Python and SQL scripts that handle data ingestion, normalization, and enrichment efficiently within your timeline and budget. I specialize in creating modular rules engines and confidence-scoring models that provide full transparency and traceability for each data record. My background includes scraping and integrating data from official sources and APIs while ensuring all outputs are deterministic and auditable. Your project focuses on solving the critical problem of attaching verified manufacturer data to SKUs in a fully traceable way. This system will remove uncertainty by relying strictly on authoritative sources and clear confidence weights, giving you trustworthy, repeatable results. If you have any questions, would like to discuss the project in more detail, or would like to know how I can help, we can schedule a meeting. Thank you. Maxim
$30 USD 6 päivässä
5,4
5,4

Hello, I understand you’re looking for a fully deterministic, auditable data pipeline that can reliably attach the correct manufacturer to every skincare and cosmetic SKU without relying on probabilistic AI guesses. I specialize in rule-based ETL systems where traceability, reproducibility, and verification are core requirements rather than afterthoughts. The pipeline will ingest your existing tables and enrich them using authoritative external sources, prioritising official manufacturer websites and relevant government databases. All matching logic will be handled through a modular rules engine, allowing source weighting, conflict resolution, and threshold tuning without modifying core code. Each resolved record will include a transparent confidence score derived from explicit rules, along with stored references to the exact URLs or API responses used, ensuring every decision is auditable. The solution will be delivered as clean, well-documented Python and SQL scripts with robust logging, error handling, and deterministic outputs. Re-running the same input will always produce identical results. Clear setup notes and a walkthrough will demonstrate a full end-to-end run on sample data, making ongoing audits and maintenance straightforward. Asif
$250 USD 3 päivässä
5,6
5,6

As an experienced data scientist and Python aficionado, I am well-versed in designing clean, well-documented ETL pipelines that can enhance and enrich datasets - exactly what your project needs. With a proficiency in creating modular rules engines for maximum flexibility without compromising core code and using deterministic models for data enrichment, I am confident in delivering an auditable pipeline that aligns with your needs. Moreover, my extensive experience in web scraping and familiarity with various APIs means I can help develop a robust system that relies on authoritative sources such as official manufacturer websites and relevant government databases. Handling conflicting information intelligently using a confidence-scoring function and systematically storing the rationale behind every decision are standard practices for me. Lastly, my technical dexterity extends to handling large-scale scrapers and regulated data. Being detail-oriented and leaving no room for errors is in fact a personal value - I ensure essential logging and error-handling systems are stitched into projects from the start to make audits seamless. So, let's leverage my skillset, turn this around quickly, and build your verifiable manufacturer data pipeline effectively!
$140 USD 7 päivässä
5,8
5,8

Hi, I can build a fully deterministic, rule-based pipeline to attach the correct manufacturer to each SKU using only authoritative sources (official sites + government databases). I’ll deliver clean ETL scripts, a configurable rules engine, transparent confidence scoring with stored source links, and full logging for auditability. The system will be reproducible (same input = same output) and documented with a sample run walkthrough. Experienced with structured scraping, data validation, and compliance-focused systems. Ready to start and deliver the first iteration quickly. Best,
$155 USD 1 päivässä
5,3
5,3

Hello! I have completed so many similar projects so far so I can show you my recent results while chatitng. I can build a fully deterministic manufacturer-matching pipeline for your skincare SKUs, no “best guess” AI. It will crawl official manufacturer sites and the relevant government databases, resolve conflicts with a configurable source-priority rules engine, and output manufacturer_id + confidence_score with a clear explanation and saved evidence links for every row. You’ll get Python ETL scripts, a rules/config file you can tweak without code changes, full logging, and repeatable runs that produce identical output from the same input. Share your sample file, target countries, and current schema, and I’ll deliver a first working iteration quickly. Warm regards, Yulius Mayoru
$50 USD 2 päivässä
5,1
5,1

Hello, I’m excited about the opportunity to contribute to your project. With my expertise in ETL pipelines, web scraping, and data normalization, I can build a robust, deterministic system to link the correct manufacturer to each product SKU, ensuring a fully auditable and transparent process. I’ll deliver clean, well-documented scripts with a modular rules engine and confidence-scoring model, ensuring you can easily tweak priorities and track every decision made during data enrichment. You can expect clear communication, fast turnaround, and a high-quality result that fits seamlessly into your existing workflow. Best regards, Juan
$140 USD 1 päivässä
5,0
5,0

Greetings, It sounds like you're looking to build a reliable system that accurately links manufacturers to skincare and cosmetic SKUs, ensuring every decision is traceable and auditable. My approach would involve developing a robust ETL pipeline that scrapes data from authoritative sources and utilizes rule-based logic to prioritize and verify this information. I’ll leverage Python and SQL to create clean scripts, along with a modular rules engine that allows for easy adjustments without altering the core code. Additionally, I will implement a confidence-scoring model that not only explains how each record was resolved but also maintains a log for easy audits. With experience in handling regulated data and building large-scale scrapers, I’m confident I can deliver a solution that meets your needs effectively. Best regards, Saba Ehsan
$150 USD 2 päivässä
4,8
4,8

Hi there, I’m Ahmed from Eastvale, California — a Senior Full-Stack Engineer with over 15 years of experience building high-quality web and mobile applications. After reviewing your job posting, I’m confident that my background and skill set make me an excellent fit for your project — Verifiable Manufacturer Data Pipeline . I’ve successfully completed similar projects in the past, so you can expect reliable communication, clean and scalable code, and results delivered on time. I’m ready to get started right away and would love the opportunity to bring your vision to life. Looking forward to working with you. Best regards, Ahmed Hassan
$120 USD 2 päivässä
4,8
4,8

Hello, I’ve reviewed your Verifiable Manufacturer Data Pipeline—an auditable, rule-based system that anchors to official manufacturer websites and government databases. I’m confident I can deliver a scalable Python/SQL solution with transparent confidence scoring and full provenance. What I’ll deliver: - Clean, well-documented ETL scripts (Python) to ingest, normalize, and enrich your SKU table, with modular adapters for static web sources and API feeds. - A config-driven rules engine (JSON/YAML) to tweak source priority, matching logic, and thresholds without touching core code. - A confidence-scoring function that logs the rationale and stores exact URLs/API calls used for every decision. - Robust logging and error handling, audit tables, and setup notes plus a markdown walkthrough video. - A compact, reproducible run on a sample dataset to demonstrate determinism. Implementation plan and timeline: MVP in 8–12 days, with a readme and a short video. I’ll start with two primary sources (official sites + government databases) and expand to additional APIs as needed. Technologies I prefer: Python, SQL (PostgreSQL), Elasticsearch for searchability, Scrapy/BeautifulSoup for scraping, and a lightweight rules-engine library. Acceptance criteria alignment: 1) ≥95% enrichment with manufacturer_id and confidence_score; 2) each enriched row references at least one stored URL/API; 3) deterministic re-run. If you’ve worked with regulated data or large-scale scrapers, I can leverage
$100 USD 2 päivässä
4,5
4,5

With my extensive experience in AI/ML solutions and complex systems, I am perfectly suited to undertake your Verifiable Manufacturer Data Pipeline project. I understand the importance of deterministic decisions when it comes to data verification and have built multiple systems based on rule-based logic and web search data. My robust scripts, written mostly in Python, will not only achieve what you desire but also enable future tweaks to source priority, matching logic, and thresholds without altering core code. Lastly, given my background in Data Engineering and being comfortable with large-scale computations, I am positive about meeting your criteria - 95%+ rows enriched, all rows linked to at least one openable reference (URL/API), determinism achieved when re-run on the same input. To prove my commitment beyond delivery, I will provide setup notes alongside a video or markdown walkthrough showcasing a full run on a sample dataset. Let's discuss the best tools/frameworks and arrive at an estimated turnaround for an outstanding first iteration.
$250 USD 2 päivässä
4,3
4,3

Hi there, I understand the critical need for a verifiable data pipeline to ensure accurate attachment of manufacturers to your SKUs. With extensive experience in building deterministic ETL pipelines, I am confident in delivering a solution that meets your requirements without relying on probabilistic AI. I have developed similar systems, integrating web scraping and API data retrieval, ensuring compliance with verification rules rooted in official manufacturer sources and government databases. My approach includes creating clean, well-documented ETL scripts in Python, a modular rules engine for easy adjustments, and a robust confidence-scoring function that traces every logic decision. I will also implement comprehensive logging and error-handling to facilitate audits effortlessly. I aim to deliver the initial iteration within 10 days, ensuring that the pipeline enriches at least 95% of rows with manufacturer_id and confidence_score. The output will be deterministic as per your criteria.
$250 USD 10 päivässä
4,2
4,2

Hi, I understand your need for a deterministic data pipeline to match skincare product SKUs with the correct manufacturers. I will build a rule-based logic system using web search data, transparent confidence scoring, and data from official manufacturer websites and government databases. I'll deliver clean ETL scripts, a modular rules engine, confidence-scoring function, logging, and error-handling for easy audits. I have experience with regulated data and web scraping tools, ensuring a quick and accurate solution for you. What is your preferred timeline for completion?
$155 USD 1 päivässä
3,9
3,9

Regesdorf, Switzerland
Maksutapa vahvistettu
Liittynyt helmik. 5, 2019
$50-80 USD
$30-250 USD
$30-250 USD
$30-250 USD
$1500-3000 USD
₹12500-37500 INR
$15-25 USD/ tunnissa
₹1500-12500 INR
$15-25 USD/ tunnissa
$250-750 USD
$10-30 USD
$2-8 USD/ tunnissa
£25000-50000 GBP
$30-250 USD
₹12500-37500 INR
$30-250 USD
₹750-1250 INR/ tunnissa
$30-250 USD
₹4000-6000 INR
$250-750 USD
$2-8 USD/ tunnissa
₹600-1500 INR
£20-250 GBP
$750-1500 AUD
$10-30 USD