
Closed
Posted
Paid on delivery
We are building a receipt processing system and are looking to purchase a high-quality dataset of real-world receipt images. We need 5,000 to 10,000 real receipt photos or scans for machine learning training.

This is NOT a scraping job. We are looking to acquire an existing dataset from someone who already has it.

⸻

Dataset Requirements

Receipts must be:
• Real-world retail or restaurant receipts
• Diverse merchants (grocery, restaurants, gas, retail, pharmacy, etc.)
• A mix of:
  • Photos taken with phones
  • Flatbed scans
  • Slightly crumpled or angled receipts
  • Various lighting conditions and backgrounds
• English-language receipts (U.S. required)

⸻

Image Quality

• JPG or PNG format
• Minimum resolution: 800px on shortest side
• No heavy blur
• Not synthetically generated
• Not downloaded from Google Images
• Not scraped from websites

We want authentic, natural variation.

⸻

Important – Legal & Rights

You MUST confirm:
• You legally own the dataset
• You have full rights to sell it
• It does not violate privacy laws
• You can grant us commercial usage rights

If data contains personal information:
• It must either be anonymized
• OR you must confirm you have consent to sell

We will require written confirmation of rights transfer.

⸻

What to Include in Your Proposal

Please provide:
1. Total number of receipts available
2. Sample images (10–50 examples)
3. Source of the dataset
4. Confirmation of commercial resale rights
5. Whether metadata exists (store name, totals, etc.)
6. Your price

⸻

Bonus (Optional but Valuable)

Extra value if dataset includes:
• Labeled data (merchant name, date, totals, line items)
• Bounding boxes or OCR output
• JSON structured fields
• Country diversity
• Handwritten tips or annotations

⸻

Budget

Open to fixed price proposals. We are willing to pay more for:
• Clean legal documentation
• High diversity
• Structured/labeled datasets

Low-quality scraped data will not be considered.

⸻

Long-Term Opportunity

If this dataset works well, we may need additional batches in the future.

⸻

If you have an existing receipt dataset and can legally sell it, please reach out.
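To illustrate the format and resolution minimums listed under Image Quality, sample images could be screened along these lines. This is only a minimal sketch: the function names are hypothetical, it assumes each image's width and height have already been read (for example with an image library), and it does not attempt to detect blur, synthetic images, or scraped content.

```python
from pathlib import Path

# Values taken from the Image Quality list: JPG or PNG, 800px on the shortest side.
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png"}
MIN_SHORT_SIDE_PX = 800


def meets_image_requirements(filename: str, width: int, height: int) -> bool:
    """Return True if the file type and resolution satisfy the posted minimums."""
    extension = Path(filename).suffix.lower()
    if extension not in ALLOWED_EXTENSIONS:
        return False
    # The requirement applies to the shorter of the two dimensions.
    return min(width, height) >= MIN_SHORT_SIDE_PX


def screen_samples(samples):
    """Split (filename, width, height) tuples into accepted and rejected filenames."""
    accepted, rejected = [], []
    for filename, width, height in samples:
        if meets_image_requirements(filename, width, height):
            accepted.append(filename)
        else:
            rejected.append(filename)
    return accepted, rejected


if __name__ == "__main__":
    # Hypothetical sample batch; dimensions are made up for illustration.
    batch = [
        ("receipt_001.jpg", 1200, 3000),
        ("receipt_002.png", 640, 1800),
        ("receipt_003.tiff", 2000, 4000),
    ]
    ok, bad = screen_samples(batch)
    print("accepted:", ok)
    print("rejected:", bad)
```

A check like this only covers the mechanical requirements; authenticity, diversity, and rights still need manual review of the samples and documentation.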
Project ID: 40258360
25 proposals
Remote project
Active 15 days ago
25 freelancers are bidding an average of $459 USD for this project

Hi there. We used to work for a US client years back. The data was all scanned and handwritten receipts from customers of a US jewelry showroom. If you want to see a sample, please message me. My quote is for 10,000 receipts. Best, Harish PS: It's dated way back (about 15 years ago).
$750 USD in 1 day
7.7

Hello, As an experienced freelancer with over 15 years of dedication to my work, I believe I can offer a uniquely valuable solution to your project. Though my profile mainly covers Lead Generation and Web Scraping, I have extensive expertise in Data Processing as well. Over the years, I have developed advanced skills in curating, maintaining, and handling large datasets that align directly with your requirements for this project. When it comes to acquiring and managing data at scale, my proficiency with automation tools such as Python, Scrapy, and Beautiful Soup will ensure not only accuracy in delivering receipt images but also the privacy and legality of all the data you are investing in. You requested specific information in the proposal (total number of receipts available, sample images, source of the dataset, confirmation of commercial resale rights, metadata availability, and pricing), and I am more than capable of meeting your expectations. Additionally, having dealt with various clients over time and being detail-oriented myself, I greatly appreciate the importance you place on legal documentation, diversity, structured/labeled datasets, and authenticity; these are hallmarks of my work style as well. My commitment to purchasing licenses legitimately and acquiring receipts from real-world sources means you can trust me not just with one dataset but also any future batches you may Thanks!
$600 USD in 3 days
5.8

Dear sir, I have experience working with structured document-image datasets for machine learning applications and understand the importance of authenticity, diversity, and clear legal rights. I can provide a real-world receipt dataset that meets your resolution and variation requirements, along with written confirmation of ownership and commercial usage rights. Sample images and dataset details, including metadata availability, can be shared upon request. Open to fixed pricing based on volume and labeling depth. Regards sujon
$350 USD in 5 days
5.8

Hello, I can supply a high-quality existing dataset of 5,000–10,000 real U.S. retail and restaurant receipt images (phone photos and scans) with natural variation, confirmed commercial resale rights, and written rights transfer documentation. I can provide sample images, dataset source details, metadata availability (including optional structured/JSON and labeled fields if included), and a fixed-price proposal upon discussion. Regards, Muhammad
$750 USD in 1 day
4.7

As an experienced Full Stack Developer and a Data Processing expert, I've built diverse and complex solutions for clients worldwide, similar to what you need for your receipt processing system. I know how crucial it is to acquire only real-world, high-quality datasets, which makes your project a perfect match for my skill set. With over 6 years in the field, I've honed my proficiency in image recognition and processing, as well as in ensuring data quality with minimal errors. I'm very familiar with Java, Python, and Machine Learning algorithms, which will be invaluable in handling, anonymizing (if needed), and structuring the data. Regarding your budget and the implications of personal information in the dataset, I assure you that I can provide comprehensive legal documentation alongside the dataset. Privacy is respected. If annotated data or bounding boxes are among your requirements, worry not! Leveraging my wide range of skills from C++ to PHP and relevant platforms like Laravel and Django, I can provide structured datasets with line items, OCR output, etc. But it's not just data management I excel at;
$251 USD in 3 days
4.4

Hello, As an accomplished Full-Stack Developer, my technical skills align well with the objectives of your project. I have a wealth of experience and proficiency in Python, which is crucial for handling large datasets and developing machine-learning models, a skill set that is integral to your receipt processing system. My skill in JavaScript and in developing responsive, seamless front-end interfaces will also ensure that the data collected can be presented logically and sensibly to users in your application. The web application I can create for you will be a refined and high-performance solution capable of managing extensive datasets like the ones you need for this project. I have a track record of delivering secure, optimized applications with clean code at the core. Additionally, my skill in automation tools and working with REST APIs adds value to your project, as these techniques can greatly facilitate and streamline data processes. I emphasize long-term usability in every project I work on, meaning that if you choose to work with me on this project, there's an opportunity to collaborate on future projects relating to new dataset batches or other requirements that may arise. I am by all means the person perfectly suitable for this task; with my proven competency in handling large-scale datasets, building advanced server-side architectures, and creating user-friendly interfaces, I guarantee not just meeting but Thanks!
$400 USD in 9 days
3.6

Hello, I can supply a legally sourced dataset of real-world U.S. retail and restaurant receipts suitable for ML training. The collection includes 7,800+ authentic receipt images (phone photos and flatbed scans) with natural variation—angles, lighting differences, mild crumpling, and diverse backgrounds. Resolution exceeds 800px on the shortest side, delivered in JPG/PNG format. No scraping, no synthetic generation. The dataset covers grocery, gas, pharmacy, quick service, dine-in, and general retail merchants across multiple U.S. states. All personal data has been anonymized (names, card numbers, phone numbers masked). I legally own the dataset and can grant full commercial usage rights with written transfer confirmation. Optional metadata includes merchant name, date, subtotal, tax, total, and payment type in structured JSON format. A subset (approx. 3,000 receipts) also includes OCR output and bounding boxes. I can provide 25 sample images upon request. Fixed price: $650 for full dataset with documentation and rights transfer agreement. Additional labeled expansion batches available. Client Clarification Questions: Do you require full line-item labeling or only header-level fields (merchant, date, totals)? Should sensitive financial elements be fully redacted or partially masked for model training?
$750 USD in 11 days
3.7

Hello there! I have developed a comprehensive skill set that is well suited to this project. Over the past 7+ years, I have created highly scalable web and mobile applications for various clients in the FinTech, SaaS, and trading sectors. I am adept at managing, processing, and analyzing large sets of data, which is critical for your receipt processing system. My experience extends to all aspects of software development necessary for successful project completion, from database management to integrating tricky third-party APIs. In the realm of trading automation, my expertise centers on creating robust and precise automated systems that execute trades in real time. The importance of this dataset for your machine learning training cannot be overstated, and as such I'm ready to offer full dedication to delivering a truly diverse dataset of real-world retail and restaurant receipts. I am also open to discussing both the total cost and the timeline for completion: while I am committed to providing a comprehensive inventory of 5-10k receipts, my attention to detail will not diminish even if this takes more time. With me assigned to this project, you will get not only valuable data but also peace of mind, knowing that everything is obtained securely and in compliance with all privacy rights and legal requirements.
$300 USD in 5 days
3.4

Hello, After reviewing your project on domestic charter flight analysis, I clearly understand that this requires not just data processing, but strategic insight that supports financial and operational decision-making. As a Data Scientist with strong expertise in Python, R, and advanced Excel analytics, I specialize in transforming complex datasets and transcripts into structured, decision-ready insights.

For your project, I will:
✔ Conduct structured qualitative and quantitative analysis aligned with your objectives
✔ Clean, validate, and structure raw datasets for analytical accuracy
✔ Apply statistical modeling and exploratory data analysis (EDA) using Python/R
✔ Identify key patterns, cost drivers, and performance indicators
✔ Deliver a clear analytical report with visualizations and actionable recommendations

My focus is not just on analysis, but on delivering insights that directly support smarter, data-driven decisions.

You can expect:
• Well-documented methodology
• Transparent working files (Python/R scripts or Excel models)
• Clear visual dashboards (if required)
• On-time delivery with professional communication

I am ready to begin immediately and would welcome the opportunity to review your dataset to ensure the analysis fully aligns with your strategic goals. Looking forward to delivering measurable value to your project. Best regards,
$250 USD in 2 days
3.5

Hello, With my diverse skill set in frontend, backend, databases, and AI services, I'm ideally suited to meet your needs of procuring 5,000 - 10,000 real-world receipt images for ML training. Leveraging my expertise in Python libraries like TensorFlow, PyTorch, and scikit-learn, I can ensure that the gathered dataset will not only adhere to your criteria but also offer impeccable accuracy. In addition to being adept in ML and computer vision with advanced tools like OpenCV and AWS, I also specialize in data processing. Coupling this competency with my proficiency in PostgreSQL and MongoDB, I can meticulously curate a structured dataset for you replete with labeled data, bounding boxes, JSON structured fields, and any other components you deem valuable. Furthermore, I am well-acquainted with the significance of legalities surrounding dataset ownership. As a responsible professional and adherent to privacy laws, I guarantee that my offering is backed up by legitimate rights of sale along with written confirmation. In short, by entrusting me with this project you will not only acquire a top-notch dataset but also gain access to a long-term partner for potential future batches. Look forward to hearing from you soon! Thanks!
$250 USD in 15 days
0.0

Hello, How are you? I have checked your job description and I'm confident I can complete exactly what you need. I have extensive experience with image processing and data analysis, specifically in curating datasets for machine learning. With a focus on authentic and diverse receipt images, I can provide a collection that fully meets your stringent requirements. My dataset includes real-world receipts from a variety of merchants, captured under different conditions and quality standards, ensuring genuine representation. Rest assured, I hold full legal rights to the dataset, and all necessary documentation can be provided for confirmation. My experience in this domain makes this task an ideal fit for my skills. Please send me a message so that we can discuss more. Thanks!
$600 USD in 4 days
0.0

I own a real, legally compliant U.S. receipt dataset with high diversity, clean images, full resale rights, and samples ready for review.
$278 USD in 7 days
0.0

I have carefully reviewed your project requirements and I’m confident I can deliver accurate and high-quality data entry work. I have strong attention to detail and experience with Excel, Google Sheets, and web research. ✅ 100% accurate data entry ✅ Fast turnaround time ✅ Well-organized Excel/Google Sheets ✅ Quick communication and updates
$500 USD in 7 days
0.0

Hey, I just finished reading the job description and I see you are looking for someone experienced in Image Analysis, Image Processing, Data Processing and Image Recognition. This is something I can do. Please review my profile to confirm that I have great experience working with these tech stacks. I have a few questions: 1. Are these all the requirements? If not, please share more detailed requirements. 2. Do you currently have anything done for the job, or does it have to be done from scratch? 3. What is the timeline to get this done? Why Choose Me? 1. I have done more than 250 major projects. 2. I have not received a single piece of bad feedback in the last 5-6 years. 3. You will find 5-star feedback on the last 100+ major projects, which shows my clients are happy with my work. Timings: 9am - 9pm Eastern Time (I work as a full-time freelancer). I will share my recent work with you in the private chat due to privacy concerns! Please start the chat to discuss it further.
$250 USD in 1 day
0.0

Hello there, We bring 8 years of experience in ML data pipeline engineering and structured extraction from document images — exactly what your receipt dataset project demands. Our approach: source from licensed dataset marketplaces (Roboflow Universe, Kaggle commercial-licensed sets, receipt-management app providers selling anonymized data), validate every image against your specs — 800px minimum, no synthetic or web-scraped data — then run perceptual hashing for deduplication and PII redaction before delivery, with written rights transfers for every source. For metadata extraction we'd use Tesseract paired with a fine-tuned LayoutLMv3 — Tesseract for raw OCR, LayoutLMv3 because it understands document spatial layout, giving accurate bounding boxes and structured JSON for merchant, date, totals, and line items. A human QA pass on a 10% sample ensures extraction quality. As proof: we've built AI extraction pipelines processing 60,000+ records from messy real-world documents for compliance reporting. Biggest risk is source licensing ambiguity — we mitigate this by requiring written commercial-use documentation from every provider before ingesting a single image. Delivery in two phases: Phase 1 (week 1) — 500 validated samples with full metadata. Phase 2 (weeks 2–3) — complete dataset with legal docs and structured labels. Daily updates via your preferred channel. Naveen Brainstack Technologies
$400 USD in 21 days
0.0

If the receipts aren’t legally clean, your ML model becomes a liability instead of an asset. Before anything else, are you strictly looking for a fully owned dataset ready for transfer, or would you consider a curated private dataset assembled from controlled contributors with signed releases? I’ve worked on data acquisition projects where the real challenge wasn’t volume, it was rights validation and variation quality. In one case, we rejected 60 percent of a batch because the lighting, merchant diversity, and angle variation weren’t strong enough for training robustness. We rebuilt it with structured metadata and consent documentation, which made the dataset commercially safe and model ready. If I move forward with you, I would deliver a legally transferable dataset within your 5,000 to 10,000 range, including mixed phone captures and scans, authentic real world variation, minimum 800px resolution, and structured folders. I can also include labeled fields in JSON format with merchant name, totals, dates, and optional OCR output depending on depth required. Written rights transfer documentation would be included. To align properly: do you require all personal data fully anonymized at pixel level, or is consent backed usage acceptable? And is U.S. only a hard requirement for the full batch? If this direction makes sense, I can outline dataset structure, pricing tier options, and delivery timeline immediately.
$300 USD in 7 days
0.0

Boulder, United States
Payment method verified
Member since Jan 13, 2011