
Suljettu
Julkaistu
Maksettu toimituksen yhteydessä
I am seeking a freelancer to build a full automatic data extraction and enrichment pipeline for Spanish procurement PDF documents. Scope of Work: Phase 1 – PDF Extraction Extract product names and key features from Spanish PDF files. Clean and structure the data. Output results in Excel (.xlsx) format. Phase 2 – Web Search & Price Extraction Use each product name as a Google search query, restricted to a specific domain. Analyze the top 5 search results per product. From each URL, extract with high precision: Price (must extract 5 accurate prices per product) Brand Product features and description Identify similar products, not only exact matches. Measure similarity using methods such as Levenshtein distance or cosine similarity. Deliverables Final datasets in Excel (.xlsx) and JSON (.json) formats. (Do not forget price extraction must be very precise and sufficient. We need 5 successful URL scraping) A detailed [login to view URL] explaining the full workflow, tools, and how to reproduce the process. Technical Notes Use of LLMs for extraction, similarity analysis, and enrichment is highly recommended. The solution must be accurate, efficient, and fully reproducible.
Projektin tunnus (ID): 40182535
54 ehdotukset
Etäprojekti
Aktiivinen 14 päivää sitten
Aseta budjettisi ja aikataulu
Saa maksu työstäsi
Kuvaile ehdotustasi
Rekisteröinti ja töihin tarjoaminen on ilmaista
54 freelancerit tarjoavat keskimäärin $57 USD tätä projektia

Hi there, I’ve carefully reviewed the requirements for your GenAI project and I’m confident that my expertise in building NLP pipelines using Hugging Face and LangChain can meet your expectations. My experience includes working with large language models (LLMs) for Retrieval-Augmented Generation (RAG), as well as fine-tuning models with custom datasets to enhance text generation. I’ve successfully completed similar projects where I applied these techniques in Python to build robust, client-specific solutions. I would love the opportunity to discuss how I can leverage my skills to develop a tailored solution for your project. Feel free to take a look at my portfolio to get a sense of the work I’ve done: Portfolio: https://www.freelancer.com/u/webmasters486 Looking forward to hearing from you! Best regards, Muhammad Adil
$100 USD 1 päivässä
6,1
6,1

Greetings, I shall design a robust, automated system that shall extract and enrich data from your Spanish procurement PDFs. With the aid of cutting-edge LLMs that can extract text with precision and analyze similarities, the system shall carry out targeted web searches, accurately extract data from the top results, and generate verified data in Excel and JSON formats, along with comprehensive documentation for maximum reproducibility. Regards, Joseph
$120 USD 1 päivässä
6,1
6,1

Hello, I can build a fully automated, reproducible extraction and enrichment pipeline tailored specifically to Spanish procurement PDFs, combining robust PDF parsing with LLM-assisted data cleaning, semantic matching, and high-precision price extraction. I’ll extract and normalize product names and features from the PDFs, then run controlled domain-restricted Google searches per product, analyze the top five results, and reliably scrape five accurate prices per item while identifying close and similar products using cosine similarity and Levenshtein distance to ensure relevance beyond exact matches. The entire workflow will be implemented in a clean, maintainable Python pipeline, delivering both Excel and JSON outputs, along with a clear README explaining architecture, tools, and step-by-step reproducibility so you can rerun or extend it with confidence. Regards, Zafar
$100 USD 1 päivässä
6,3
6,3

Hi client, I’ve carefully reviewed your job description and have strong experience in these Data Analysis, Python, Data Entry, Data Extraction, Excel, Data Processing, Natural Language Processing, Web Scraping, JSON and API Integration. I can build a reliable web scraping solution tailored specifically to your needs. Whether using Node.js with Puppeteer/Cheerio or Python with Selenium/BeautifulSoup, I will extract, clean, and organize your data efficiently. I also handle anti-bot protections, pagination, and full automation as required. As you can see from my profile, my web scraping reviews are excellent, reflecting my commitment to quality work. I focus on writing clean, maintainable, and scalable code because I know the difference between 99% and 100%. If you hire me, I’ll do my best until you’re completely satisfied with the result. Let’s discuss your target website and preferred data format. Thanks, Denis
$65 USD 1 päivässä
5,4
5,4

Hello, I understand you need a full automatic data extraction and enrichment pipeline for Spanish procurement PDFs. I will deliver a robust solution that efficiently extracts product names and features from PDFs, cleans and structures the data, and outputs the results in both Excel and JSON formats. Additionally, I'll utilize web scraping techniques to gather five accurate prices and detailed product information, ensuring high precision and leveraging advanced similarity analysis methods. Please check my profile for examples of similar projects I've successfully completed. Regards, Davide
$65 USD 1 päivässä
4,8
4,8

I can build a fully automated pipeline to extract product data from Spanish PDFs, enrich it via domain-restricted search, scrape 5 precise prices per product, apply similarity scoring (Levenshtein/cosine), and deliver clean Excel/JSON with a reproducible README using LLMs.
$35 USD 1 päivässä
4,9
4,9

Hi there, I can build a fully automated data extraction and enrichment pipeline for your Spanish procurement PDFs. In Phase 1, I’ll extract product names and key features, clean and structure the data, and output reliable Excel files. In Phase 2, I’ll perform domain-specific web searches for each product, analyzing the top 5 results per query. From each URL, I’ll extract 5 accurate prices, brand, product features, and descriptions. I’ll also identify similar products using methods like Levenshtein distance or cosine similarity to ensure comprehensive enrichment. Leveraging LLMs for extraction and semantic matching will enhance accuracy and efficiency. You’ll receive final datasets in Excel (.xlsx) and JSON (.json), along with a detailed README documenting the workflow, tools, and reproducible steps. The solution will be precise, efficient, and fully reproducible. Regards, Ahmad
$55 USD 7 päivässä
4,5
4,5

Dedicated Freelancer Ready to Elevate Your Project for Spanish Procurement PDF and Data Extraction -- 3. I have a solid background in Data Extraction, API Integration, Web Scraping, Data Processing, JSON, Python, Excel, Data Entry, Data Analysis and Natural Language Processing, I bring valuable expertise to your project. I have successfully completed many projects with 100% client satisfaction. Clear and timely communication is my priority. I believe in keeping you informed throughout the project lifecycle. I am available for a discussion at your earliest convenience. Please feel free to contact me to further discuss your project details. Thank you for considering my bid. I am excited about the opportunity to contribute to the success of your project. Please visit my portfolio to check my previous work samples, here - https://www.freelancer.com/u/GraphicsHub2k24?page=portfolio&w=f&ngsw-bypass= Best regards, Muhammad Asim Khan
$10 USD 1 päivässä
4,3
4,3

I am a Spanish marketeer with over 20 years of experience and C2-level proficiency in English. I work with a qualified technical team and regularly manage complex, data-driven projects involving automation, extraction, and enrichment workflows. I am interested in building the fully automated pipeline you describe for Spanish procurement PDFs. The solution would start with accurate extraction of product names and key features from PDF files, followed by data cleaning and structuring into Excel format using Python-based tools. In the enrichment phase, each product would trigger a domain-restricted Google search. The system would analyze the top five results per product and extract, with high precision, five valid prices, brand information, descriptions, and key features. Similar and alternative products would be identified using semantic and string-based similarity methods such as cosine similarity and Levenshtein distance, rather than relying only on exact matches. The full workflow would be reproducible, scripted end to end, and supported by LLMs where they add value for extraction and similarity analysis, while maintaining strict validation controls. Final deliverables would include Excel and JSON datasets, plus a clear README explaining tools, logic, and how to reproduce the process. I am comfortable working in phases, validating outputs at each step, and adjusting based on feedback.
$100 USD 7 päivässä
4,3
4,3

Hello , I've just reviewed your project description regarding the Spanish Procurement PDF and Data Extraction -- 3 and I'm confident in my ability to meet your expectations. With over 7 years of experience as a Senior Graphic Designer, I possess a strong skill set in Data Entry, API Integration, Excel, Data Analysis, JSON, Web Scraping, Python, Data Processing, Natural Language Processing and Data Extraction I kindly request you to take a moment from your busy schedule to explore our portfolio, where you can see the quality of my work and read feedback from previous clients: [Portfolio Links] https://www.freelancer.com/u/afshan2176 Could you please specify the final file formats you'll require? Feel free to award me the project so that we can discuss it further. Looking forward to connecting with you. Best regards, Afshan Z.
$10 USD 1 päivässä
3,7
3,7

⭐ If you award me, your smile shows up ⭐ Hi , Your project immediately stood out to me—it closely matches work I’ve completed successfully in the recent past. The core challenges, structure, and technical requirements are very familiar, with only a few unique elements that align perfectly with my expertise. This is great news for you: it allows me to skip the usual ramp-up time, avoid trial-and-error, and deliver clean, high-quality results quickly and confidently. I bring hands-on experience with JSON, Data Analysis, API Integration, Data Extraction, Python, Data Entry, Web Scraping, Natural Language Processing, Excel and Data Processing, along with proven workflows and best practices refined through multiple similar projects. You can view a directly relevant example in my portfolio here: https://www.freelancer.com/u/thomasb726 I’d be happy to discuss your specific goals in more detail and share tailored ideas based on what has worked best in comparable scenarios. Why clients choose—and continue working with—me: • Clear, proactive communication so you always know where the project stands • Strong respect for your deadlines, budget, and business reputation • Responsive, approachable, and focused on a smooth, stress-free process • Reliable post-delivery support that often leads to long-term partnerships If you’re looking for precise execution, high-quality results, and a dependable long-term partner, I’d love to connect and help bring your project to life. Best regards
$100 USD 1 päivässä
3,5
3,5

Hey , I just went through the project description, and I see you are looking for someone experienced in Data Entry, Natural Language Processing, Python, Web Scraping, API Integration, Data Extraction, JSON, Excel, Data Processing and Data Analysis. It instantly reminded me of a client who faced similar challenges, and I knew I had a tailor-made solution for it. Please review my profile to confirm that I have great experience working with these tech stacks. While I have few questions: • Is there anything else you’d like to add to the project details? • What’s the top hurdle you’re facing with this project? • What is the timeline to get this done? Why Choose Me? 250+ Projects. 5 Years. Zero Misses. My reputation is built on a single metric: Flawless Execution. While others promise quality, my last 100+ consecutive 5-star reviews prove it. I don’t just finish the job; I set the standard. Timings: 9am - 9pm Eastern Time (I work as a full time freelancer) The portfolio here is just the tip of the iceberg. To respect client confidentiality, my recent heavy-hitters aren't public, but I can share them 1-on-1. Click the 'Chat' button, and I’ll send over the relevant samples immediately for your review. Regards, Abdul Haseeb Siddiqui.
$10 USD 5 päivässä
3,7
3,7

As an accomplished Excel specialist with years of experience precisely handling large data sets, I am confident in my ability to build the full automatic data extraction and enrichment pipeline you require. My expertise lies in strategic data consolidation, accurate automation, and detailed result.
$12 USD 1 päivässä
3,2
3,2

I understand you want a fully automated pipeline that transforms Spanish procurement PDFs into structured, enriched datasets—accurate, reproducible, and with high-precision pricing data. You’re clearly trying to avoid manual extraction, partial results, or inconsistent outputs. My approach is end-to-end and modular: first extract and clean text from PDFs, then enrich each product entry with structured web data, including multiple price points and similarity-matched alternatives, all reproducible and transparent. LLMs and robust matching algorithms ensure both accuracy and scalability. * PDF text extraction and normalization into structured Excel/JSON * Web-based enrichment: top 5 results per product from target domains * Accurate extraction of price, brand, features, and description * Similar product identification using Levenshtein/cosine similarity * Full reproducibility with clear README documenting workflow and tooling I’ve built similar automated data pipelines for procurement and e-commerce datasets where precision and reproducibility were critical. The workflow will ensure you consistently get 5 verified prices per product along with structured enrichment for further analysis. Sincerely, Adnan
$79,99 USD 7 päivässä
3,2
3,2

Hello , ⏰☎️⏰ Gaining time means gaining everything. I won’t waste your precious time. I have 5 years experienced Software Engneer such as Data Entry, Python, Web Scraping, API Integration, Data Extraction, Data Analysis, Excel, Data Processing, Natural Language Processing and JSON. I fully understand your requirement. so I will implement your project within 3~4 days. ✅ Can you share your current workflow details and any preferred design ideas? ✅ Do you have a deadline for deployment? I’m ready to deliver a smooth, visually appealing solution that works well in the Microsoft ecosystem. I appreciate your consideration. Warm regards, Jordan
$65 USD 4 päivässä
2,4
2,4

Hello, I am excited about the opportunity to create an automated data extraction and enrichment pipeline specifically for Spanish procurement PDF documents. With over 9 years of experience in Python development and a solid background in web scraping and data processing, I'm confident in my ability to deliver accurate and reliable results for your project. I have successfully completed similar projects that required precise data extraction and enrichment. I will begin by extracting product names and key features efficiently from the PDFs and follow through with a meticulous web search for price and brand information, ensuring to extract five accurate prices per product. My approach will leverage advanced techniques like LLMs to enhance the accuracy and efficiency of the data extraction and similarity analysis. I can start the project immediately, and I assure you of my commitment to deliver high-quality datasets in both Excel and JSON formats along with comprehensive documentation.
$88 USD 10 päivässä
2,5
2,5

Hi there, I have 7+ years of experience in Data Extraction, Excel, Data Analysis and can deliver a clean, reliable solution for your project. I value clear communication and timely delivery, and I’m ready to get started immediately. Let’s connect and discuss your goals. Best regards, Dorian
$55 USD 1 päivässä
2,5
2,5

Hey , I just finished reading the job description and I see you are looking for someone experienced in API Integration, Data Extraction, Data Processing, Excel, Natural Language Processing, Web Scraping, Python, Data Entry, Data Analysis and JSON. This is something I can do. Please review my profile to confirm that I have great experience working with these tech stacks. While I have few questions: 1. These are all the requirements? If not, Please share more detailed requirements. 2. Do you currently have anything done for the job or it has to be done from scratch? 3. What is the timeline to get this done? Why Choose Me? 1. I have done more than 250 major projects. 2. I have not received a single bad feedback since the last 5-6 years. 3. You will find 5 star feedback on the last 100+ major projects which shows my clients are happy with my work. Timings: 9am - 9pm Eastern Time (I work as a full time freelancer) I will share with you my recent work in the private chat due to privacy concerns! Please start the chat to discuss it further. Regards, Salik.
$10 USD 4 päivässä
1,4
1,4

Hello, I’m Ankur, a freelance developer with a dedicated team of professionals. I read all your requirements for Website and I assure you that I will provide high-quality work at the proper time. Additionally, we also provide you 3 months of support from our side. As a Full Stack Developer, I specialize in Web and App Development, boasting a portfolio of stunning projects with top-notch UI/UX design. My expertise spans Flutter (for both Android and iOS), PHP, and WordPress, and I bring over 7 years of experience to the table. Whether it’s websites, applications, or e-commerce platforms, I’ve got you covered. But I’m not limited to just coding. My skill set extends to graphic design and logo creation, offering you a one-stop solution for all your project needs. With a track record of over 500 completed projects, I am committed to delivering nothing short of excellence. My ultimate goal is your complete satisfaction. Thank you for considering me for your project. I’m ready to transform your vision into a reality that stands out in today’s competitive landscape. Best Regards, Ankur Hardiya
$55 USD 7 päivässä
0,2
0,2

With over 10 years' experience in web and app development, as well as expertise in Full Stack Web Development, CMS & E-commerce, and Server Management, I believe I am highly qualified for your project. I have successfully built automation systems leveraging Python throughout my career and have a strong grasp on API integrations. Given the unique nature of your project, I believe my Multi-Language Models expertise can add significant value to your Spanish Procurement PDF task. My exposure with Levenshtein distance and cosine similarity for product comparison, extraction and enrichment has equipped me with the skills necessary to complete the task accurately and efficiently. Additionally, I have a good grip on SQL which will make it easier for me to deliver final datasets from multiple sources in various formats. Choosing me isn't only about selecting a competent developer but rather gaining a reliable partner. Throughout my professional journey, be it serving as a Lead Developer or Project Manager, I've understood the importance of delivering quality work on time while maintaining open communication with clients. My dedication to providing clean coded solutions will definitely help you achieve your desired outcomes within budget. Let's initiate a conversation to discuss tailoring the automated pipeline you need for seamless data migration!
$55 USD 2 päivässä
0,0
0,0

Lausanne, Switzerland
Maksutapa vahvistettu
Liittynyt jouluk. 9, 2025
$10-100 USD
$10-100 USD
$10-100 USD
$10-100 USD
$30-250 CAD
€750-1500 EUR
$30-250 CAD
$2-8 USD/ tunnissa
$14-30 NZD
$10-30 USD
$30-250 USD
₹750-1250 INR/ tunnissa
$15-25 USD/ tunnissa
€250-750 EUR
₹1500-12500 INR
₹100-400 INR/ tunnissa
₹12500-37500 INR
₹750-1250 INR/ tunnissa
$10-35 USD
£18-36 GBP/ tunnissa
£250-750 GBP
$15-30 USD/ tunnissa
$10-30 CAD
₹600-1500 INR