
Open
Posted
Paid on delivery
I have uploaded a Google Drive folder here: [login to view URL]

Inside each sub-folder you will see one Excel file that contains the voter list for a single part number. The folders themselves are already arranged by part number, so locating the correct source files is straightforward. The file is named "Voter list" in every folder. There are close to 10,000 Excel files in the drive.

What I need you to do
• Open only the voter list Excel file in every part-number folder and merge the data into one master Excel workbook.
• In that master file, perform an exact-match duplicate check using this key: Name + Relative Name + Age + Gender + House Number.
• For every duplicate row, create two full records: one tagged “Source”, the other “Copy”. Each record must include:
– Serial No.
– Part No.
– Assembly No. & Assembly Name
– Name, Relative Name, Age, Gender, Door (House) No. (one version is also required without the door number being part of this list)
– EPIC ID of both source and copy
• Once the master workbook is complete, split the data back out into separate Excel files, each file representing its original Part-Assembly combination, and place them in the same folder structure that currently exists.
• Also count and report how many flags there are of each variety; the flags are marked in a separate column.

Acceptance criteria
1. A single consolidated Excel workbook containing every voter record, with duplicates clearly paired and fully detailed.
2. Part-wise / assembly-wise Excel files regenerated from that master and stored in their respective folders.
3. No data loss: row counts in the split files must equal the counts in the original source plus any duplicate copy rows you have added.

You are free to use Python (pandas), Power Query, VBA, or any equivalent tool, so long as every step above is achieved and the final output remains in standard Excel format (.xlsx).

This is a very urgent project, to be completed within 4 hours. Code is not required; only the output is required.
In total there are roughly 65 lakh (6.5 million) records. This is a rough sample output: [login to view URL]. In this sample, the assembly name still has to be added, and the door number has to be added to the matching criteria.
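The exact-match duplicate check described in the brief could be sketched in pandas roughly as follows. This is only an illustration under stated assumptions, not the client's or any bidder's actual method: the column names (`Name`, `Relative Name`, `Age`, `Gender`, `House No`, `EPIC ID`) are hypothetical, the key columns are assumed to have no missing values, and the first row in each matching group is assumed to be the "Source" with every later row a "Copy". The same function covers both requested versions of the key, with and without the door number.

```python
import pandas as pd

# Hypothetical column names; the real headers in the voter list files may differ.
KEY_WITH_DOOR = ["Name", "Relative Name", "Age", "Gender", "House No"]
KEY_WITHOUT_DOOR = ["Name", "Relative Name", "Age", "Gender"]


def tag_duplicates(master: pd.DataFrame, key: list[str]) -> pd.DataFrame:
    """Return only the rows that share `key` with another row, tagged Source/Copy."""
    # keep=False marks every member of a duplicate group, not just the repeats.
    dupes = master[master.duplicated(subset=key, keep=False)].copy()
    # Position of each row inside its key group: 0 for the first row, 1, 2, ... after.
    position = dupes.groupby(key, sort=False).cumcount()
    dupes["Tag"] = position.map(lambda n: "Source" if n == 0 else "Copy")
    # Carry the Source row's EPIC ID onto every row of the group so pairs are traceable.
    dupes["Source EPIC ID"] = dupes.groupby(key, sort=False)["EPIC ID"].transform("first")
    return dupes


# Tiny made-up sample; rows 0 and 2 match on the full key, row 3 differs only by house number.
master = pd.DataFrame({
    "Part No": [1, 1, 2, 3],
    "Name": ["A", "B", "A", "A"],
    "Relative Name": ["R", "S", "R", "R"],
    "Age": [40, 35, 40, 40],
    "Gender": ["M", "F", "M", "M"],
    "House No": ["12", "7", "12", "15"],
    "EPIC ID": ["E1", "E2", "E3", "E4"],
})

with_door = tag_duplicates(master, KEY_WITH_DOOR)
without_door = tag_duplicates(master, KEY_WITHOUT_DOOR)
```

Exact matching is sensitive to stray whitespace and letter case, so in practice the key columns would probably need to be normalised before this step.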
Project ID: 40355899
10 proposals
Remote project
Active 3 days ago

Hi, I can start immediately and handle this as a large-scale data-processing job, not a manual Excel task. I’ll use Python/pandas to merge the voter list files from the Drive structure, apply the exact duplicate logic using Name + Relative Name + Age + Gender + House Number, generate paired Source/Copy duplicate records with all required fields, count the flag varieties, and split the final output back into the original part/assembly-wise folder structure. I understand this is urgent and output is what matters most. My focus will be accuracy, no data loss, clear duplicate pairing, and proper row-count validation so the final master workbook and regenerated Excel files are dependable and ready to use. Best regards. Ankit.
₹3,000 INR in 1 day
0.0
10 freelancers are bidding an average of ₹6,538 INR for this project

Hey there, Glane here, hope you're doing well. I can help you with data manipulation and with creating a single consolidated sheet using pandas. Feel free to get in touch.
₹5,500 INR in 1 day
6.3

Hello, I will process 65 lakh voter records across 10,000 Excel files within 4 hours using Python. I will consolidate all data, find duplicates based on Name + Relative Name + Age + Gender + House Number, tag Source/Copy records, add a door-number-free version, count flags, and split the data back into the original folder structure with no data loss. Deliverables: master Excel, regenerated folders, flag report. Ready to start immediately. Regards, Zafar
₹12,000 INR in 1 day
6.3

Hi, I can merge all 10,000 voter list Excel files (approx. 65 lakh records) into one master workbook, perform exact duplicate matching using Name + Relative Name + Age + Gender + House No. (including Door No.), and generate clearly tagged “Source” and “Copy” records with all required fields, including Assembly details and EPIC IDs. I will also count flag varieties and regenerate part/assembly-wise Excel files back into the same folder structure with zero data loss. I can complete this urgent task within 4 hours and deliver clean, final .xlsx outputs exactly as required. Best Regards, Virendra
₹7,000 INR in 3 days
6.0
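The flag count that the brief asks for, and that the bid above promises, could look something like the sketch below. It uses only the standard library and assumes each record is available as a dictionary with a flag column named `Flag`; that column name and the sample flag values are hypothetical, since the brief does not say what the flags are called.

```python
from collections import Counter


def count_flags(rows, flag_column="Flag"):
    """Count how many rows carry each distinct value in the flag column."""
    counts = Counter()
    for row in rows:
        # Treat missing or whitespace-only flags as one "(blank)" variety.
        value = str(row.get(flag_column) or "").strip()
        counts[value if value else "(blank)"] += 1
    return counts


# Made-up records; "Flag A" and "Flag B" stand in for whatever flags the files contain.
sample_rows = [
    {"Name": "A", "Flag": "Flag A"},
    {"Name": "B", "Flag": "Flag B"},
    {"Name": "C", "Flag": "Flag A "},
    {"Name": "D", "Flag": ""},
]

flag_report = count_flags(sample_rows)
for flag, total in flag_report.most_common():
    print(f"{flag}: {total}")
```

Counting blanks as their own variety keeps the report total equal to the number of records, which makes it easy to cross-check against the row counts.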

Hello there. My philosophy is: “Quality is never an accident; it is always the result of intelligent effort!” I am a professional multilingual service provider and would welcome the chance of long-term cooperation. My expertise and the services I provide: ✔️Excel ✔️DAX ✔️Python ✔️SQL ✔️Tableau ✔️Scraping ✔️PPT ✔️Google Sheets ✔️Typing ✔️Web Scraping ✔️Data Extraction ✔️Data Collection ✔️Leads ✔️Spreadsheet Formulas ✔️PDF/MS Expert ✔️Power BI Specialist ✔️Functions ✔️Macros ✔️VBA ✔️Dashboards ✔️Graphs ✔️Charts ✔️Pivot Tables ✔️Flow Charts ✔️Odoo ✔️Data Analysis ✔️Data Processing. Please leave a message in the chat box. Thank you!
₹1,500 INR in 1 day
4.7

I will efficiently merge all 10,000 voter Excel files, detect exact duplicates using your criteria, generate a clean master file, and recreate perfectly structured part-wise outputs with zero data loss within your deadline.
₹2,500 INR in 3 days
3.9

Handling a dataset of this magnitude, approximately 6.5 million records, requires a precise and efficient approach to ensure data integrity while performing complex duplicate detection based on multiple key fields. The challenge lies not only in merging thousands of voter lists scattered across numerous folders but also in accurately tagging duplicates with comprehensive details and then re-segregating the data into their original folder structures without any loss. This task demands meticulous data wrangling and robust processing to deliver a seamless master workbook and correctly partitioned output files.
Leveraging advanced data processing capabilities in Python, particularly with pandas for efficient handling of large Excel datasets, will facilitate the exact-match duplicate detection on composite keys including Name, Relative Name, Age, Gender, House Number, and Assembly Name. The process will involve consolidating all source files into a master dataframe, implementing a reliable duplicate tagging system that generates paired “Source” and “Copy” records with all required fields, and then slicing the master data back into partitioned Excel files that mirror the original folder hierarchy. The use of optimized I/O operations and memory management will ensure timely completion within the tight four-hour window.
Commitment to accuracy and completeness will be paramount, guaranteeing that no data is lost and that all duplicates are correctly identified and reported with full transparency. The final deliverables will be fully compliant with the specified format, enabling easy verification and further analysis. Ready to start immediately and deliver a comprehensive, error-free solution within the deadline. Please share access details so the process can begin without delay.
₹11,250 INR in 7 days
2.8
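The step of slicing the master data back into part-wise files, together with the brief's "no data loss" criterion, might be sketched as below. This is a hedged illustration only: the column names `Assembly No` and `Part No` and the folder layout `<root>/<assembly>/<part>/voter list.xlsx` are assumptions, because the real Drive structure is not shown. The function plans the split and checks the row counts without writing any files.

```python
from pathlib import Path

import pandas as pd


def split_by_part(master: pd.DataFrame, root: Path) -> dict[Path, pd.DataFrame]:
    """Map each target file path to the rows that belong in it."""
    outputs: dict[Path, pd.DataFrame] = {}
    for (assembly_no, part_no), chunk in master.groupby(["Assembly No", "Part No"]):
        # Hypothetical layout; adjust to mirror the folders that already exist.
        target = root / str(assembly_no) / str(part_no) / "voter list.xlsx"
        outputs[target] = chunk
    # Row-count check: groupby silently drops rows whose key is missing,
    # so a mismatch here would mean records were lost in the split.
    total = sum(len(chunk) for chunk in outputs.values())
    if total != len(master):
        raise ValueError(f"row count mismatch: {total} split vs {len(master)} in master")
    return outputs


# Made-up master with one assembly and two parts.
master = pd.DataFrame({
    "Assembly No": [10, 10, 10, 10, 10],
    "Part No": [1, 1, 2, 2, 2],
    "Name": ["A", "B", "C", "D", "E"],
})

files = split_by_part(master, Path("out"))
```

Each planned chunk could then be written with `chunk.to_excel(target, index=False)` after creating the parent folder; that call needs an Excel writer engine such as openpyxl to be installed.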

This cannot be done reliably in 4 hours by hand: 65 lakh (~6.5M) records across 10,000 files require automation and batching. The only practical way is to process the data with Python + pandas and then deliver the final Excel outputs.
How I’ll do it (fast and accurate):
Bulk read and merge
• Read all “voter list” Excel files from the Google Drive folders
• Add metadata columns: Part No, Assembly No, Assembly Name
• Append into one master dataset (chunk-wise to avoid running out of memory)
Duplicate detection (exact match)
• Key: Name + Relative Name + Age + Gender + House No
• Use hashing/groupby for high-speed matching
• Mark duplicates and create paired rows (Source + Copy)
Output structure
• Add required fields: Serial No, Part No, Assembly details, EPIC IDs (both source and copy)
• Two versions (with and without house no)
• Count and summarise the flag column distribution
Split back to files
• Recreate part-wise / assembly-wise Excel files
• Maintain the same folder structure
• Ensure no data loss (row validation check)
Reality check (important):
• Processing 6.5M rows plus Excel writing takes 2–4 hours minimum, even with optimized scripts
• The Google Drive download/upload itself will take time
• Excel allows at most 1,048,576 rows per worksheet, so the master may need to be split across sheets or files, or written with an optimized writer
Best approach to meet the urgency:
• Run the script on a high-RAM system (16–32 GB)
• Use parallel processing (multi-core)
Deliverables:
• Master file (or chunked)
• Split files ready in folders
• Flag summary report
Regards, Govind
₹6,000 INR in 1 day
2.1
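The Excel row limit raised in the bid above can be made concrete with a little arithmetic. An .xlsx worksheet holds at most 1,048,576 rows, so roughly 6.5 million records cannot sit in one sheet. The sketch below assumes one header row per sheet and the rough 6,500,000 figure from the brief; the function names are illustrative only.

```python
import math

EXCEL_MAX_ROWS = 1_048_576  # maximum rows in a single .xlsx worksheet
HEADER_ROWS = 1             # assumption: each sheet repeats one header row


def sheets_needed(record_count: int) -> int:
    """Number of worksheets required to hold `record_count` data rows."""
    rows_per_sheet = EXCEL_MAX_ROWS - HEADER_ROWS
    return math.ceil(record_count / rows_per_sheet)


def sheet_ranges(record_count: int) -> list[tuple[int, int]]:
    """Half-open (start, stop) row ranges, one per worksheet."""
    rows_per_sheet = EXCEL_MAX_ROWS - HEADER_ROWS
    return [
        (start, min(start + rows_per_sheet, record_count))
        for start in range(0, record_count, rows_per_sheet)
    ]


total_records = 6_500_000  # rough figure from the brief ("65 lac")
needed = sheets_needed(total_records)
ranges = sheet_ranges(total_records)
print(needed, ranges[-1])
```

With these numbers, six full sheets hold 6,291,450 records, so a seventh sheet is needed for the remainder. Any "single consolidated workbook" would therefore have to span several sheets or be delivered in chunks.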

I saw your project and am confident I can deliver on it. I'm currently working on a similar project. I will merge all voter list Excel files, identify duplicates based on Name, Relative Name, Age, Gender, and House Number, and create detailed records for each duplicate. Having reviewed the project details, I am confident I can complete this task efficiently and deliver the consolidated Excel workbook you need, with accurate duplicate pairs and detailed records for each entry. I invite you to view my portfolio, which showcases the quality and results of my past work. I look forward to hearing from you. Regards, Sadiya
₹6,000 INR in 7 days
0.0

I'll merge all voter list Excel files from your part-number folders into one master Excel workbook, then run exact-match duplicate detection using Name + Relative Name + Age + Gender + House Number. Python with pandas will efficiently batch-process all 10,000 files. Each duplicate pair gets two complete records—one tagged "Source", the other "Duplicate". ₹10625, 5-day delivery. Best regards, Val
₹10,625 INR in 5 days
0.0

Chennai, India
Joined Jan 23, 2026