
Suljettu
Julkaistu
Maksettu toimituksen yhteydessä
I have a set of voter lists supplied to me in CSV format and need a clean, reliable way to spot and document every duplicate voter record. The key identifier is the EPIC ID, but I also want you to cross-check any supporting fields you feel strengthen the match logic (name spellings, date of birth, address fragments, etc.). Your task is two-fold: 1) Run a duplicate-detection routine and generate a comparative analysis that lays out every matched record in full detail. 2) Merge these findings into my existing booth-wise reports, updating both the spreadsheet versions and the accompanying plain-text summaries so they read seamlessly with the new data. I already have a reporting structure, so please follow the same column order, file naming, and booth codes. Where you add commentary, keep it concise and clearly marked. Deliverables • Cleaned master CSV with duplicates flagged or removed • Booth-wise Excel sheets and their matching TXT summaries reflecting the new counts • A separate “Duplicate_Details” file containing the side-by-side record comparison for each match Accuracy is critical; I will spot-check random booths before releasing final approval. If you work in Python, R, or any tool you prefer, note the scripts and steps so I can rerun the process for future updates.
Projektin tunnus (ID): 40248279
65 ehdotukset
Etäprojekti
Aktiivinen 11 päivää sitten
Aseta budjettisi ja aikataulu
Saa maksu työstäsi
Kuvaile ehdotustasi
Rekisteröinti ja töihin tarjoaminen on ilmaista
65 freelancerit tarjoavat keskimäärin ₹20 178 INR tätä projektia

Hi, I am from banglore. I will write python code for this and having 8 years of experience in same field.I have worked with 116+ clients here. Let’s connect
₹14 000 INR 2 päivässä
6,3
6,3

Hello, I’m an experienced data analyst with strong expertise in CSV processing, record linkage, and duplicate detection, and I can build a precise, repeatable workflow to identify and document duplicate voter records using EPIC ID as the primary key while strengthening match logic with name similarity, date of birth, and address fragments. I will generate a detailed side-by-side comparison for every matched record, produce a cleaned master CSV with duplicates clearly flagged or removed, and seamlessly integrate updated counts into your existing booth-wise Excel sheets and TXT summaries while preserving your exact column order, file naming conventions, and booth codes. Accuracy and traceability are my priorities—I can implement the process in Python or another preferred tool and provide clear documentation or scripts so you can rerun the duplicate-check routine for future data updates with confidence. Regards, Zafar
₹25 000 INR 2 päivässä
6,3
6,3

Hi, I can help you accurately detect and document duplicate voter records using EPIC ID as the primary key, with additional cross-check logic on name variations, DOB, and address fragments to strengthen match reliability. I will generate a detailed side-by-side comparison file (“Duplicate_Details”) and produce a cleaned master CSV with duplicates clearly flagged or removed as required. I’ll also update your existing booth-wise Excel sheets and TXT summaries, strictly following your current column order, file naming, and booth codes. Any added commentary will be clearly marked and concise. The full ETL and duplicate-detection process (Python/Excel-based) will be documented so you can rerun it for future updates. Accuracy and audit transparency will be prioritized throughout. Best Regards, Virendra
₹25 000 INR 7 päivässä
6,2
6,2

Hello, I can do this job using Excel VBA and do it very fast within one day Final product will be well structured Excel spreadsheet with marked duplicates and extra workbook where will be side-by-side record comparison for each match Need to see sample source file You can contact with me via freelancer chat and I will answer all your questions if you have. Regarding my experience you can check in my reviews with 100% feedback I have 17+ year experience of Python, Excel, Excel VBA and Have 100% feedback So, you can trust me result will be as you need Best Regard
₹12 500 INR 1 päivässä
5,8
5,8

As a dedicated Full-Stack Developer, I brings a unique blend of skills and experience to tackle your Duplicate Voter Analysis project. My proficiency in Python and expertise in Data Analysis and Data Processing perfectly aligns with the task at hand. Over the years, I've successfully built complex AI systems and web applications that required a meticulous eye for detail, a quality especially useful for spotting duplicates in your voter lists. The focus of my work ethic is on accuracy and timely completion; two vital requirements of your project given its significance. Through my developed scripts, I will carry out thorough duplicate detection routines not only considering the key identifier (i.e. EPIC ID) but also by cross-checking supporting fields like name spelling, date of birth, and address fragments - aiming to strengthen the match logic further. Further aligning with your project's goals of providing concise analysis, seamless merging with current reports alongside comprehensive documentation, my experience in using various tools including Pandas in Python will ensure delivery of a cleaned master CSV file, booth-wise Excel sheets (updated), accompanying plaintext summaries reflecting new counts as well as"duplicate_details" file capturing side-by-side record comparisons for each match." Selecting me would mean selecting expertise, dedication, and collaborative problem-solving skills tailored specifically to meet your needs.
₹25 000 INR 1 päivässä
5,6
5,6

Hi there, Myself suganya, hope you are doing good. i have gone through the project description and clear with the instruction provided . Immediately available to create a master sheet. I have more than 6 years of experience in data processing . Good in handling digital documents in excel, pdf, word, etc... easily and quickly. More experience in data cleasning, formating, organizing in large volume data. please consider this project for me, i have more time and patience to do this project carefully with 100% quality. Confidently Waiting for you to discuss about this project i have basic python knowledge also. Thanks, suganya kindly request to see the completed projects in similar type of job here, https://www.freelancer.com/projects/data-management/Conversion-PDF-vers-Excel/reviews https://www.freelancer.com/projects/data-extraction/PDF-Contacts-Extraction-Excel/reviews https://www.freelancer.com/projects/pdf/Digita-extratos-banc-rios/reviews https://www.freelancer.com/projects/excel/PDF-Excel-Conversion-Pages-39251625/reviews https://www.freelancer.com/projects/excel/DIGITA-EXTRATOS-BANC-RIOS-PARA-38962625/reviews https://www.freelancer.com/projects/excel/DIGITA-EXTRATOS-BANC-RIOS-PARA-38851910/reviews https://www.freelancer.com/projects/data-entry/Multilple-rows-38332131/reviews
₹12 500 INR 3 päivässä
5,6
5,6

Hello Dear, I am writing to offer my expert services for your voter list duplication project. I understand the critical importance of accuracy for this task, and I have the precise skills in data cleaning and analysis to deliver the reliable results you require. My approach will involve a multi-level analysis of your CSV data. I will use the EPIC ID as the primary identifier for duplicates and supplement this by cross-referencing names, dates of birth, and addresses to ensure the highest degree of accuracy. I am highly proficient in advanced Excel and Google Sheets functions, which are ideal for this kind of comparative analysis. I am confident in my ability to produce all the deliverables exactly to your specifications, including the cleaned master CSV, the updated booth-wise reports in both Excel and TXT formats, and the detailed “Duplicate_Details” file. I always adhere strictly to existing file structures and naming conventions. My past projects, such as **"Urgent excel data need to be reconfigured"** and **"Organize excel spreadsheet by deliverables,"** have prepared me well for tasks that require meticulous data handling and adherence to specific reporting formats. Furthermore, I will document all steps taken to ensure you have a clear and repeatable process for future updates. Accuracy is my top priority. I am ready to begin immediately and look forward to discussing the specifics of your data with you. Best regards, Md Jamrul Mia
₹30 000 INR 7 päivässä
5,2
5,2

Professional SW/Excel developer ready to do the required cleaning/de duplication from voter lists. A reusable Excel Macro/Script for future use , top accuracy and quality works. https://www.freelancer.com/projects/excel/Excel-Auto-Execute-Macro-Upon/reviews https://www.freelancer.com/projects/data-processing/data-processing-use-mail-merge/reviews
₹12 500 INR 1 päivässä
5,2
5,2

Hi, Thank you for considering my proposal. I have over years of experience in data cleaning and processing. I am well suited to deduplicate you data considering excel and their matching TXT summaries. I am ready to start immediately and will follow your reporting structure. Thank you!
₹15 000 INR 5 päivässä
4,4
4,4

Your duplicate detection will fail if you're only matching on EPIC ID - I've worked with 3 election commissions where phonetic name variations and address typos created false negatives that invalidated entire audits. Before I architect the matching logic, I need clarity on two things: What's your acceptable false positive rate (flagging valid voters as duplicates), and do you have access to the original data dictionary that defines how EPIC IDs were assigned? Some states reuse IDs across districts. Here's the data integrity approach: - PYTHON + PANDAS: Build a multi-stage fuzzy matching pipeline using Levenshtein distance for names and address normalization to catch typos like "Rd" vs "Road" that exact matching misses. - EXCEL VBA MACROS: Automate the booth-wise report updates so your existing column structure and file naming conventions stay intact without manual copy-paste errors. - DATA VALIDATION LAYER: Generate a confidence score (0-100) for each duplicate pair based on how many fields match, so you can review borderline cases before final removal. - AUDIT TRAIL: Create a separate reconciliation sheet showing before/after voter counts per booth with variance explanations, because election officials will ask why numbers changed. I've built similar voter roll deduplication systems for 2 state election boards where we reduced duplicate rates from 8% to 0.3% while maintaining zero false positives on manual review. The scripts I'll deliver include step-by-step comments so your team can rerun this quarterly without my involvement. Let's schedule a 15-minute call to walk through your current booth structure and discuss edge cases like married voters sharing addresses.
₹22 500 INR 7 päivässä
5,0
5,0

Hi, As per my understanding: You have booth-wise voter lists in CSV format and need an accurate duplicate-detection process using EPIC ID as the primary key, strengthened by secondary checks (name similarity, DOB, address fragments). The output must integrate seamlessly into your existing reporting structure, preserving column order, booth codes, and file naming. You require a flagged/cleaned master file, updated booth Excel and TXT summaries, and a detailed side-by-side Duplicate_Details report. Reproducibility is essential. Implementation approach: I will build a scripted workflow (Python + pandas + fuzzy matching). Step 1: normalize data (trim, case standardize, remove special chars). Step 2: exact EPIC match detection. Step 3: secondary probabilistic matching (fuzzy name score, DOB equality, partial address match) to catch near-duplicates. Step 4: generate structured comparison output with match score and reason. Step 5: update booth-wise aggregates and regenerate Excel/TXT summaries while preserving your format. All scripts and steps will be documented for reruns. A few quick questions: Approximate record count? Are EPIC IDs always populated? Preferred fuzzy threshold level? Do booth files exist separately or combined?
₹19 000 INR 15 päivässä
4,6
4,6

Hi,I’m a seasoned Applied Data Scientist & I have experience of audit-grade data cleaning and deduplication pipelines (deterministic IDs + fuzzy matching) with reproducible reporting. I can detect and document every duplicate voter record, then merge results into your existing booth-wise Excel/TXT outputs while preserving your naming conventions and column order. Approach *Ingest + standardize: load all CSVs, enforce schema, normalize EPIC (trim/case), clean names (unicode/spacing), parse DOB & standardize address fragments (tokenization) *Duplicate detection: - Primary: exact EPIC duplicates(same EPIC across rows/files). -Secondary strengthening: fuzzy checks on name/DOB/address using configurable thresholds (RapidFuzz similarity + DOB exact/near match) to confirm or flag “possible duplicates” when EPIC varies or is missing *Evidence + audit trail: generate a Duplicate_Details file with side-by-side comparisons, match reason (EPIC exact / EPIC+fuzzy), similarity scores & booth codes for quick spot checks. *Booth-wise integration: update your existing booth reports to include duplicate counts/flags & regenerate matching TXT summaries with concise, clearly marked notes,keeping the same structure & file naming. *Re-runnable pipeline: provide scripts + README Deliverables • Master cleaned CSV with duplicate flags • Updated booth-wise Excel + TXT summaries • “Duplicate_Details” comparison file (full record pairs/groups)
₹12 500 INR 1 päivässä
4,2
4,2

Having extensive experience in data management and analysis, particularly with a strong command over Excel functions, I am equipped to handle your complex Voter Analysis task. My sharpened data entry, cleansing, and processing skills paired with my keen eye for detail have resulted in turning piles of messy data into clean, reliable information. Understanding your unique requirements, I will ensure a comprehensive duplicate-detection routine that not only cross-checks EPIC IDs but also supports fields like name spellings, date of birth, and address fragments enabling us to capture and document every matched record diligently. My expertise in organizing and structuring data adhering to specified column order means the final deliverables will fit seamlessly with your existing reports. What sets me apart from the crowd is my commitment to accuracy, timeliness, and above all, client satisfaction. So you can rest assured that not only will the finalized Master CSV be devoid of duplicates but also the merged booth-wise reports will accurately reflect the new counts. To simplify future updates, I can provide explicit instructions on running the used scripts in Python or R.
₹12 500 INR 3 päivässä
3,7
3,7

Hi, noticed that you are looking for a skilled developer with experience in CSV data handling. I can get it done as I work on csv data on daily basis to check and generate files using that data. So I'm sure that with my experience in that I can get it done within a short amount of time. So let's talk more in DM.
₹20 000 INR 7 päivässä
3,7
3,7

Hello, I can run a precise duplicate-detection process using EPIC ID as the primary key and cross-check supporting fields (name, DOB, address fragments) to strengthen match logic. I’ll generate a full side-by-side comparison for each duplicate, update your booth-wise Excel sheets and TXT summaries following your exact structure, and deliver a cleaned master CSV with duplicates clearly flagged or removed. I’ll also document the Python/R scripts and steps so you can rerun the process for future updates. Accuracy will be strictly maintained for your spot checks. Regards, Bakhtawar
₹12 500 INR 1 päivässä
3,2
3,2

Hi, There I’ve read your requirements carefully and I understand the importance of accuracy in identifying and documenting duplicate voter records. I can run a reliable duplicate analysis based on EPIC ID with supporting field cross-checks, then integrate the results seamlessly into your existing booth-wise reports. I’ll deliver clean, well-structured files along with clear documentation so the process can be repeated anytime. Please share the CSV files and report structure, and I’ll get started right away and send the first results soon. Regards, Safrin L
₹17 500 INR 3 päivässä
3,4
3,4

❤️❤️❤️HELLO, SIR❤️❤️❤️ I can process sir’s voter lists to accurately detect duplicates using EPIC ID and supporting fields like name, DOB, and address fragments. I will deliver a cleaned master CSV, updated booth-wise Excel sheets, TXT summaries, and a “Duplicate_Details” file with side-by-side comparisons. All scripts and steps will be documented so sir can rerun the process reliably for future updates. I look forward to working with sir.
₹25 000 INR 7 päivässä
2,7
2,7

Thank you for considering my proposal for the Duplicate Voter Analysis Report project. The specific detail that caught my attention is the need to not just identify duplicate voter records but also to cross-check additional fields to strengthen the match logic. With over 7 years of experience in software development, including data analysis and reporting, I am confident that I can deliver the results you are looking for. Here is how I plan to approach this project: - Utilize Python for duplicate-detection and data analysis - Implement fuzzy matching algorithms to compare supporting fields - Generate detailed comparative analysis reports for matched records - Merge findings into existing booth-wise reports with consistent formatting - Provide cleaned master CSV, booth-wise Excel sheets, TXT summaries, and a separate "Duplicate_Details" file for record comparisons In a recent project for a political campaign, I developed a similar voter analysis tool that successfully identified and resolved duplicate records, improving overall data accuracy by 15%. I believe this experience directly applies to the requirements of your project. To ensure accuracy, I will conduct thorough spot-checks on random booths before final approval. Additionally, I will provide detailed documentation of the scripts and steps used in Python for future reference. I am excited about the opportunity to wo
₹13 750 INR 7 päivässä
2,0
2,0

: I would be an excellent fit for your project, as I specialize in creating reliable and scalable software solutions, using Python - one of the tools you are comfortable running yourself in the future. Over the years, I have worked on numerous data analysis projects involving vast sets of data like yours, and my familiarity with CSV files and duplicate detection routines is extensive. We share a common objective - clean, maintainable code and future-ready architecture - these are principles that I live by when developing a project. For your task, I will unleash the power of Python to create an intelligent, abstracted routine that will not just identify duplicates based on your given key identifier but also employ supporting fields such as name spellings, date of birth, and address fragments to enhance match logic. My deliverables will be meticulously aligned with your current reporting structure, incorporating every detail from merged findings in a concise yet clear manner. Additionally, my work process has always been marked by transparency and effective communication; you can expect timely updates and precise documentation on every step taken. Accuracy is paramount to me too; before final approval, let's designate some booths for spot-checking purposes. Together we’ll ensure this project produces a reliable duplicate voter analysis report that meets all your needs.”
₹12 500 INR 3 päivässä
1,5
1,5

Hello, I am new to this platform; however, I have seven years of experience as a Data Analyst, primarily using Excel and Python. I have reviewed your project and understand your requirement to detect duplicate voter records using EPIC ID along with supporting fields, generate a detailed comparison analysis, and integrate the results into your existing booth-wise reports while maintaining the same structure and format. I will carefully follow your guidance to ensure that the project meets your needs with high accuracy and clear documentation so the process can be reused for future updates. I would be happy to work on this project and look forward to discussing this opportunity further. Best regards, Eva
₹15 000 INR 3 päivässä
1,0
1,0

Chennai, India
Liittynyt tammik. 23, 2026
₹1500-12500 INR
₹1500-12500 INR
₹1500-12500 INR
₹1500-12500 INR
₹1500-12500 INR
₹1500-12500 INR
$10-30 USD
₹1500-12500 INR
$250-750 USD
$30-250 USD
€30-250 EUR
$10-30 AUD
£250-750 GBP
$10-30 USD
₹12500-37500 INR
₹600-1500 INR
₹750-1250 INR/ tunnissa
$250-750 USD
£18-36 GBP/ tunnissa
$10-30 USD
₹1000-5000 INR
₹750-1250 INR/ tunnissa
₹750-1250 INR/ tunnissa
$10-30 USD
$30-250 USD