Find Jobs
Hire Freelancers

Urgent Python script extract domains from html blobs in csv

$30-250 USD

Suoritettu
Julkaistu yli 4 vuotta sitten

$30-250 USD

Maksettu toimituksen yhteydessä
We have exported all of our CMS content to a CSV (page per row) and I need to be able to parse all of the content for each row and get all of the domains linked to from in the content. Ideally we point the python script to a csv and it will process all of the data and output a csv. In that config file we can specify which column is the html to process. Python will leave the other columns untouched. In the starter csv we will have 2-3 other columns of data we need you to persist to the output file. The output file should have those columns and the new url column, in that url column we need just the domains (including sub domain) in a delimit safe way so it stays in the one column. Should run on Windows. Please state your timeline and questions in the offer. So we can more quickly shortlist. Generic proposals will be deleted.
Projektin tunnus (ID): 22415621

Tietoa projektista

20 ehdotukset
Etäprojekti
Aktiivinen 5 vuotta sitten

Haluatko ansaita rahaa?

Freelancerin tarjouskilpailun edut

Aseta budjettisi ja aikataulu
Saa maksu työstäsi
Kuvaile ehdotustasi
Rekisteröinti ja töihin tarjoaminen on ilmaista
Myönnetty käyttäjälle:
Käyttäjän avatar
Hi, I am working on VBA for Excel and Outlook for the last 6 years. I have been doing a job as a senior developer to develop ERP solutions based on Excel-VBA, Outlook, python and SQL server with a collaborative platform. As Well I have developed Excel-tools for the purpose of the data analysis, statistical analytical tools and accounting tools for UpWork clients. VSTO Add-Ins using vb.net for excel, word and outlook. I am a full-time freelancer available to work 10+ hours daily even on weekends. Please schedule a meeting for a detailed discussion if required. Thanks and Regards, Divyesh Makwana
$100 USD 1 päivässä
5,0 (2 arvostelua)
3,9
3,9
20 freelancerit tarjoavat keskimäärin $135 USD tätä projektia
Käyttäjän avatar
Hello, how are you? Thank you for your project posting. Here is Eric, skilled with professional python. As a senior web developer I can deliver good results with quality in time you want. Please invite me so that we can discuss more details through chatting. Best regards.
$150 USD 2 päivässä
5,0 (26 arvostelua)
6,0
6,0
Käyttäjän avatar
Hello I have over 3 years of experience with Python programming, including working with CSV files Also, I am working with Windows OS usually. Looks like there are 2-3 hrs of pure working time - but I would like to check sample of CSV, in order to make sure it looks as I expect.
$84 USD 1 päivässä
4,9 (186 arvostelua)
6,0
6,0
Käyttäjän avatar
Hi, How are you? I have rich experience of working with python to make csv file I can do it within your time Please ping me if you are interested on me.
$180 USD 2 päivässä
5,0 (3 arvostelua)
5,0
5,0
Käyttäjän avatar
Hi Hiring manager I have read your all given information in description and i am ready to do various type of work for you and available 24*7*365 for you. 1* I can do Data RESEARCH from any platform for certain subjects. 2* I can create the accurate DATABASE (EMAIL/CONTACTS/PERSONAL INFO etc*. 2* I can properly fill a Google Sheet/EXCEL/WORD. Have good working skills of Excel/PDF. 4* I am Expert in DATA ENTRY and do any kind of that work. 5* I can Find data from INTERNET RESEARCH/ WEB SCRAPING. 6* I can Work as a VIRTUAL ASSISTANT for daily any kind of task. 7* I can work as per client TIME ZONE if they want and provideS 15hrs services in a day. 8* I have 3YRS of good experience of as a VIRTUAL ASSISTANT and done various type of task. 9* I have the excellent skill of SEO/DIGITAL MARKETING/SOCIAL MEDIA MARKETING/MANAGEMENT. 10* I can create MEMES/posting,strategy for increasing likes,followers of social media. 11* I have expertise in WORDPRESS sites creating/product Uploading/WOOCOMMERCE. 12* I have Good skills in SHOPIFY, PRESTASHOP, MAGENTO etc for DROPSHIPPING. 13* I Can Write Optimized SEO friendly impressive articles for BLOG and get 1 rank. 14* I am fast Learner. Grab The instructions from client instantly and implement. 15* I can get 100% results and Give Guaranteed work satisfaction from client instructions. for work i am always available and take the challenges and never say no to work always be ready and be professional in work. thanks
$140 USD 1 päivässä
4,9 (8 arvostelua)
5,0
5,0
Käyttäjän avatar
Hello Sir, I can build Python script which will extract domains from html blobs in csv and generate output as needed, I can build it within a day and it will run on your windows. Please contact me so we can discuss further. Thanks!
$100 USD 1 päivässä
5,0 (12 arvostelua)
4,5
4,5
Käyttäjän avatar
Hi. I can write this project in 2 days. I am ready to write your project Write apps on your demand in many languages (Visual Basic, VBA, VBS, .NET, C#, JS, Python, Java, PowerShell) Write database apps including many db formats: MS Access, MS SQL, SQL Server, MySQL, SQLite, PostgreSQL, Firebird Write Automation apps including: * Automation Desktop apps. Some examples: Automation Playing Games, Automation Start/Stop/Click 3rd party apps * Automation Web apps. Some examples: Automation Web Scraping apps, Automation Web Crawling apps * Automation Data Processing apps: Automation formatting data to a specified template * Automation Data, Document Converting apps * Automation Macro, VBA for all apps in MS Office (Excel, Word, Outlook), OpenOffice, GoogleSheet * Automation Installers/Setups * Convert your Manual tasks to Automation solutions Write Web Service, Web API, Desktop API apps. Some examples: Google API, Bing API, Facebook API, MS API Fix/Solve any errors in your OS, apps
$70 USD 2 päivässä
5,0 (14 arvostelua)
4,3
4,3
Käyttäjän avatar
timeline: 3 hours I have a any questions. - Is there CMS content in html in csv file? - Show me know config file. - is workflow correct? read csv file and parse the data of the column that contain html grab the domains from a link in html content and join in delimit character(for example, :, ;, \n) write 2-3 columns and domains into csv
$150 USD 7 päivässä
4,7 (14 arvostelua)
4,7
4,7
Käyttäjän avatar
Hi, Thanks for your job posting. I've read your project description carefully. You want to build python script extract domains from html blobs in csv urgently. As a senior python developer, I have rich experience in this kind of works. If you have desire to work with me, please respond me. Thanks
$100 USD 1 päivässä
4,1 (7 arvostelua)
3,9
3,9
Käyttäjän avatar
Hi, I can do this work for you. Timeline maximum will be a 2 days including coding, quality, and documentation. I will use python to automate this job, parse each row and run the extraction to dump it in another CSV or txt. Question - 1. How many rows? ( 1-2 days - 100K records, with Quality processing) It can be scaled up. 2. How many expected domain names each row? 3. Any samples to be provided? Thanks, H
$200 USD 7 päivässä
0,0 (0 arvostelua)
0,0
0,0
Käyttäjän avatar
Hi I am interested. Let me tell u about myself I am PhD in AI/CS working as sr. Data Scientist in india. I know a to z of all AWS and Data Science. 7 yrs of experience. But don't know Japanese. If u think ok to go happy for that. Dr Dimple Sehgal PhD (AI / CS), MCA Sr Data scientist
$166 USD 1 päivässä
0,0 (0 arvostelua)
0,0
0,0
Käyttäjän avatar
Questions : 1) Are there any restrictions on use of python libraries? 2) Usually a page may contain many links which are not very interesting. Do you need all the links or just a subset (like html/jpg/pdf etc) of it based on some criteria? In any case, as long as there is some logic, it wont be difficult. 3) Whether the content (in any column) contains unicode characters? 4) Whether the content is in mono-language? 5) How big is the csv file (MB or GB) ? To make it delimit safe, entire content of the csv file need to be known. Semicolon appears to be promising here unless other columns do not have this character inside it. Won't be difficult to implement. Timeline : This varies based on the complexity of the input csv but in a typical case it will take about a day. The proposal can be tweaked based on actual project details and requirements. Do let me know if you have any queries to me.
$200 USD 1 päivässä
0,0 (0 arvostelua)
0,0
0,0
Käyttäjän avatar
Hello! Thanks for your brief project description. Your task sounds interesting. I have some question If you don't mind. How big is the file to process? Do you need the script to be executable on Windows? When do you need it done? Are you open to other libraries different to python built-in functions? Could you provide few lines example of how the csv is structured? Hope to chat with you soon mate! Thanks Sajid Khan
$200 USD 7 päivässä
0,0 (0 arvostelua)
2,4
2,4
Käyttäjän avatar
If I understood it correctly, all of the domain links must be extracted and will be the content of a new column(strictly 1 column so we will not use comma to delimit these links). But what do you mean by that 'config file'? Does that mean that not all the domain links must be extracted but only those links that are in the specified columns? I think I can do it within a few hours but give me a bit longer time just to be sure. I can do it faster if you allow me to do it in Perl instead of Python.
$135 USD 3 päivässä
0,0 (0 arvostelua)
0,0
0,0
Käyttäjän avatar
I am very good with scraping data from web and files via Python. I would review your CSV files. Money is not a bid problem for me and i am only doing this fir my hobby. I love to do scraping Data with python and facing different challenges. Thanks.
$111 USD 4 päivässä
0,0 (0 arvostelua)
0,0
0,0

Tietoja asiakkaasta

Maan UNITED STATES lippu
Austin, United States
4,9
470
Maksutapa vahvistettu
Liittynyt toukok. 9, 2004

Asiakkaan vahvistus

Kiitos! Olemme lähettäneet sinulle sähköpostitse linkin, jolla voit lunastaa ilmaisen krediittisi.
Jotain meni pieleen lähetettäessä sähköpostiasi. Yritä uudelleen.
Rekisteröitynyttä käyttäjää Ilmoitettua työtä yhteensä
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Ladataan esikatselua
Lupa myönnetty Geolocation.
Kirjautumisistuntosi on vanhentunut ja sinut on kirjattu ulos. Kirjaudu uudelleen sisään.