Käynnissä

script / software to parse file and extract data

i need a script / software to parse an huge "webcrawler index file", extract some informations and save them into my mysql database

data to extract:

-link relations: link from site - link to site

-link info: textlink or image link, link text, alt text, title text

-link type: nofollow, (meta tag nofollow), follow link

-some page infos: content encoding, title, domainname...

the index file sizes are between 15GB and 100GB big, please keep in mind that your script can handle this capacity

the script / software should run on our linux root server

i'll give you an mysql database layout

you find can all informations about the index file here: [url removed, login to view]

please send me an pms if you have any question

Taidot: C-ohjelmointi, Java, Linux, Perl, Python

Näytä lisää: linux script parse file, search engine crawler script, cron parse script, database search, webcrawler software, root info, find software, file extract data, microsoft tag, ruby script, linux parse file data, linux projects free lance, read notepad file, needed script program, php convert files flv linux, Webcrawler, script linux, parse, parse and, meta tag, linux software c++, Linux script, infos, find root, file server

Tietoa työnantajasta:
( 3 arvostelua ) Verden, Germany

Projektin tunnus: #463062

Myönnetty käyttäjälle:

jimcrow

I can do it with python.

40 $ USD 7 päivässä
(3 arvostelua)
5.8

35 freelancers are bidding on average $115 for this job

srinichal

I can write a bash script for the same

120 $ USD 4 päivässä
(92 arvostelua)
6.6
cliver

Hello, Please look at the PMB. Regards, Sergey

180 $ USD 2 päivässä
(23 arvostelua)
6.5
gangabass

I can do this job for you. See PM for details.

70 $ USD 3 päivässä
(182 arvostelua)
6.2
SigmaVisual

We can help in your project, please check PMB to see our related experience.

250 $ USD 4 päivässä
(34 arvostelua)
6.1
ancosys

Hi, Please check PM Thankx

130 $ USD 3 päivässä
(93 arvostelua)
5.8
sureshdevi

I can do this work. Thanks, Suresh

200 $ USD 5 päivässä
(66 arvostelua)
5.7
jporwal

Dear sir, I have experience developing parsers for VHDL and C++ using lex/yacc and ANTLR. Can develop a very efficient, cache optimized parser for you in 2 days. It is and interesting and simple task for me. Rega Lisää

30 $ USD 2 päivässä
(12 arvostelua)
4.8
pawel100

Hello, I'm interested in your project, Please check PMB for more details.

60 $ USD 3 päivässä
(26 arvostelua)
4.6
edatawiz

Hi - Please check PM for details.

150 $ USD 7 päivässä
(5 arvostelua)
3.7
KelvinChen

Please check PM for details.

110 $ USD 3 päivässä
(5 arvostelua)
3.6
ulkas

easy task, don't need any more info, just get me an example file and i can start asap and after then you can try it with your own big file.

100 $ USD 2 päivässä
(2 arvostelua)
2.9
yaroslavm

Hello, I can do that quickly and at low price.

100 $ USD 3 päivässä
(2 arvostelua)
2.9
Ellemer

Please check the PM, thanks

111 $ USD 3 päivässä
(4 arvostelua)
2.8
IstvanAntal

Hello, I understand what needs to be done, and I can start right away. See PM for details.

100 $ USD 2 päivässä
(1 arvostelu)
1.2
nusch

I can do it fast with Python, I have experience with crawlers and other automated systems.

99 $ USD 2 päivässä
(0 arvostelua)
0.0
Kamerer

I am expirienced in Python. I can write such script for you.

150 $ USD 7 päivässä
(0 arvostelua)
0.0
xeNorthwest

can do this easily with perl

50 $ USD 2 päivässä
(0 arvostelua)
0.0
zub1uk

Hi, I am an information extraction specialist and would be happy to help you with this project. All I require is a small sample of the file to be parsed to proceed.

100 $ USD 2 päivässä
(0 arvostelua)
1.4
ishantoraskar

pls see PM.

35 $ USD 3 päivässä
(0 arvostelua)
0.0
periwebindia

hello, I can help you out.

150 $ USD 5 päivässä
(0 arvostelua)
0.0