Closed

MediaWiki expert

Only experienced PHP developers with a good knowledge of the MediaWiki software should bid on this project.

We require a set of PHP functions that parse MediaWiki wikitext and return arrays:

function section (wikitext, external, internalroot, imageroot, images)

PARAMETERS:

wikitext - (string) raw wikitext (from cur table) to be parsed

external - (boolean) whether to include external links (a href tags) or strip them out

internalroot - (string) the root for internal links (e.g. '/','[url removed, login to view]','/docs/')

imageroot - (string) the root for images (e.g. '/images','[url removed, login to view]')

images - (boolean) whether to include images (img tags) or exclude them

RETURNS ARRAY (*by SECTION of wikitext from the article):

html - parsed wikitext in html format

title - title of section
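
As a rough illustration of the per-section return shape described above, the splitting step could be sketched as follows. This is only a sketch under assumptions: the function name sketch_split_sections and the 'wikitext' key are hypothetical, and the conversion of each section body to html is deliberately left out.

```php
<?php
// Hypothetical helper (not part of the specification): splits raw wikitext
// into sections on "== Heading ==" lines and returns title/body pairs.
function sketch_split_sections(string $wikitext): array
{
    // Split on heading lines; the captured group keeps each heading title.
    $parts = preg_split(
        '/^={2,6}[ \t]*(.+?)[ \t]*={2,6}[ \t]*$/m',
        $wikitext,
        -1,
        PREG_SPLIT_DELIM_CAPTURE
    );

    // $parts[0] is the lead text before the first heading (title left empty).
    $sections = [['title' => '', 'wikitext' => trim($parts[0])]];

    // After the lead, titles and bodies alternate: title, body, title, body, ...
    for ($i = 1; $i < count($parts); $i += 2) {
        $sections[] = [
            'title'    => $parts[$i],
            'wikitext' => trim($parts[$i + 1] ?? ''),
        ];
    }
    return $sections;
}
```

Each returned 'wikitext' value would still need to be converted to html by the real section function.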

function section_link (wikitext, internal, external, language, externalroot, internalroot, internalextension)

PARAMETERS:

wikitext - (string) raw wikitext (from cur table) to be parsed

internal - (boolean) return internal links?

external - (boolean) return external links?

internalextension - (string) file extension of internal link (e.g. '.html', '.asp', '.php', or '' to return '[url removed, login to view]', '[url removed, login to view]', etc.)

language - (boolean) return language links? (e.g. [[es:Article title]])

RETURNS MULTI-DIMENSIONAL ARRAY (*by SECTION of wikitext from the article and link)

title - title of section

url - URL of the link, including the specified externalroot or internalroot and the internal extension (e.g. '/doc/[url removed, login to view]','[url removed, login to view]',etc)

anchor - anchor text in link (e.g. 'article title')
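
To illustrate the url/anchor pairs described above, internal links could be collected roughly like this. It is a minimal sketch, assuming that spaces in targets become underscores and that any target containing a colon (categories, images, language links) is simply skipped. The name sketch_internal_links is hypothetical, and external links and per-section grouping are not handled here.

```php
<?php
// Hypothetical helper: collects internal [[Target|label]] links from wikitext.
function sketch_internal_links(string $wikitext, string $internalroot, string $extension): array
{
    $links = [];
    // Target is everything up to "|" or "]]"; the label after "|" is optional.
    preg_match_all(
        '/\[\[([^\]|]+)(?:\|([^\]]*))?\]\]/',
        $wikitext,
        $matches,
        PREG_SET_ORDER
    );

    foreach ($matches as $m) {
        $target = trim($m[1]);
        // Simplification: skip namespaced or interlanguage targets ("Category:X", "es:Title").
        if (strpos($target, ':') !== false) {
            continue;
        }
        // Fall back to the target itself when no label is given.
        $anchor = (isset($m[2]) && $m[2] !== '') ? $m[2] : $target;
        $links[] = [
            // Assumption: spaces become underscores in the generated URL.
            'url'    => $internalroot . str_replace(' ', '_', $target) . $extension,
            'anchor' => $anchor,
        ];
    }
    return $links;
}
```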

function category (wikitext)

PARAMETERS:

wikitext - (string) raw wikitext (from cur table) to be parsed

RETURNS ARRAY:

category - (string) category name
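
Category extraction lends itself to a single regular expression. The following is a sketch only: the name sketch_categories is hypothetical, and it assumes the English "Category" namespace name and drops any sort key after "|".

```php
<?php
// Hypothetical helper: returns the distinct category names found in wikitext.
function sketch_categories(string $wikitext): array
{
    // Match [[Category:Name]] and [[Category:Name|sort key]], case-insensitively.
    preg_match_all(
        '/\[\[\s*Category\s*:\s*([^\]|]+?)\s*(?:\|[^\]]*)?\]\]/i',
        $wikitext,
        $m
    );
    // Remove duplicates and reindex so the result is a plain list.
    return array_values(array_unique($m[1]));
}
```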

function image (wikitext, imageroot)

PARAMETERS:

wikitext - (string) raw wikitext (from cur table) to be parsed

RETURNS ARRAY:

image - (string) image filename with path specified in imageroot (e.g. '/images/[url removed, login to view]','[url removed, login to view]')
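
The image list could be built in a similar way. This sketch assumes that files sit directly under imageroot (no hashed upload subdirectories) and that spaces in filenames become underscores. The name sketch_images is hypothetical, and both the "Image:" and "File:" prefixes are accepted.

```php
<?php
// Hypothetical helper: returns image paths for [[Image:...]] / [[File:...]] links.
function sketch_images(string $wikitext, string $imageroot): array
{
    // Capture the filename up to the first "|" option or the closing "]]".
    preg_match_all(
        '/\[\[\s*(?:Image|File)\s*:\s*([^\]|]+?)\s*(?:\||\]\])/i',
        $wikitext,
        $m
    );

    $images = [];
    foreach ($m[1] as $name) {
        // Assumption: flat directory layout, spaces replaced by underscores.
        $images[] = rtrim($imageroot, '/') . '/' . str_replace(' ', '_', $name);
    }
    return $images;
}
```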

Usage:

We want to include a single PHP file at the start of a PHP script, then be able to call these functions to pass our own wikitext in and get formatted data out.

This needs to run FAST: parsing should rely only on pattern matching with regular expressions, with no extra checks (e.g. no checking for broken links).

There should be no language links (from e.g. [[es:Article title]]) returned in parsed html variables

There should be no category links returned in parsed html variables

There should be no table of contents returned in parsed html variables
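
The three exclusions above could be handled by a pre-processing pass before the html is generated. This is a heuristic sketch, not a definitive rule set: the name sketch_strip_meta is hypothetical, and language links are assumed to use a lowercase two- or three-letter prefix (optionally with hyphenated suffixes), which may also match some interwiki prefixes.

```php
<?php
// Hypothetical helper: removes category links, language links and TOC magic
// words from wikitext before it is converted to html.
function sketch_strip_meta(string $wikitext): string
{
    $patterns = [
        // [[Category:Name]] and [[Category:Name|sort key]]
        '/\[\[\s*Category\s*:[^\]]*\]\]/i',
        // Assumed shape of language links, e.g. [[es:Article title]]
        '/\[\[\s*[a-z]{2,3}(?:-[a-z]+)*\s*:[^\]]*\]\]/',
        // Magic words that control the table of contents
        '/__(?:TOC|NOTOC|FORCETOC)__/',
    ];
    return trim(preg_replace($patterns, '', $wikitext));
}
```

Ordinary internal links such as [[Main Page]] are left untouched by these patterns.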

Needs to be able to parse all wikitext code (e.g. tables, etc.)

Please explain with your bid how you intend to design the code to demonstrate your understanding of our requirements.

Payment - 100% upon successful completion of testing (2 days after completion of project, if everything works!)

Skills: Linux, Perl, PHP, Script Install

About the employer:
(1 review) x, France

Project ID: #22692