I am looking for either a WordPress plugin or a standalone PHP process, possibly using cURL, that accesses the following link at specific intervals (most likely daily): [url removed, login to view]
That page lists recent arrest records, each with a link to the arrest booking details and another link to the image (mug shot). For each record, the script should grab the data from the booking details page and, along with the image, import it into WordPress as a post, with the data mapped to core fields as well as custom fields. Certain data elements will also need to map to WordPress tags; I will provide details on those at a later date. The image for each imported post must be attached to that post, because I am using an image thumbnailer that grabs the first image associated with a post to display on the site.
Also, if you look at the list of records on [url removed, login to view], you will notice that there is a date column. The script should only pull in records for the current date, so as to avoid importing duplicates. I am open to other suggested methods for preventing duplicates on import, but this seems the simplest to me.
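To make the duplicate handling concrete, here is a rough sketch of the logic I have in mind. It is written in Python purely for illustration (the deliverable should still be PHP), and the key names `date` and `booking_id`, the `MM/DD/YYYY` date format, and both function names are assumptions of mine, not something taken from the source page. The second function shows one possible alternative to the date filter: skipping any record whose booking identifier has already been imported.

```python
from datetime import date, datetime


def records_for_today(records, today=None, date_format="%m/%d/%Y"):
    """Keep only the rows whose date column matches today's date.

    `records` is a list of dicts with a hypothetical "date" key;
    `date_format` is an assumed format and must match the real page.
    """
    today = today or date.today()
    kept = []
    for row in records:
        try:
            row_date = datetime.strptime(row["date"].strip(), date_format).date()
        except (KeyError, ValueError):
            # Skip rows with a missing or unparseable date rather than failing.
            continue
        if row_date == today:
            kept.append(row)
    return kept


def drop_already_imported(records, imported_ids):
    """Alternative guard: skip records whose (hypothetical) booking id
    is already present in the set of previously imported ids."""
    return [row for row in records if row["booking_id"] not in imported_ids]
```

The date filter alone would miss records if a daily run is skipped, and would import a record twice if the script runs twice in one day, so combining it with an identifier check stored in a custom field may be more robust.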
One additional feature that would be nice: when the import is complete, the script loads the website homepage to trigger the image thumbnailer and create the thumbnails. This will minimize the chance that the first user hitting the site sees a missing thumbnail.
I would also like the script to have some level of error control to avoid hanging, and to notify me by email if the process fails during either parsing or import.
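As a rough illustration of the error control described above, the sketch below runs the job as a sequence of named stages, stops at the first failure, and hands the stage name and error to a notifier. It is again in Python only to show the shape of the logic; the function names, stage names, and email subject line are hypothetical. In the PHP version I would expect the equivalent of the fetch timeout to be set with `CURLOPT_CONNECTTIMEOUT` and `CURLOPT_TIMEOUT`, and the notification to go through `wp_mail()`.

```python
import urllib.request
from email.message import EmailMessage


def fetch(url, timeout_seconds=30):
    """Download a page, giving up after timeout_seconds so the job cannot hang."""
    with urllib.request.urlopen(url, timeout=timeout_seconds) as response:
        return response.read()


def build_failure_email(stage, error, sender, recipient):
    """Build (but do not send) the failure notification message."""
    msg = EmailMessage()
    msg["Subject"] = f"Arrest import failed during {stage}"
    msg["From"] = sender
    msg["To"] = recipient
    msg.set_content(f"Stage: {stage}\nError: {error}")
    return msg


def run_import(stages, notify):
    """Run (name, callable) stages in order.

    On the first exception, call notify(name, exception) and stop.
    Returns True only when every stage completed.
    """
    for name, step in stages:
        try:
            step()
        except Exception as exc:
            notify(name, exc)
            return False
    return True
```

A typical stage list would be something like fetching the listing, parsing it, importing the posts, and finally requesting the homepage to warm the thumbnails, with `notify` wrapping whatever mail function is available.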