I am a scientist at the USDA. We frequently submit DNA sequences to NCBI's GenBank, and we would now like to create a record of all the sequences we have submitted. I am looking for someone with Perl experience who can write a script to automate the process.
We have a list of GI numbers (the ID number assigned to every submitted sequence), and we want the script to create an Excel file with one row per submission: the GI number in the first column, followed by the name of the submission and the specimen voucher parameter. This could be done manually, but we have thousands of submissions.
Every submission looks like this:
[url removed, login to view]
In this example we would want the Excel sheet to read:
410445619 | Melanconiella elegans... | BPI 872067
I was told by NCBI that this would be an easy programming task (a few hours) because the submissions are stored in a database that is designed to be accessed through scripts. Please contact me with questions. It should be a really straightforward task, but we just don't have any programmers on hand. I'd prefer to work out a fixed price for this since I think it will only take a few hours.
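To make the request concrete, here is a minimal sketch of the per-record logic, written in Python purely for illustration (the actual deliverable should still be Perl). It assumes the records are fetched as GenBank flat-file text through NCBI's E-utilities `efetch` endpoint from the `nuccore` database, that the submission name is the `DEFINITION` field, and that the voucher is the `/specimen_voucher` qualifier. The function names are hypothetical, and it writes CSV (which Excel opens directly) as a stand-in for a native Excel file.

```python
import csv
import re
import urllib.parse
import urllib.request

# NCBI E-utilities efetch endpoint (assumed to be the access route NCBI meant).
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def fetch_genbank_record(gi):
    """Download one record as GenBank flat-file text (hypothetical helper)."""
    query = urllib.parse.urlencode(
        {"db": "nuccore", "id": gi, "rettype": "gb", "retmode": "text"}
    )
    with urllib.request.urlopen(f"{EFETCH_URL}?{query}") as response:
        return response.read().decode("utf-8")


def parse_record(text):
    """Return (submission name, specimen voucher) from flat-file text.

    Missing fields come back as empty strings.
    """
    definition_parts = []
    in_definition = False
    for line in text.splitlines():
        if line.startswith("DEFINITION"):
            in_definition = True
            definition_parts.append(line[len("DEFINITION"):].strip())
        elif in_definition and line.startswith(" "):
            # Indented lines continue the DEFINITION field.
            definition_parts.append(line.strip())
        else:
            in_definition = False
    title = " ".join(definition_parts)

    match = re.search(r'/specimen_voucher="([^"]*)"', text)
    # Collapse whitespace in case the qualifier value wraps across lines.
    voucher = " ".join(match.group(1).split()) if match else ""
    return title, voucher


def rows_to_csv(rows, handle):
    """Write (GI, name, voucher) rows with a header to an open file handle."""
    writer = csv.writer(handle)
    writer.writerow(["GI number", "Submission name", "Specimen voucher"])
    writer.writerows(rows)
```

A driver would read the GI list, call `fetch_genbank_record` and `parse_record` for each entry, and pass the collected rows to `rows_to_csv` with a file opened using `newline=""`. With thousands of submissions, requests should be paced or batched in line with NCBI's usage guidelines.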